If you happen to be looking for bad redactions in a large set of data files today for some reason, there's an open source tool for that.
If you happen to be looking for bad redactions in a large set of data files today for some reason, there's an open source tool for that.
@evacide interesting, however there could perhaps be edge cases it misses such as non-rectangular regions
another good tool to use would be smth that just extracts all the text in a pdf document which you can then grep against
@evacide PDF is a truly horrible format inside, an absolute nightmare to edit. That is why even simple redactions are so difficult. Also why it breaks screen readers.
Each page is actually a little program in a language related to but not the same as postscript, containing instructions to draw the page. Instructions in an arbitrary order decided by the program that generated the PDF, which bears no connection the reading order or layout of the document.
Such a useful tool.
On an unrelated note. I hope its use will not be in a manner that a legal team could come after them for attempting to gain access to classified information or to expose victims.
Stay safe!