An ongoing problem we encounter as we gather court data is that people routinely fail to properly redact documents. Instead of doing it the right way,

X-Ray Bad Redaction Detector

submited by
Style Pass
2024-04-24 21:00:18

An ongoing problem we encounter as we gather court data is that people routinely fail to properly redact documents. Instead of doing it the right way, people draw a black rectangle or a black highlight on top of black text.

When this happens it is trivial to reveal the badly redacted text under the rectangle. To do so, you simply select the text that remains in the document and copy/paste it somewhere else.

We have run X-Ray across millions of PDFs in our system and are using the results of that research to educate the public about the prevalence of this problem.

By releasing this tool as a well-maintained open source utility, we are making it as easy as possible for law firms, courts, and others to get ahead of this problem, before yet another badly-redacted document is made public.

At present, X-Ray supports only the most basic (and most common) type of bad redaction, rectangles on top of text. There are a variety of other types of bad redaction though, and we hope to add additional features as this tool gains more usage.

Leave a Comment