Method and apparatus to provide a unified redaction system
First Claim
Patent Images
1. A method comprising:
- enabling a user to define a redaction of a part of a document in a corpus of documents, the redaction definition including a scope defining a range of documents in the corpus to which the redaction applies, wherein the document is produced as a bitmap image file in which a redacted region appears as a region of black pixel data;
creating a temporary image file representing an unredacted version of the document;
creating a temporary image file representing a redacted version of the document utilizing custom fonts in rendering which result in foreground and background colors of redaction regions being an inverse of the foreground and the background colors of fonts used for non-redaction regions;
performing XOR operations between corresponding sections in the temporary image files of the unredacted and the redacted documents; and
creating a mapping between a redacted token and pixel space bounds of the redacted token, thereby creating the region of black pixel data.
8 Assignments
0 Petitions
Accused Products
Abstract
A method comprising enabling a user to define a redaction of a part of a document in a corpus of documents, the redaction definition including a scope defining a range of documents in the corpus to which the redaction applies.
156 Citations
58 Claims
-
1. A method comprising:
-
enabling a user to define a redaction of a part of a document in a corpus of documents, the redaction definition including a scope defining a range of documents in the corpus to which the redaction applies, wherein the document is produced as a bitmap image file in which a redacted region appears as a region of black pixel data; creating a temporary image file representing an unredacted version of the document; creating a temporary image file representing a redacted version of the document utilizing custom fonts in rendering which result in foreground and background colors of redaction regions being an inverse of the foreground and the background colors of fonts used for non-redaction regions; performing XOR operations between corresponding sections in the temporary image files of the unredacted and the redacted documents; and creating a mapping between a redacted token and pixel space bounds of the redacted token, thereby creating the region of black pixel data. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55)
-
-
56. A method comprising:
-
enabling a user to define a redaction of a part of a document in a corpus of documents, the redaction definition including a scope defining a range of documents in the corpus to which the redaction applies, wherein the document is produced as a bitmap image file in which a redacted text appears as a region of black pixel data; creating a temporary image file representing an unredacted version of the document; creating a temporary image file representing a redacted version of the document utilizing a character transposition on the redacted text that is calculated to match rendered bounds of the unredacted version of the text; using a region detection process which accounts for page orientation and text reading direction to produce a pixel space representation of relevant text tokens on any given page of the document; performing XOR operations between each corresponding section in the image files of the unredacted and redacted versions of the document; grouping differences produced by the XOR operations into the detected token regions; filling the affected token regions with black pixel data; and creating a mapping between a redacted token and pixel space bounds of the redacted token. - View Dependent Claims (57, 58)
-
Specification