Method, system and computer program for redaction of material from documents
First Claim
Patent Images
1. A computer implemented method of redacting content from a document in portable document format (PDF) comprising a PDF data stream, comprising the steps of:
- selecting a first geometric area on the document for redaction, said first geometric area having content comprising at least one image;
selecting a second geometric area on the document for redaction, said second geometric area having content comprising text;
representing said geometric areas as annotation objects;
parsing said document into one or more content objects representing content and location and nature of content in said document, said one or more content objects comprising one or more text occurrence objects and one or more image occurrence objects;
identifying content from said one or more content objects having the same geometric location as said annotation objects; and
creating an output PDF file comprising said PDF data stream except for portions of said PDF data stream corresponding to said identified content, wherein a redacted document is producible from said output PDF file,wherein said selecting step comprises;
displaying all or a portion of the document; and
manipulating a movable viewing frame superimposed on the displayed document, content having a geographic location within said frame being visible to the user during said step of manipulation.
2 Assignments
0 Petitions
Accused Products
Abstract
A method of redacting content from a document in electronic form includes the steps of selecting a geometric area on the document for redaction, representing the selected geometric area as one or more annotation objects, identifying information in the document representing content and location and nature of content, representing the identified information as one or more content objects, identifying content having the same geometric location as the annotation objects and creating a file with the identified content removed to produce a redacted document.
88 Citations
14 Claims
-
1. A computer implemented method of redacting content from a document in portable document format (PDF) comprising a PDF data stream, comprising the steps of:
-
selecting a first geometric area on the document for redaction, said first geometric area having content comprising at least one image; selecting a second geometric area on the document for redaction, said second geometric area having content comprising text; representing said geometric areas as annotation objects; parsing said document into one or more content objects representing content and location and nature of content in said document, said one or more content objects comprising one or more text occurrence objects and one or more image occurrence objects; identifying content from said one or more content objects having the same geometric location as said annotation objects; and creating an output PDF file comprising said PDF data stream except for portions of said PDF data stream corresponding to said identified content, wherein a redacted document is producible from said output PDF file, wherein said selecting step comprises; displaying all or a portion of the document; and manipulating a movable viewing frame superimposed on the displayed document, content having a geographic location within said frame being visible to the user during said step of manipulation. - View Dependent Claims (2, 3)
-
-
4. A method of redacting content from a document in portable document format (PDF) comprising a PDF data stream, comprising the steps of:
-
selecting at least one geometric area on the document for redaction, said geometric area having content comprising at least one image and text; representing said geometric area as one or more annotation objects; identifying information in the document representing content and location and nature of content; representing said identified information as one or more content objects, said one or more content objects comprising one or more image occurrence objects and one or more text occurrence objects; identifying content having the same geometric location as said annotation objects; removing said identified content; and creating an electronic output PDF file comprising said PDF data stream except for portions of said PDF data stream corresponding to said identified content, a redacted document being producible from said output PDF file for display, wherein said information identifying step comprises; displaying all or a portion of the document; and manipulating a movable viewing frame superimposed on the displayed document, content having a geographic location within said frame being visible to the user during said step of manipulation. - View Dependent Claims (5, 6)
-
-
7. A storage medium having stored therein a plurality of instructions, wherein the plurality of instructions, when executed by a processor, cause the processor to perform the steps of:
-
permitting a user to select at least one geometric area on a document for redaction, said geometric area having content comprising at least one image and text, said document being in portable document format (PDF) comprising a PDF data stream; representing said geometric area as one or more annotation objects; identifying information in the document representing content and location and nature of content, said content comprising at least one image and text; representing said identified information as one or more content objects, said one or more content objects comprising one or more image occurrence objects and one or more text occurrence objects; identifying content having the same geometric location as said annotation objects; removing said identified content; and creating an electronic output PDF file comprising said PDF data stream except for portions of said PDF data stream corresponding to said identified content, a redacted document being producible from said output PDF file, wherein the information identifying step comprises; displaying all or a portion of the document; and manipulating a movable viewing frame superimposed on the displayed document, content having a geographic location within said frame being visible to the user during said step of manipulation. - View Dependent Claims (8, 9, 10)
-
-
11. A system for redacting content from a document in portable document format (PDF) comprising a PDF data stream, comprising:
-
means for permitting a user to select at least one geometric area on the document for redaction, said geometric area having content comprising at least one image and text; means for representing said geometric area as one or more annotation objects; means for identifying information in the document representing content and location and nature of content, said content comprising at least one image and text; means for representing said identified information as one or more content objects, said one or more content objects comprising one or more image occurrence objects and one or more text occurrence objects; and means for identifying content having the same geometric location as said annotation objects; means for removing said identified content, and means for creating an electronic output PDF file comprising said PDF data stream except for portions of said PDF data stream corresponding to said identified content, a redacted document being producible from said output file, wherein the information identifying means comprises; means for displaying all or a portion of the document; and means for manipulating a movable viewing frame superimposed on the displayed document, content having a geographic location within said frame being visible to the user during said step of manipulation. - View Dependent Claims (12, 13, 14)
-
Specification