Electronic document content redaction
First Claim
Patent Images
1. A method, comprising:
- identifying, by a computing device, two or more layers in an electronic document;
processing a first layer of the identified layers, by performing optical character recognition of one or more images comprised by the first layer to produce a first layer text comprising one or more character strings representing respective images;
processing a second layer of the identified layers, to produce a second layer text;
combining the first layer text and the second layer text to produce a combined text of the electronic document;
identifying, within the combined text of the electronic document, a target character string corresponding, in view of a specified search function, to a specified character string;
redacting, from the electronic document, the target character string;
visually rendering the combined text of the electronic document; and
receiving an input confirming redaction of the target character string from the combined text of the electronic document.
4 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods for redacting certain content (e.g., content representing private, privileged, confidential, or otherwise sensitive information) from electronic documents. An example method comprises: identifying, by a computing device, two or more layers in an electronic document; processing each of the identified layers to produce a layer text representing one or more objects comprised by the layer; combining the produced layer texts to produce a combined text of the electronic document; and identifying, within the combined text of the electronic document, a target character string corresponding, in view of a specified search function, to a specified character string.
-
Citations
10 Claims
-
1. A method, comprising:
-
identifying, by a computing device, two or more layers in an electronic document; processing a first layer of the identified layers, by performing optical character recognition of one or more images comprised by the first layer to produce a first layer text comprising one or more character strings representing respective images; processing a second layer of the identified layers, to produce a second layer text; combining the first layer text and the second layer text to produce a combined text of the electronic document; identifying, within the combined text of the electronic document, a target character string corresponding, in view of a specified search function, to a specified character string; redacting, from the electronic document, the target character string; visually rendering the combined text of the electronic document; and receiving an input confirming redaction of the target character string from the combined text of the electronic document. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A computing device, comprising:
-
a memory; a processor, coupled to the memory, to; identify two or more layers in an electronic document; process a first layer of the identified layers, by performing optical character recognition of one or more images comprised by the first layer to produce a first layer text comprising one or more character strings representing respective images; process a second layer of the identified layers, to produce a second layer text; combine the first layer text and the second layer text to produce a combined text of the electronic document; identify, within the combined text of the electronic document, a target character string corresponding, in view of a specified search function, to a specified character string; redact, from the electronic document, the target character string; visually render the combined text of the electronic document; and receive an input confirming redaction of the target character string from the combined text of the electronic document. - View Dependent Claims (7, 8)
-
-
9. A computer-readable non-transitory storage medium comprising executable instructions that, when executed by a computing device, cause the computing device to perform operations comprising:
-
identifying two or more layers in an electronic document; processing a first layer of the identified layers, by performing optical character recognition of one or more images comprised by the first layer to produce a first layer text comprising one or more character strings representing respective images; processing a second layer of the identified layers, to produce a second layer text; combining the first layer text and the second layer text to produce a combined text of the electronic document; identifying, within the combined text of the electronic document, a target character string corresponding, in view of a specified search function, to a specified character string; redacting, from the electronic document, the target character string; visually rendering the combined text of the electronic document; and receiving an input confirming redaction of the target character string from the combined text of the electronic document. - View Dependent Claims (10)
-
Specification