USER CORRECTION OF ERRORS ARISING IN A TEXTUAL DOCUMENT UNDERGOING OPTICAL CHARACTER RECOGNITION (OCR) PROCESS
First Claim
1. An image processing apparatus for performing optical character recognition, comprising:
- an input component for receiving a textual image of a document;
a segmentation component for detecting text and images in the document and identifying word positions;
a reading order component for arranging words into textual regions and arranging the textual regions in a correct reading order;
a text recognition component for recognizing words and computing text properties concerning individual words and textual lines;
a paragraph detection component for arranging textual lines which have been identified in the textual regions into paragraphs;
a user interface through which the user provides user input data, wherein the user input data corrects a first mischaracterized item appearing in the document after undergoing OCR; and
an error correction component for receiving the user input data and causing a first of the components in which an initial error producing the first mischaracterized item arose to correct the initial error, wherein the error correction component is further configured to cause components that process the image subsequent to the first component to correct consequential errors arising as a result of the initial error.
2 Assignments
0 Petitions
Accused Products
Abstract
An electronic model of the image document is created by undergoing an OCR process. The electronic model includes elements (e.g., words, text lines, paragraphs, images) of the image document that have been determined by each of a plurality of sequentially executed stages in the OCR process. The electronic model serves as input information which is supplied to each of the stages by a previous stage that processed the image document. A graphical user interface is presented to the user so that the user can provide user input data correcting a mischaracterized item appearing in the document. Based on the user input data, the processing stage which produced the initial error that gave rise to the mischaracterized item corrects the initial error. Stages of the OCR process subsequent to this stage then correct any consequential errors arising in their respective stages as a result of the initial error.
-
Citations
20 Claims
-
1. An image processing apparatus for performing optical character recognition, comprising:
-
an input component for receiving a textual image of a document; a segmentation component for detecting text and images in the document and identifying word positions; a reading order component for arranging words into textual regions and arranging the textual regions in a correct reading order; a text recognition component for recognizing words and computing text properties concerning individual words and textual lines; a paragraph detection component for arranging textual lines which have been identified in the textual regions into paragraphs; a user interface through which the user provides user input data, wherein the user input data corrects a first mischaracterized item appearing in the document after undergoing OCR; and an error correction component for receiving the user input data and causing a first of the components in which an initial error producing the first mischaracterized item arose to correct the initial error, wherein the error correction component is further configured to cause components that process the image subsequent to the first component to correct consequential errors arising as a result of the initial error. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A method for correcting a textual image document that has undergone optical character recognition (OCR), comprising:
-
receiving an electronic model of the image document after it has undergone an OCR process, the electronic model including elements of the image document that have been determined by each of a plurality of sequentially executed stages in the OCR process, wherein the electronic model serves as input information that is supplied to each of the stages by a previous stage that processed the image document; presenting a graphical user interface to a user that receives user input data correcting a first mischaracterized item appearing in the document after undergoing OCR; based at least in part on the user input data, causing a first of the stages of the OCR process that produced an initial error that gave rise to the first mischaracterized item to correct the initial error; and causing stages of the OCR process subsequent to the first stage to correct consequential errors arising in their respective stages as a result of the initial error. - View Dependent Claims (13, 14, 15, 16, 17)
-
-
18. A medium comprising instructions executable by a computing system, wherein the instructions configure the computing system to perform a method for correcting a textual image of a document that has undergone OCR, comprising:
-
receiving an electronic model of the image after it has undergone an OCR process, the electronic model including elements of the image that have been determined by each of a plurality of sequentially executed stages in the OCR process, wherein the electronic model serves as input information that is supplied to each of the stages by a previous stage that processed the image document; based on user input data that corrects mischaracterized items in the image after it has undergone the OCR process, identifying a first stage of the OCR process that produced an initial error that gave rise to the first mischaracterized item; correcting the initial error by re-executing the first stage of the OCR process at least in part; and correcting consequential errors arising in stages of the OCR process subsequent to the first stage as a result of the initial error. - View Dependent Claims (19, 20)
-
Specification