USER CORRECTION OF ERRORS ARISING IN A TEXTUAL DOCUMENT UNDERGOING OPTICAL CHARACTER RECOGNITION (OCR) PROCESS

US 20110280481A1
Filed: 05/17/2010
Published: 11/17/2011
Est. Priority Date: 05/17/2010
Status: Abandoned Application

First Claim

Patent Images

1. An image processing apparatus for performing optical character recognition, comprising:

an input component for receiving a textual image of a document;

a segmentation component for detecting text and images in the document and identifying word positions;

a reading order component for arranging words into textual regions and arranging the textual regions in a correct reading order;

a text recognition component for recognizing words and computing text properties concerning individual words and textual lines;

a paragraph detection component for arranging textual lines which have been identified in the textual regions into paragraphs;

a user interface through which the user provides user input data, wherein the user input data corrects a first mischaracterized item appearing in the document after undergoing OCR; and

an error correction component for receiving the user input data and causing a first of the components in which an initial error producing the first mischaracterized item arose to correct the initial error, wherein the error correction component is further configured to cause components that process the image subsequent to the first component to correct consequential errors arising as a result of the initial error.

View all claims

2 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

An electronic model of the image document is created by undergoing an OCR process. The electronic model includes elements (e.g., words, text lines, paragraphs, images) of the image document that have been determined by each of a plurality of sequentially executed stages in the OCR process. The electronic model serves as input information which is supplied to each of the stages by a previous stage that processed the image document. A graphical user interface is presented to the user so that the user can provide user input data correcting a mischaracterized item appearing in the document. Based on the user input data, the processing stage which produced the initial error that gave rise to the mischaracterized item corrects the initial error. Stages of the OCR process subsequent to this stage then correct any consequential errors arising in their respective stages as a result of the initial error.

Citations

20 Claims

1. An image processing apparatus for performing optical character recognition, comprising:
- an input component for receiving a textual image of a document;
  
  a segmentation component for detecting text and images in the document and identifying word positions;
  
  a reading order component for arranging words into textual regions and arranging the textual regions in a correct reading order;
  
  a text recognition component for recognizing words and computing text properties concerning individual words and textual lines;
  
  a paragraph detection component for arranging textual lines which have been identified in the textual regions into paragraphs;
  
  a user interface through which the user provides user input data, wherein the user input data corrects a first mischaracterized item appearing in the document after undergoing OCR; and
  
  an error correction component for receiving the user input data and causing a first of the components in which an initial error producing the first mischaracterized item arose to correct the initial error, wherein the error correction component is further configured to cause components that process the image subsequent to the first component to correct consequential errors arising as a result of the initial error.
- View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
- - 2. The image processing apparatus of claim 1 wherein the first of the components is further configured to automatically correct other errors that give rise to other mischaracterized items of a same type as the first mischaracterized item.
  - 3. The image processing apparatus of claim 1 wherein the user interface includes a menu of preselected error types from which the user selects as part of the user input data.
  - 4. The image processing apparatus of claim 3 wherein the preselected error types include a plurality of error types selected from the group consisting of a text region error, a paragraph region error, a paragraph end error, a text line error, a word error and an image region error.
  - 5. The image processing apparatus of claim 1 wherein the user input includes selection of a first error type and, based at least in part on the first error type, the error correction component causes one or more selected components to be re-executed at least in part to correct the initial error.
  - 6. The image processing apparatus of claim 1 wherein the user interface includes a display in which a portion of the textual image is presented after undergoing OCR, said user interface being configured to receive user input correcting the first mischaracterized item and to recognize a user area portion of the display corresponding to the section of the textual image corrected by the user input.
  - 7. The image processing apparatus of claim 1 wherein the consequential errors are corrected in a manner that is consistent with mischaracterized items previously corrected by the user.
  - 8. The image processing apparatus of claim 1 further comprising a memory component for storing an electronic model of the image document, wherein the electronic model includes elements of the image document that are determined by each of the components, and further wherein the electronic model serves as input information that is supplied to each of the components by a previous component that processed the image document.
  - 9. The image processing apparatus of claim 8 wherein the error correction component causes consequential errors arising in the text recognition component to be corrected by incrementally re-executing the text recognition component to process only elements that have been changed.
  - 10. The image processing apparatus of claim 8 wherein the electronic model includes an attribute associated with each of the elements, wherein each of the attributes specifies a confidence level associated with the respective element with which the attribute is associated.
  - 11. The image processing apparatus of claim 10 wherein the initial error arises in at least one of the elements included in the electronic model, wherein the correction component assigns a maximum value to the confidence level of one or more attributes associated with the at least one element after the initial error has been corrected.

12. A method for correcting a textual image document that has undergone optical character recognition (OCR), comprising:
- receiving an electronic model of the image document after it has undergone an OCR process, the electronic model including elements of the image document that have been determined by each of a plurality of sequentially executed stages in the OCR process, wherein the electronic model serves as input information that is supplied to each of the stages by a previous stage that processed the image document;
  
  presenting a graphical user interface to a user that receives user input data correcting a first mischaracterized item appearing in the document after undergoing OCR;
  
  based at least in part on the user input data, causing a first of the stages of the OCR process that produced an initial error that gave rise to the first mischaracterized item to correct the initial error; and
  
  causing stages of the OCR process subsequent to the first stage to correct consequential errors arising in their respective stages as a result of the initial error.
- View Dependent Claims (13, 14, 15, 16, 17)
- - 13. The method of claim 12 wherein presenting the graphical user interface includes requesting the user to categorize an error type to which the mischaracterized item belongs.
  - 14. The method of claim 12 further comprising causing the first stage to correct other errors that give rise to other mischaracterized items of the same time as the first mischaracterized item.
  - 15. The method of claim 12 wherein the user interface includes a menu of preselected error types from which the user selects as part of the user input data.
  - 16. The method of claim 15 wherein the preselected error types include a plurality of error types selected from the group consisting of a text region error, a paragraph region error, a paragraph end error, a text line error, a word error and an image region error.
  - 17. The method of claim 13 further comprising:
    - receiving user input data that includes selection of a first error type; and
      
      based at least in part on the first error type, causing one or more selected components to be re-executed at least in part to correct the initial error.

18. A medium comprising instructions executable by a computing system, wherein the instructions configure the computing system to perform a method for correcting a textual image of a document that has undergone OCR, comprising:
- receiving an electronic model of the image after it has undergone an OCR process, the electronic model including elements of the image that have been determined by each of a plurality of sequentially executed stages in the OCR process, wherein the electronic model serves as input information that is supplied to each of the stages by a previous stage that processed the image document;
  
  based on user input data that corrects mischaracterized items in the image after it has undergone the OCR process, identifying a first stage of the OCR process that produced an initial error that gave rise to the first mischaracterized item;
  
  correcting the initial error by re-executing the first stage of the OCR process at least in part; and
  
  correcting consequential errors arising in stages of the OCR process subsequent to the first stage as a result of the initial error.
- View Dependent Claims (19, 20)
- - 19. The medium of claim 18 wherein correcting the consequential errors comprises correcting the consequential errors arising in the stages of the OCR process subsequent to the first stage as a result of the initial error by re-executing at least in part the respective stages in which the respective consequential errors arise.
  - 20. The medium of claim 19 wherein at least one of the respective stages that is re-executed is incrementally re-executed to only process elements of the electronic model that have changed as a result of correcting the initial error.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Microsoft Technology Licensing LLC (Microsoft Corporation)
Original Assignee
Microsoft Corporation
Inventors
Dresevic, Bodin, Vugdelija, Milan, Radakovic, Bogdan, Uzelac, Aleksandar, Todic, Nikola

Application Number

US12/780,991
Publication Number

US 20110280481A1
Time in Patent Office

Days
Field of Search
US Class Current

382/177
CPC Class Codes

G06V 10/987 with the intervention of an...

USER CORRECTION OF ERRORS ARISING IN A TEXTUAL DOCUMENT UNDERGOING OPTICAL CHARACTER RECOGNITION (OCR) PROCESS

First Claim

2 Assignments

0 Petitions

Accused Products

Abstract

Citations

20 Claims

Specification

Solutions

Use Cases

Quick Links

USER CORRECTION OF ERRORS ARISING IN A TEXTUAL DOCUMENT UNDERGOING OPTICAL CHARACTER RECOGNITION (OCR) PROCESS

First Claim

2 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

20 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links