USER CORRECTION OF ERRORS ARISING IN A TEXTUAL DOCUMENT UNDERGOING OPTICAL CHARACTER RECOGNITION (OCR) PROCESS

  • US 20110280481A1
  • Filed: 05/17/2010
  • Published: 11/17/2011
  • Est. Priority Date: 05/17/2010
  • Status: Abandoned Application
First Claim

1. An image processing apparatus for performing optical character recognition, comprising:

    an input component for receiving a textual image of a document;

    a segmentation component for detecting text and images in the document and identifying word positions;

    a reading order component for arranging words into textual regions and arranging the textual regions in a correct reading order;

    a text recognition component for recognizing words and computing text properties concerning individual words and textual lines;

    a paragraph detection component for arranging textual lines which have been identified in the textual regions into paragraphs;

    a user interface through which the user provides user input data, wherein the user input data corrects a first mischaracterized item appearing in the document after undergoing OCR; and

    an error correction component for receiving the user input data and causing a first of the components in which an initial error producing the first mischaracterized item arose to correct the initial error, wherein the error correction component is further configured to cause components that process the image subsequent to the first component to correct consequential errors arising as a result of the initial error.
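
The correction flow recited in the claim can be illustrated as a staged pipeline. What follows is a minimal Python sketch under stated assumptions, not the applicant's implementation: the names `Pipeline`, `make_stage`, `correct`, and the shared `state` dictionary are hypothetical, and the stage bodies are placeholders that only record the corrections handed to them. It assumes the components run in the order the claim lists them, and that re-running each later component is one way to correct consequential errors.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

# Stage names follow the order in which the claim lists the components.
STAGES = ["segmentation", "reading_order", "text_recognition", "paragraph_detection"]


def make_stage(name: str) -> Callable[[dict, list], None]:
    """Build a placeholder stage (hypothetical) that records the fixes it was given."""
    def stage(state: dict, fixes: list) -> None:
        state[name] = {"applied_fixes": list(fixes)}
    return stage


@dataclass
class Pipeline:
    # Maps each stage name to a function taking (state, fixes_for_this_stage).
    stages: Dict[str, Callable[[dict, list], None]]
    log: List[str] = field(default_factory=list)

    def run(self, state: dict, start: str = STAGES[0], corrections: dict = None) -> dict:
        """Run every stage from `start` to the end, in claim order."""
        corrections = corrections or {}
        for name in STAGES[STAGES.index(start):]:
            self.stages[name](state, corrections.get(name, []))
            self.log.append(name)  # record which stages actually ran
        return state


def correct(pipeline: Pipeline, state: dict, origin_stage: str, user_input: str) -> dict:
    """Error correction sketch: the stage where the initial error arose is re-run
    with the user's input, then every later stage is re-run so that errors
    derived from the initial one are recomputed as well."""
    return pipeline.run(state, start=origin_stage,
                        corrections={origin_stage: [user_input]})


if __name__ == "__main__":
    pipeline = Pipeline({name: make_stage(name) for name in STAGES})
    state = pipeline.run({})          # initial OCR pass over all stages
    pipeline.log.clear()
    correct(pipeline, state, "text_recognition", "word 12 should read 'the'")
    print(pipeline.log)               # stages re-run after the correction
```

In this sketch, a correction addressed to the text recognition stage leaves segmentation and reading order untouched and re-runs only text recognition and paragraph detection. A real system would also have to decide which stage produced the error, which the sketch takes as given.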
