×

Post-processing system and method for correcting machine recognized text

  • US 20040086179A1
  • Filed: 11/04/2002
  • Published: 05/06/2004
  • Est. Priority Date: 11/04/2002
  • Status: Active Grant
First Claim
Patent Images

1. A post-processor for character data of an optical character recognition (OCR) engine comprising:

  • a word segmentation engine coupled to the OCR engine to segment the character data into a plurality of initial words;

    a word level processor coupled to the word segmentation engine to process the plurality of initial words and determine a set of candidate words corresponding to each initial word;

    a sentence segmentation engine coupled to the word level processor to segment the plurality of initial words into at least one sentence; and

    a word disambiguity processor coupled to the sentence segmentation engine to determine a final word from each set of candidate words;

    wherein the word disambiguity processor processes each sentence of the at least one sentence separately.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×