Method and apparatus for performing an automatic correction of misrecognized words produced by an optical character recognition technique by using a Hidden Markov Model based algorithm

  • US 6,219,453 B1
  • Filed: 08/11/1997
  • Issued: 04/17/2001
  • Est. Priority Date: 08/11/1997
  • Status: Expired due to Term
First Claim

1. A method of recognizing at least one word in a document, the word including at least one predetermined character, the method comprising the steps of:

    a) providing a recognized word based on the word in the document;

    b) determining whether the recognized word matches the word in the document, the recognized word comprising a misrecognized word in the absence of a match with the word in the document;

    c) if the determining step produces the misrecognized word, generating a set of reference words, wherein each one of the reference words comprises a different set of characters;

    d) determining for each of the reference words a corresponding replacement word value;

    e) prior to selecting the reference word for replacing the misrecognized word, reducing an amount of reference words from the set of reference words in order to form a subset of reference words from the set of reference words by eliminating according to an elimination operation any reference word that does not belong to a predetermined vocabulary, the subset of reference words being limited to only those reference words that belong to the predetermined vocabulary, the elimination operation being different than the generating of the set of reference words; and

    f) selecting from the subset of reference words one reference word for replacing the misrecognized word based on the replacement word values of the corresponding reference words in the subset of the reference words.
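
The flow of steps b) through f) can be illustrated with a minimal Python sketch. It is not the patented implementation: the claim does not say how reference words are generated or how replacement word values are computed, so the character confusion table `CONFUSION`, the product-of-probabilities score, the sample `VOCABULARY`, and the use of a vocabulary lookup as the match test in step b) are all hypothetical stand-ins chosen for illustration.

```python
from itertools import product

# Hypothetical confusion table (an assumption, not from the patent):
# recognized character -> {candidate true character: probability}.
CONFUSION = {
    "c": {"c": 0.6, "e": 0.4},
    "l": {"l": 0.7, "1": 0.2, "i": 0.1},
    "0": {"0": 0.3, "o": 0.7},
}

# Hypothetical predetermined vocabulary.
VOCABULARY = {"cat", "eat", "lot", "cot"}


def generate_reference_words(recognized):
    """Steps c) and d): build reference words and a value for each.

    Every combination of candidate characters yields one reference word.
    Its replacement word value is assumed here to be the product of the
    per-character probabilities.
    """
    options = [CONFUSION.get(ch, {ch: 1.0}).items() for ch in recognized]
    scored = {}
    for combo in product(*options):
        word = "".join(ch for ch, _ in combo)
        value = 1.0
        for _, prob in combo:
            value *= prob
        scored[word] = value
    return scored


def correct(recognized, vocabulary=VOCABULARY):
    """Return a replacement for `recognized`, or the input if none applies."""
    # Step b) stand-in: treat an in-vocabulary word as correctly recognized.
    if recognized in vocabulary:
        return recognized
    scored = generate_reference_words(recognized)
    # Step e): a separate elimination pass keeps only in-vocabulary words.
    subset = {w: v for w, v in scored.items() if w in vocabulary}
    if not subset:
        return recognized  # nothing survives; leave the word unchanged
    # Step f): select the surviving reference word with the highest value.
    return max(subset, key=subset.get)
```

With these assumed tables, `correct("c0t")` generates four reference words (`c0t`, `cot`, `e0t`, `eot`), eliminates the three that are not in the vocabulary, and selects `cot`. Keeping the vocabulary filter as its own pass after generation mirrors the claim's requirement that the elimination operation be distinct from generating the set of reference words.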

  • 0 Assignments