
Confusion matrix based method and system for correcting misrecognized words appearing in documents generated by an optical character recognition technique

  • US 6,154,579 A
  • Filed: 08/11/1997
  • Issued: 11/28/2000
  • Est. Priority Date: 08/11/1997
  • Status: Expired due to Term
First Claim

1. A method of recognizing at least one word in a document, the word including at least one predetermined character, the method comprising the steps of:

  a) providing a recognized word based on the word in the document;

  b) determining whether the recognized word is correct;

  c) generating, if the recognized word is incorrect, at least one reference word, each reference word comprising a different set of predetermined characters and being generated according to a process that is independent of a content of at least one confusion matrix;

  d) determining for each reference word a corresponding replacement word value based on a calculation involving a mathematical function being applied to a content of the at least one confusion matrix; and

  e) replacing the incorrect recognized word with the reference word most likely matching the at least one word in the document based on the corresponding replacement word value.
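
As an illustration only, steps a) through e) could be sketched roughly as follows. This is not the patent's implementation. The claim does not specify how correctness is checked, how reference words are generated, or which mathematical function is used, so the sketch assumes a dictionary lookup for step b), same-length dictionary words for step c), and a product of per-character confusion probabilities for step d). All names and values here (`CONFUSION`, `DICTIONARY`, `correct_word`, the probabilities) are hypothetical.

```python
from math import prod

# Hypothetical confusion matrix (assumed values, not from the patent):
# CONFUSION[true_char][recognized_char] = P(recognized_char | true_char).
CONFUSION = {
    "c": {"c": 0.8, "e": 0.2},
    "e": {"e": 0.9, "c": 0.1},
    "a": {"a": 0.9, "o": 0.1},
    "o": {"o": 0.8, "a": 0.2},
    "t": {"t": 1.0},
    "r": {"r": 1.0},
}

# Hypothetical word list used both to check correctness and to
# generate reference words.
DICTIONARY = {"cat", "cot", "eat", "rat"}


def is_correct(recognized):
    """Step b): assume a word is correct if it appears in the dictionary."""
    return recognized in DICTIONARY


def generate_reference_words(recognized):
    """Step c): candidates chosen without consulting the confusion matrix.

    Assumption: every dictionary word of the same length is a candidate.
    """
    return sorted(w for w in DICTIONARY if len(w) == len(recognized))


def replacement_value(reference, recognized):
    """Step d): product of P(recognized_char | reference_char) per position.

    Character pairs missing from the matrix are treated as probability 0.
    """
    return prod(
        CONFUSION.get(true_ch, {}).get(seen_ch, 0.0)
        for true_ch, seen_ch in zip(reference, recognized)
    )


def correct_word(recognized):
    """Steps b) to e): return the recognized word or its best replacement."""
    if is_correct(recognized):
        return recognized
    candidates = generate_reference_words(recognized)
    if not candidates:
        return recognized
    scores = {w: replacement_value(w, recognized) for w in candidates}
    best = max(candidates, key=scores.get)
    # Keep the original word if no candidate has a nonzero value.
    return best if scores[best] > 0 else recognized


# Step a): "eot" stands in for the output of an OCR engine.
print(correct_word("eot"))
```

With these assumed probabilities, "cot" scores highest for the input "eot" because the matrix treats "c" read as "e" as more likely than "a" read as "o". A real system would estimate the matrix from observed recognition errors.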
