Character recognition method and apparatus

  • US 5,768,451 A
  • Filed: 06/02/1997
  • Issued: 06/16/1998
  • Est. Priority Date: 12/22/1993
  • Status: Expired due to Fees
First Claim

1. A character recognition method comprising the steps of:

  • inputting a digital image including an image of a character sequence;

    recognizing the image of the character sequence as a sequence of separated character patterns;

    outputting a group of candidate characters, for each of the character patterns, along with a numerical confidence ratio value given by an optical character reader for each output candidate character in the group, wherein the confidence ratio value provides the certainty of each candidate character as a recognition result of each of the character patterns;

    extracting a group of candidate words from a word dictionary by using keys, each candidate word starting at a character position p, each key being a candidate character sequence including characters taken from the candidate characters corresponding to each position starting at the character position p;

    calculating a word matching cost, for each of the extracted candidate words, by executing a word matching of each candidate word, wherein the word matching cost c(w) for w=C1, C2, . . . , Cm is represented as a sum of character matching costs of characters composing the word, wherein the character matching cost of Ci is a positive value calculated from Ci and Si={(Ci1, Vi1), (Ci2, Vi2), . . . , (CiN, ViN)}, wherein Cij is the j-th candidate character for Ci and Vij is its confidence ratio;

    detecting a most feasible candidate word sequence in accordance with a candidate word set, the calculated word matching cost for each candidate word and a grammatical knowledge relating to a connection cost between two words; and

    displaying the character sequence generated by connecting the most feasible candidate word sequence;

    wherein the word dictionary contains pairs of entries forming the character sequence and the grammatical knowledge; and

    wherein each of the keys is generated from the candidate characters and combinations of the candidate characters, up to a predetermined length L, starting from the position p, words up to the length L being extracted from the dictionary by coincidence with the entries in said dictionary, and words having a length larger than the length L being extracted from the dictionary by coincidence with the first L characters of the entries in the dictionary.
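
The key generation, dictionary extraction, and word matching cost steps recited above can be illustrated with a minimal Python sketch. It is not the patented implementation. The sample OCR candidates, the dictionary contents, the value of L, the function names, and the cost formula (one minus the confidence ratio, with a fixed penalty when a character is not among the candidates) are all assumptions chosen for illustration; the claim only requires that each character matching cost be a positive value computed from Ci and Si. The search for the most feasible word sequence using connection costs is not shown.

```python
from itertools import product

# Hypothetical OCR output: one list per character position, holding
# (candidate character, confidence ratio) pairs. All values are invented.
candidates = [
    [("c", 0.9), ("e", 0.4)],
    [("a", 0.8), ("o", 0.5)],
    [("t", 0.7), ("l", 0.6)],
    [("s", 0.9), ("5", 0.2)],
]

# Hypothetical word dictionary: entry -> grammatical knowledge,
# reduced here to a part-of-speech tag.
dictionary = {"cat": "noun", "cats": "noun", "cot": "noun",
              "eat": "verb", "a": "article"}

L = 2            # predetermined maximum key length (assumed value)
MISS_COST = 2.0  # assumed penalty when a character is not a candidate
FLOOR = 0.01     # assumed lower bound that keeps every cost positive


def generate_keys(cands, p, max_len):
    """Build every candidate character sequence of length 1..max_len at p."""
    keys = set()
    for n in range(1, max_len + 1):
        if p + n > len(cands):
            break
        slots = [[ch for ch, _ in cands[p + i]] for i in range(n)]
        keys.update("".join(combo) for combo in product(*slots))
    return keys


def extract_words(cands, p, max_len, lexicon):
    """Extract candidate words starting at position p.

    Words of length <= max_len must coincide with a key exactly; longer
    words are extracted when their first max_len characters coincide
    with a key.
    """
    keys = generate_keys(cands, p, max_len)
    remaining = len(cands) - p
    found = set()
    for word in lexicon:
        if len(word) > remaining:
            continue  # the word would run past the end of the input
        probe = word if len(word) <= max_len else word[:max_len]
        if probe in keys:
            found.add(word)
    return found


def char_cost(ch, slot):
    """Positive character matching cost of ch against one candidate group."""
    for cand, conf in slot:
        if cand == ch:
            return max(FLOOR, 1.0 - conf)
    return MISS_COST


def word_cost(word, cands, p):
    """Word matching cost c(w): the sum of its character matching costs."""
    return sum(char_cost(ch, cands[p + i]) for i, ch in enumerate(word))


words = extract_words(candidates, 0, L, dictionary)
for w in sorted(words, key=lambda w: word_cost(w, candidates, 0)):
    print(w, dictionary[w], round(word_cost(w, candidates, 0), 2))
```

Because words longer than L are matched only on their first L characters, the extraction step can return words whose later characters are not among the OCR candidates. In this sketch such characters receive the assumed `MISS_COST`, so those words rank lower in the matching step instead of being rejected during extraction.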
