×

Method and apparatus for character recognition

  • US 6,341,176 B1
  • Filed: 11/13/1997
  • Issued: 01/22/2002
  • Est. Priority Date: 11/20/1996
  • Status: Expired due to Fees
First Claim
Patent Images

1. A character recognizing method in which reference text data to be referred to character recognition and an index file of the reference text data are provided, the method comprising the steps of:

  • recognizing an input character image indicating an input character of an input document as one or more conversion candidate characters denoting candidates for the input character for each of input character images indicating input characters of the input document, the one or more conversion candidate characters each being composed of text data;

    selecting a series of search character images indicating a series of search input characters from the series of input character images;

    selecting a plurality of particular conversion candidate character strings respectively corresponding to the series of search character images from the particular conversion candidate characters;

    searching the reference text data, by using a full text searching technique based on the index file of the reference text data, for one or more particular character strings respectively agreeing with one particular conversion candidate character string for each of the particular conversion candidate character strings to count the number of particular character strings as an occurrence frequency of the particular conversion candidate character string in the reference text data for each of the particular conversion candidate character strings;

    selecting a specific particular conversion candidate character string corresponding to the highest occurrence frequency among those of the particular conversion candidate character strings from the particular conversion candidate character strings; and

    determining a series of specific particular conversion candidate characters composing the specific particular conversion candidate character string as a series of correct characters for the series of search character images.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×