×

Character recognition method using correlation search

  • US 5,150,425 A
  • Filed: 08/30/1991
  • Issued: 09/22/1992
  • Est. Priority Date: 08/30/1991
  • Status: Expired due to Term
First Claim
Patent Images

1. A method of character recognition, comprising the steps of:

  • 1) font training or creating a font of trained characters by;

    (a) acquiring an image composed of a two dimensional array of pixels;

    (b) locating all of the characters in the image by selectively scanning columns or rows of a predetermined area of the image and comparing each pixels intensity with a reference level to determine the first pixel of each character and recording the location (column and row coordinates) of such pixel and identifying the other pixels adjacent to the first whose intensity also exceeds the reference level and recording the upper left and lower right coordinates of a box bounding each character;

    (c) identifying (labeling) all located characters and entering such identified characters as trained characters in memory;

    (d) repeating steps (a) thru (c) for every image in the training set;

    (e) creating a correlation matrix by computing a correlation coefficient for every trained character with every other trained character and storing such matrix in memory;

    (f) defining a list of trained character possibilities for every unknown character to be recognized;

    (g) setting an upper limit on the number of trained characters that will be tested for each unknown character;

    2) recognizing unknown characters by;

    (h) acquiring a two dimensional array of pixels;

    (i) locating all unknown characters in a manner described in (b);

    (j) selecting one of the unknown characters found in step (i) and selecting an initial trained character from the list of trained character possibilities for the unknown character;

    (k) computing a correlation coefficient between the selected unknown character and the trained character;

    (l) performing step (v) and succeeding steps if the correlation coefficient exceeds a threshold;

    (m) performing step (n) and succeeding steps if the correlation coefficient does not exceed a threshold;

    (n) performing step (p) and succeeding steps if the limit on the number of trained characters to be tested defined in step (g) has been exceeded;

    (o) performing step (r) and succeeding steps if the limit on the number of trained characters to be tested defined in step (g) has not been exceeded;

    (p) determining that the unknown character is not recognizable;

    (q) repeating step (j) and succeeding steps until all unknown characters have been processed;

    (r) eliminating the previously selected trained character from the list of trained character possibilities for the unknown character;

    (s) searching along the row of the correlation matrix defined in step (e) corresponding to the most recently selected trained character for an entry that most closely matches the correlation coefficient found in step (k) and selecting as the next trained character to be tested the trained character corresponding to the column of this entry with the requirement that this trained character be contained in the list of trained character possibilities for the unknown character;

    (t) incrementing the count of the number of trained characters tested for the unknown character;

    (u) repeating steps (k) and all succeeding steps;

    (v) saving the currently selected trained character as a candidate character for classifying the unknown character;

    (w) eliminating the currently selected trained character from the list of trained character possibilities for the unknown character;

    (x) performing step (bb) and succeeding steps if there are more entries in the list of trained character possibilities for the unknown character;

    (y) performing step (z) and succeeding steps if there are no more entries in the list of trained character possibilities for the unknown character;

    (z) classifying the unknown character as the most recently defined candidate character;

    (aa) repeating step (j) and succeeding steps until all unknown characters have been processed;

    (bb) searching along the row of the correlation matrix defined in step (e) corresponding to the most recently defined candidate character for an entry that exceeds a threshold and selecting as the next trained character to be tested the trained character corresponding to the column of this entry with the requirement that this trained character be contained in the list of trained character possibilities for the unknown character;

    (cc) performing step (z) and all succeeding steps if a trained character can not be found in step (bb);

    (dd) performing step (ee) and all succeeding steps if a trained character can be found in step (bb);

    (ee) computing a correlation coefficient of the most recently selected trained character and the unknown character;

    (ff) replacing the candidate character with the most recently selected trained character if the correlation coefficient computed in step (ee) exceeds the correlation coefficient of the candidate character and the unknown character;

    (gg) removing the most recently selected trained character from the list of trained character possibilities for the unknown character;

    (hh) repeating step (x) and all succeeding steps.

View all claims
  • 5 Assignments
Timeline View
Assignment View
    ×
    ×