×

Character recognition machine utilizing language processing

  • US 5,987,170 A
  • Filed: 11/06/1997
  • Issued: 11/16/1999
  • Est. Priority Date: 09/28/1992
  • Status: Expired due to Term
First Claim
Patent Images

1. A character extraction machine comprising:

  • a character string image input portion for receiving an image consisting of a character string;

    a memory for holding the received character string;

    a projection profile histogram-calculating portion for calculating a concentration value of histogram which is obtained by counting pixels in the character string;

    an initial parameter-setting portion for receiving an output from said projection profile histogram-calculating portion and determining values of processing parameters from heights of the character string, lengths of spaces of projection profile histogram of the character string, and other factors which are obtained from the projection profile histogram, the parameters being used to estimate character positions;

    a histogram distribution smoothing network portion formed by a network for receiving an output from said projection profile histogram-calculating portion and an output from said initial parameter-setting portion, said network being composed of interconnected operators each having a multiplicity of inputs and one output, said operators corresponding to positions of pixels in a direction of the character string, and said histogram distribution smoothing network portion for acting to minimize a function which assumes its minimum value when values of the projection profile histogram of character portions have substantially the same value, to thereby smooth the projection profile histogram, thus reducing effects of noises present in the received character string;

    an extraction position-estimating network portion formed by a second network for receiving an output from said projection profile histogram-calculating portion and an output from said initial parameter-setting portion, said second network being composed of interconnected operators each having a multiplicity of inputs and one output, said operators corresponding to positions of pixels in a direction of the character string, and said extraction position-estimating network portion for acting to minimize a function which assumes its minimum value when an extracted position of the whole character string has been optimally extracted, to thereby estimate an extraction position of the character;

    an extraction position-determining portion for determining character extraction positions according to values outputted from said extraction position-estimating network portion;

    a character pattern output portion for reading a character pattern of each character from said memory according to a signal outputted from said extraction position-determining portion;

    a character recognition portion for recognizing what character is represented by each character pattern produced from said character pattern output portion;

    a language processing portion for receiving an output from said character recognition portion and calculating an evaluation value of each phrase of the character string applied heretofore, said evaluation value indicating a degree of correctness in terms of vocabulary and grammar;

    a final character position-determining portion for receiving an output from said language processing portion and determining whether characters in the phrase have been correctly extracted in terms of vocabulary and grammar and which, if the characters have not been correctly extracted, producing positions of a first character and a last character because probability of erroneous estimation of character positions in said phrase is high and which, if the characters have been correctly extracted, producing these character positions as final character extraction positions;

    a re-extraction position-determining portion for receiving an output from said final character position-determining portion and determining a position at which a character position should be estimated again;

    a character squareness degree-modifying portion for estimating a squareness degree from a character string which has been judged to have correct character positions by said final character position-determining portion; and

    a parameter-modifying portion for receiving an output from said re-extraction position-determining portion and an output from said character squareness degree-modifying portion and modifying said processing parameters used to estimate the character positions.

View all claims
  • 0 Assignments
Timeline View
Assignment View
    ×
    ×