Character recognition machine utilizing language processing
First Claim
1. A character extraction machine comprising:
- a character string image input portion for receiving an image consisting of a character string;
a memory for holding the received character string;
a projection profile histogram-calculating portion for calculating a concentration value of histogram which is obtained by counting pixels in the character string;
an initial parameter-setting portion for receiving an output from said projection profile histogram-calculating portion and determining values of processing parameters from heights of the character string, lengths of spaces of projection profile histogram of the character string, and other factors which are obtained from the projection profile histogram, the parameters being used to estimate character positions;
a histogram distribution smoothing network portion formed by a network for receiving an output from said projection profile histogram-calculating portion and an output from said initial parameter-setting portion, said network being composed of interconnected operators each having a multiplicity of inputs and one output, said operators corresponding to positions of pixels in a direction of the character string, and said histogram distribution smoothing network portion for acting to minimize a function which assumes its minimum value when values of the projection profile histogram of character portions have substantially the same value, to thereby smooth the projection profile histogram, thus reducing effects of noises present in the received character string;
an extraction position-estimating network portion formed by a second network for receiving an output from said projection profile histogram-calculating portion and an output from said initial parameter-setting portion, said second network being composed of interconnected operators each having a multiplicity of inputs and one output, said operators corresponding to positions of pixels in a direction of the character string, and said extraction position-estimating network portion for acting to minimize a function which assumes its minimum value when an extracted position of the whole character string has been optimally extracted, to thereby estimate an extraction position of the character;
an extraction position-determining portion for determining character extraction positions according to values outputted from said extraction position-estimating network portion;
a character pattern output portion for reading a character pattern of each character from said memory according to a signal outputted from said extraction position-determining portion;
a character recognition portion for recognizing what character is represented by each character pattern produced from said character pattern output portion;
a language processing portion for receiving an output from said character recognition portion and calculating an evaluation value of each phrase of the character string applied heretofore, said evaluation value indicating a degree of correctness in terms of vocabulary and grammar;
a final character position-determining portion for receiving an output from said language processing portion and determining whether characters in the phrase have been correctly extracted in terms of vocabulary and grammar and which, if the characters have not been correctly extracted, producing positions of a first character and a last character because probability of erroneous estimation of character positions in said phrase is high and which, if the characters have been correctly extracted, producing these character positions as final character extraction positions;
a re-extraction position-determining portion for receiving an output from said final character position-determining portion and determining a position at which a character position should be estimated again;
a character squareness degree-modifying portion for estimating a squareness degree from a character string which has been judged to have correct character positions by said final character position-determining portion; and
a parameter-modifying portion for receiving an output from said re-extraction position-determining portion and an output from said character squareness degree-modifying portion and modifying said processing parameters used to estimate the character positions.
0 Assignments
0 Petitions
Accused Products
Abstract
There is disclosed a character recognition machine adapted to recognize Japanese characters such as kanjis and kanas. The machine comprises a character string storage portion, a character extraction portion, a character recognition portion, and a language processing portion. A character string to be recognized is stored as an image in the storage portion. The character extraction portion comprises a network consisting a plurality of interconnected operators each of which has numerous inputs and outputs. An evaluation function which assumes its minimum value when a character extraction produces the best results is calculated by the operators simultaneously so as to minimize the value of the function. The character recognition portion calculates degrees of similarity of a character pattern to various character categories, the character pattern being applied from the character extraction portion. The language processing portion receives these degrees of similarity and selects a character category which seems to provide the most correct combination of characters in terms of vocabulary and grammar.
-
Citations
21 Claims
-
1. A character extraction machine comprising:
-
a character string image input portion for receiving an image consisting of a character string; a memory for holding the received character string; a projection profile histogram-calculating portion for calculating a concentration value of histogram which is obtained by counting pixels in the character string; an initial parameter-setting portion for receiving an output from said projection profile histogram-calculating portion and determining values of processing parameters from heights of the character string, lengths of spaces of projection profile histogram of the character string, and other factors which are obtained from the projection profile histogram, the parameters being used to estimate character positions; a histogram distribution smoothing network portion formed by a network for receiving an output from said projection profile histogram-calculating portion and an output from said initial parameter-setting portion, said network being composed of interconnected operators each having a multiplicity of inputs and one output, said operators corresponding to positions of pixels in a direction of the character string, and said histogram distribution smoothing network portion for acting to minimize a function which assumes its minimum value when values of the projection profile histogram of character portions have substantially the same value, to thereby smooth the projection profile histogram, thus reducing effects of noises present in the received character string; an extraction position-estimating network portion formed by a second network for receiving an output from said projection profile histogram-calculating portion and an output from said initial parameter-setting portion, said second network being composed of interconnected operators each having a multiplicity of inputs and one output, said operators corresponding to positions of pixels in a direction of the character string, and said extraction position-estimating network portion for acting to minimize a function which assumes its minimum value when an extracted position of the whole character string has been optimally extracted, to thereby estimate an extraction position of the character; an extraction position-determining portion for determining character extraction positions according to values outputted from said extraction position-estimating network portion; a character pattern output portion for reading a character pattern of each character from said memory according to a signal outputted from said extraction position-determining portion; a character recognition portion for recognizing what character is represented by each character pattern produced from said character pattern output portion; a language processing portion for receiving an output from said character recognition portion and calculating an evaluation value of each phrase of the character string applied heretofore, said evaluation value indicating a degree of correctness in terms of vocabulary and grammar; a final character position-determining portion for receiving an output from said language processing portion and determining whether characters in the phrase have been correctly extracted in terms of vocabulary and grammar and which, if the characters have not been correctly extracted, producing positions of a first character and a last character because probability of erroneous estimation of character positions in said phrase is high and which, if the characters have been correctly extracted, producing these character positions as final character extraction positions; a re-extraction position-determining portion for receiving an output from said final character position-determining portion and determining a position at which a character position should be estimated again; a character squareness degree-modifying portion for estimating a squareness degree from a character string which has been judged to have correct character positions by said final character position-determining portion; and a parameter-modifying portion for receiving an output from said re-extraction position-determining portion and an output from said character squareness degree-modifying portion and modifying said processing parameters used to estimate the character positions. - View Dependent Claims (2)
-
-
3. A pattern recognition machine comprising:
-
a plurality of feature-extracting portions, each of which is for finding each feature vector from an applied pattern; a plurality of single feature-recognizing portions, each of which is for finding degrees of similarity of the applied pattern from each feature vector, the degrees of similarity indicating degrees to which the applied pattern belong to various categories; a category-discerning portion for discerning the applied pattern using the degrees of similarity obtained by said plurality of single feature-recognizing portions; each of said single feature-recognizing portions including; a group dictionary storing a plurality of group references feature vectors representing category groups each consisting of a set of patterns having similar feature vectors; a fuzzy rough classification portion for calculating group assignment degrees from the group reference feature vectors and from feature vectors of the applied pattern, the group assignment degree indicating degrees to which the applied pattern belong to the category groups, respectively; a plurality of subclassification portions for finding inside group similarity degrees from the feature vectors of the applied pattern, the inside group similarity degrees indicating degrees to which the applied pattern belongs to categories contained in the category groups; a group-selecting portion for selecting a plurality of category groups from the group assignment degrees; a subclassification portion input signal-selecting portion for selecting the subclassification portion which can receive the feature vectors of the applied pattern according to information about selection of groups obtained from said group-selecting portion; and a single feature similarity degree-calculating portion for finding degrees of similarity of the applied pattern to the categories from a group assignment degree of the category group selected by said group-selecting portion and from the inside group similarity degrees obtained by said selected subclassification portions; said single feature similarity degree-calculating portion including; a plurality of multipliers for multiplying the group assignment degree of the category group selected by said group-selecting portion by all inside group similarity degrees, respectively, obtained from said subclassification portions receiving the feature vectors of the applied pattern by said subclassification portion input signal-selecting portion; and a category similarity degree-calculating portion for selecting plural output values having larger values from output values from the multipliers for each category and calculating a sum of these output values; said category-discerning portion including; a plurality of similarity degree-normalizing portions for normalizing the degrees of similarity of the categories obtained from said single feature-recognizing portions corresponding to the feature vectors, respectively, with a maximum one of these degrees of similarity to thereby transform the degrees of similarity into normalized degrees of similarity; and a category decision portion for finding integrated degrees of similarity of the categories using the normalized degrees of similarity obtained by said similarity degree-normalizing portions and discerning the applied pattern. - View Dependent Claims (10, 11, 12)
-
-
4. A character recognition machine comprising:
-
a character recognition portion including; a plurality of feature-extracting portions, each of which is for finding each feature vector from a character pattern produced by a character extraction portion which divides a text image into plural regions for each character; a plurality of feature-recognizing portions, each of which is for finding degrees of similarity from each feature vector, the degrees of similarity indicating degrees at which the character pattern belongs to various character categories, respectively; a candidate character-selecting portion for finding a plurality of second candidate character categories and their degrees of similarity to the character pattern from the degrees of similarity obtained from said single feature-recognizing portions; and a post-recognition processing portion for selecting a final character category from the second character candidate categories using information about sizes and positions of the character pattern; each of said single feature-recognizing portions including; a group dictionary storing a plurality of group reference feature vectors representing character category groups each consisting of a set of character pattern having similar feature vectors; a fuzzy rough classification portion for calculating group assignment degrees from the group reference feature vectors and from feature vectors of the applied character pattern, the group assignment degree indicating degrees to which the applied character pattern belongs to the character category groups, respectively; a plurality of subclassification portions for finding inside group similarity degrees from the feature vectors of the applied character pattern, the inside group similarity degrees indicating degrees to which the applied character pattern belongs to character categories contained in the character category groups; a group-selecting portion for selecting a plurality of character category groups from the group assignment degrees; a subclassification portion input signal-selecting portion for selecting the subclassification portion which can receive the feature vectors of the applied character pattern according to information about selection of groups obtained from said group-selecting portion; and a single feature similarity degree-calculating portion for finding degrees of similarity of the applied character pattern to the character categories from a group assignment degree of the character category group selected by said group-selecting portion and from the inside group similarity degrees obtained by said selected subclassification portions; said single feature similarity degree-calculating portion including; a plurality of multipliers for multiplying the group assignment degree of the character category group selected by said group-selecting portion by all inside group similarity degrees, respectively, obtained from said subclassification portions receiving the feature vectors of the applied character pattern by said subclassification portion input signal-selecting portion; and a category similarity degree-calculating portion for selecting plural output values having larger values from output values having larger values from output values from the multipliers for each character category and calculating a sum of these output values; said candidate character-selecting portion including; a plurality of similarity degree-normalizing portions for normalizing the degrees of similarity of various character categories obtained from said single feature-recognizing portions corresponding to the feature vectors, respectively, with a maximum one of these degrees of similarity to thereby transform the degrees of similarity into normalized degrees of similarity; a plurality of first candidate character-selecting portions for selecting plural first candidate character categories from all character categories using information about sizes and positions in the character pattern and the normalized degrees of similarity; and a second character-selecting portion for summing up all normalized degrees of similarity of the selected first candidate character categories for each same category, finding integrated degrees of similarity of all the first candidate character categories, selecting plural first candidate character categories having larger values as second candidate character categories, and delivering the second candidate character categories and their integrated degrees of similarity. - View Dependent Claims (5, 6, 7, 8, 9)
-
-
13. A character recognition machine having a language processing portion capable of processing language, said language processing portion comprising:
-
a character string image storage means for storing an image input; a character recognition means for outputting a set of candidate characters and an evaluation; a word search means for receiving the set of candidate characters and determining a set of candidate words, using a word dictionary, based on a degree of similarity between the candidate characters and the characters in various character positions in the character string image; a phrase search means for finding candidate phrases from the set of candidate words, using a grammatical dictionary; a phrase evaluating value-calculating means for calculating correctness of each phrase in terms of vocabulary and grammar; a phrase-selecting means for selecting one phrase, based on the evaluating values of phrases, and producing a modified character string; a word incorrectness degree-calculating means for finding an incorrect recognition degree of each word in the modified character string, based on data about correctness percentage of each correct word and the evaluation made by said character recognition means; a word incorrect amendment degree-calculating means for finding an incorrect amendment degree of each word in the modified character string; and a reject processing means for determining characters to be rejected from the word incorrect amendment degree.
-
-
14. A character recognition machine having a language processing portion capable of processing language, said language processing portion comprising:
-
a character string image storage means for storing an image input; a word search means for receiving a set of candidate characters and determining a set of candidate words, using a word dictionary, based on a degree of similarity between the candidate characters and the characters in various character positions in the character string image; a phrase search means for finding candidate phrases from the set of candidate words, using a grammatical dictionary; a phrase evaluating value-calculating means for calculating correctness of each phrase in terms of vocabulary and grammar; a phrase-selecting means for selecting one phrase, based on evaluating values of phrases, and producing a modified character string; a word incorrectness degree-calculating means for finding an incorrect recognition degree of each word in the modified character string, based on data about correctness percentage of each amended word and an evaluation made by a character recognition means; a word incorrect amendment degree-calculating means for finding a word incorrect amendment degree of each word in the modified character string; a reject processing means for determining characters to be rejected from the word incorrect amendment degree; and a rejected character-replacing means for replacing the rejected characters in the modified character string by a character category having a maximum degree of similarity and outputted from said character recognition means.
-
-
15. A character recognition machine comprising:
-
a character string image storage portion, a character extraction portion, a character recognition portion, and a language processing portion; and said character string image storage portion for storing a character string to be recognized as a character string image; said character extraction portion connected with said character string image storage portion and comprising a plurality of operators arranged in a direction of said character string, each of said operators having a multiplicity of inputs and one outputs, the operators being interconnected so as to form a network, and said character extraction portion acting to determine boundary positions of characters by the use of values of initialized processing parameters, to obtain a character pattern of each character and information about sizes and positions in this character pattern, to deliver these character patterns and information to said character recognition portion, to again process a text image in character positions specified by said character recognition portion or by said language processing portion with modified values of said processing parameters, to find new boundary positions of characters in said character positions, to obtain a new character pattern and new information about sizes and positions in this character pattern, and to again deliver these character pattern and information to said character recognition portion; a character recognition portion for finding degrees of similarity of a character pattern applied from said character extraction portion, said degrees of similarity indicating degrees to which said character pattern belongs to various character categories, said character recognition portion acting to inform said character extraction portion of position of said character pattern in said character string if said degrees of similarity are not matched to information about sizes and positions in the character pattern applied from said character extraction portion, and if said degrees of similarity are matched to information about sizes and positions in the character pattern applied from said character extraction portion, said character recognition portion acting to deliver said degrees of similarity to said language processing portion, or to deliver degrees of similarity to the character category group restricted by said language processing portion for a character pattern in character positions specified by said language processing portion, and to learn a character pattern as a teacher pattern if said language processing portion specifies a correct character category, said character pattern being in character positions of said specified correct character category, and a language processing portion for receiving said degrees of similarity of character categories in various character positions found by said character recognition portion, selecting as a forecasted correct character category a combination of character categories which is correctest in terms of vocabulary and grammar, producing certain character categories and character positions corresponding to a certain character to said character recognition portion if categories of said certain forecasted correct character are restricted to said certain character category, producing said forecasted correct character and character positions corresponding to said forecested correct character to said character recognition portion if said forecasted correct character category differs from a character category applied from said character recognition portion and having a maximum degree of similarity, and producing character positions corresponding to a part to said character extraction portion if said part has no combination of character categories which seems to be correct in terms of vocabulary and grammar. - View Dependent Claims (16, 17, 18, 19)
-
-
20. A character recognition machine having a language processing portion capable of processing language, said language processing portion comprising:
-
a character string image storage means for storing an image input; a word search means for receiving a set of candidate characters and determining a set of candidate words, using a word dictionary, based on a degree of similarity between the candidate characters and the characters in various character positions in the character string image; a phrase search means for finding candidate phrases from the set of candidate words, using a grammatical dictionary; a phrase evaluating value-calculating means for calculating correctness of each phrase in terms of vocabulary and grammar; a phrase-selecting means for selecting one phrase, based on the evaluating values of phrases, and producing a modified character string; a word incorrectness degree-calculating means for finding an incorrect recognition degree of each word in the modified character string, based on data about correctness percentage of each correct word and an evaluation made by a character recognition portion; a word incorrect amendment degree-calculating means for finding a word incorrect amendment degree of each word in the modified character string; and a reject processing means for determining characters to be rejected from said word incorrect amendment degree and for outputting the rejected characters as misrecognized characters.
-
-
21. A character recognition machine having a language processing portion capable of processing language, said language processing portion comprising:
-
a character string image storage means for storing an image input; a word search means for receiving a set of candidate characters and determining a set of candidate words, using a word dictionary, based on a degree of similarity between the candidate characters and the characters in various character positions in the character string image; a phrase search means for finding candidate phrases from the set of candidate words, using a grammatical dictionary; a phrase evaluating value-calculating means for calculating correctness of each phrase in terms of vocabulary and grammar; a phrase-selecting means for selecting one phrase, based on evaluating values of phrases, and producing a modified character string; a word incorrectness degree-calculating means for finding an incorrect recognition degree of each word in the modified character string, based on data about correctness percentage of each amended word and an evaluation made by a character recognition portion; a word incorrect amendment degree-calculating means for finding a word incorrect amendment degree of each word in the modified character string; a reject processing means for determining characters to be rejected from the word incorrect amendment degree; a rejected character-replacing means for replacing the rejected characters in the modified character string by a character category having a maximum degree of similarity and outputted from said character recognition means; and a re-extraction position-indicating means for producing a re-extraction position of the rejected characters to a character extraction means which divides a text image into plural regions for each character.
-
Specification