Method of reading characters and method of reading postal addresses
First Claim
1. A method of reading characters by converting image information of a written surface into an electrical signal and reading characters of a character string included in the image information, said method comprising:
- a first step of locating a character string description region in the electrical signal of the image information, and segmenting image information of a character string in the character string region into multiple tentative character patterns;
a second step of implementing the character classification for the tentative character patterns by making reference to a character classification dictionary thereby to obtain multiple recognition candidate characters for each tentative character pattern;
a third step of obtaining border information for the tentative character patterns;
a fourth step of obtaining the credibility of the border information of the tentative character patterns obtained in said third step by making reference to a segmentation dictionary which contains the border information by using the recognition-candidate characters obtained in said second step as the key, and applying weights to the tentative character patterns;
a fifth step of determining the character segmentation in accordance with the weights of tentative character patterns; and
a sixth step of implementing the word-wise matching by using the character classification dictionary based on a set of classified character species produced from the tentative character patterns determined in the fifth step, and identifying the characters of the character string.
2 Assignments
0 Petitions
Accused Products
Abstract
A character reading method has enhanced character segmentation accuracy and character string recognition accuracy for reading correctly hand-written addresses on postal matters. The method extracts provisional character patterns from image information of the address character string (step 206), creates a table 219 of tentative character patterns and implements the character classification for the tentative character patterns (step 207), extracts, specifically for characters of the street number portion of the address character string, periphery information (vertical and horizontal lengths, vertical/horizontal length ratio, pattern spacings, etc.) of tentative character patterns (step 212), and segments the character string into characters accurately based on the information (step 215).
121 Citations
13 Claims
-
1. A method of reading characters by converting image information of a written surface into an electrical signal and reading characters of a character string included in the image information, said method comprising:
-
a first step of locating a character string description region in the electrical signal of the image information, and segmenting image information of a character string in the character string region into multiple tentative character patterns;
a second step of implementing the character classification for the tentative character patterns by making reference to a character classification dictionary thereby to obtain multiple recognition candidate characters for each tentative character pattern;
a third step of obtaining border information for the tentative character patterns;
a fourth step of obtaining the credibility of the border information of the tentative character patterns obtained in said third step by making reference to a segmentation dictionary which contains the border information by using the recognition-candidate characters obtained in said second step as the key, and applying weights to the tentative character patterns;
a fifth step of determining the character segmentation in accordance with the weights of tentative character patterns; and
a sixth step of implementing the word-wise matching by using the character classification dictionary based on a set of classified character species produced from the tentative character patterns determined in the fifth step, and identifying the characters of the character string. - View Dependent Claims (2, 3)
-
-
4. A method of reading a postal address comprising:
-
a first step of converting image information, which includes character string information having a town name portion and a street number portion, into an electrical signal;
a second step of locating a character string description region in the electrical signal of the image information, and extracting combinations of connected image components, which form characters in the character string description region, as tentative character patterns;
a third step of implementing the character classification for each of the tentative character patterns by making reference to the character classification dictionary thereby to obtain recognition candidate characters and the similarity of tentative character patterns and the recognition-candidate characters;
a fourth step of forming a lattice consisting of the recognition-candidate characters, implementing the matching for the lattice with a town name dictionary thereby to identify character strings of the town name portion in the tentative character patterns, and detecting the head position of the street number portion;
a fifth step of extracting, based on the information of the head position obtained in said fourth step, periphery information of tentative character patterns which correspond to recognition-candidate characters of tentative character patterns in the street number portion, and applying weights to the tentative character patterns for evaluating the credibility of the periphery information of the tentative character patterns by making reference to the segmentation dictionary, which contains likelihood of the periphery information, by using the recognition-candidate character as the key;
a sixth step of segmenting the street number portion into characters based on the weights; and
a seventh step of implementing the word-wise matching with a street number dictionary for a set of character classification results produced in said sixth step thereby to identify the character string of street number. - View Dependent Claims (5, 6, 7)
-
-
8. A method of reading characters with a postal address reading apparatus having an image input means for converting image information of a written surface into an electrical signal and means of reading out of the image a character string written on the surface, said method comprising:
-
a first step of extracting the signal of the character string from the electrical signal of the image;
a second step of extracting a tentative character pattern which is deemed to form a character from the signal of the character string, or, in case a tentative character pattern cannot be determined uniquely, extracting a plurality of tentative character patterns;
a third step of implementing the character classification for the extracted tentative character pattern;
a fourth step of calculating the external form penalty based on the assessment of the periphery information depending on the possible types of error of character segmentation; and
a fifth step of confining candidates of tentative character patterns in accordance with the character classification result of said step 3 and the external form penalty calculated in said fourth step, and implementing the matching for the character pattern candidates with character strings stored in advance in a dictionary which contains character strings that can possibly be written on written surfaces, thereby recognizing the character string written on the written surface. - View Dependent Claims (9, 10, 11, 12, 13)
a step of extracting a tentative character pattern which is deemed to form a character string from the image of the character string, or, in case a tentative character pattern cannot be determined uniquely, extracting a plurality of tentative character patterns;
a step of entering information on as to whether or not the tentative character pattern is segmented correctly, with types of segmentation error being sorted manually in the case of incorrect segmentation;
a step of storing tentative character patterns in a memory by sorting the tentative character patterns depending on the result of said incorrect segmentation judgment step; and
a step of implementing the learning of a classifying device by using the tentative character patterns stored in the memory by said pattern storing step.
-
Specification