Character recognition apparatus
Abstract
Each row of characters is extracted from document data representing rows of characters by a character row extracting circuit. The characters are separated from the row by a character separating circuit. In a reference line detecting circuit, two histograms are formed, one pertaining to the upper sides of the rectangles bordering the extremities of the characters, and the other pertaining to the lower sides of these rectangles. Two reference lines are defined from these histograms. The characters are sorted into several categories, in accordance with their sizes and the positions they take with respect to the reference lines. The pattern of each character is compared with the dictionary patterns of the same category.
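The processing summarized in the abstract can be illustrated with a minimal Python sketch. It is not taken from the specification: the `Box` type, the function names, the tolerance `tol`, the threshold `tiny_ratio`, and the three category labels are assumptions chosen for illustration. Each reference line is taken here as the single highest peak of the corresponding histogram.

```python
from collections import Counter
from typing import List, NamedTuple, Tuple


class Box(NamedTuple):
    """Rectangle bordering one separated character (y grows downward)."""
    left: int
    top: int
    right: int
    bottom: int


def reference_lines(boxes: List[Box]) -> Tuple[int, int]:
    """Return (upper, lower) reference lines for one row of characters.

    One histogram counts the y positions of the upper sides of the
    rectangles, the other the y positions of the lower sides. Each
    reference line is placed at the peak of its histogram.
    """
    top_hist = Counter(b.top for b in boxes)
    bottom_hist = Counter(b.bottom for b in boxes)
    upper = top_hist.most_common(1)[0][0]
    lower = bottom_hist.most_common(1)[0][0]
    return upper, lower


def sort_character(box: Box, upper: int, lower: int,
                   tol: int = 2, tiny_ratio: float = 0.5) -> str:
    """Sort a character by its size and its position relative to the lines.

    The thresholds and the three labels are illustrative assumptions.
    """
    height = box.bottom - box.top
    line_height = lower - upper
    if height < tiny_ratio * line_height:
        return "tiny"
    if abs(box.top - upper) <= tol and abs(box.bottom - lower) <= tol:
        return "standard"
    return "other"


# Example row: three ordinary letters, a period, and a letter with a descender.
row = [
    Box(0, 10, 8, 30),
    Box(10, 10, 18, 30),
    Box(20, 11, 28, 30),
    Box(30, 26, 33, 30),
    Box(35, 10, 43, 36),
]
upper, lower = reference_lines(row)
categories = [sort_character(b, upper, lower) for b in row]
```

For this row the sketch places the reference lines at y = 10 and y = 30 and labels the five characters standard, standard, standard, tiny, and other. A recognizer would then compare each character only with dictionary patterns of its own category.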
16 Claims
1. A character recognition apparatus comprising:
document data input means for inputting document data representing at least one row of characters;
character separating means for extracting the row of characters from the document data input by said document data input means and separating the characters;
histogram forming means for forming a histogram pertaining to a predetermined position of each of the characters of the row of characters which said character separating means has extracted from the document data;
reference line defining means for defining a reference line from the histogram formed by said histogram forming means;
character sorting means for sorting the characters into a plurality of categories in accordance with the positional relation between the characters and the reference line defined by said reference line defining means; and
character recognizing means for recognizing the characters sorted by said character sorting means, using dictionary patterns of a category corresponding to the sorted characters.
(Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11)

12. A character recognition apparatus comprising:
document data input means for inputting document data representing a plurality of rows of characters including tiny characters, standard-size characters, and characters of other sizes;
character separating means for extracting each of the rows of characters from the document data input by said document data input means and separating the characters of each of the rows;
histogram forming means for forming a histogram pertaining to at least two predetermined positions of each of the rows of characters which said character separating means has extracted from the document data;
reference line defining means for defining two reference lines corresponding to the two positions from the histogram formed by said histogram forming means;
character sorting means for sorting the characters into tiny characters, standard-size characters, and characters of other sizes in accordance with the sizes of the characters and the positions of the characters with respect to the two reference lines defined by said reference line defining means; and
character recognizing means for recognizing the characters sorted by said character sorting means, by comparing the patterns of these characters with dictionary patterns,
wherein said histogram forming means forms a histogram pertaining to all characters of each row, except for the tiny ones.

13. A character recognition apparatus comprising:
document data input means for inputting document data representing a plurality of rows of characters including tiny characters, standard-size characters, and characters of other sizes;
character separating means for extracting each of the rows of characters from the document data input by said document data input means and separating the characters of each of the rows;
histogram forming means for forming a histogram pertaining to at least two predetermined positions of each of the rows of characters which said character separating means has extracted from the document data;
reference line defining means for defining two reference lines corresponding to the two positions from the histogram formed by said histogram forming means;
character sorting means for sorting the characters into tiny characters, standard-size characters, and characters of other sizes in accordance with the sizes of the characters and the positions of the characters with respect to the two reference lines defined by said reference line defining means;
character recognizing means for recognizing the characters sorted by said character sorting means, by comparing the patterns of these characters with dictionary patterns; and
a pattern dictionary storing the dictionary patterns sorted in various categories, said pattern dictionary being so designed to output the patterns of characters of any category for comparison with any character of the same category.

14. A character recognition apparatus comprising:
document data input means for inputting document data representing at least one row of characters;
character separating means for extracting the row of characters from the document data input by said document data input means, and separating the characters from the row; and
reference line defining means for simulating straight lines passing specific points of the characters separated from the row, forming a histogram pertaining to the characters in a parameter space defined by the coordinates parameters defining the straight lines, detecting, as a reference line, the straight line defined by a coordinates parameter corresponding to the peak of the histogram, and detecting the inclination of this reference line,
wherein said reference line detecting means detects the reference line by means of the Hough voting.

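As an illustration only, and not as part of the claim, the Hough voting recited in claim 14 might be sketched in Python as follows. The sketch assumes that the "specific points" are one (x, y) point per character, for example the middle of the lower side of its bordering rectangle, and that each candidate line is parameterized as x·cos θ + y·sin θ = ρ. The function name, the angle range, and the bin sizes are hypothetical.

```python
import math
from collections import Counter
from typing import Iterable, Tuple


def hough_reference_line(points: Iterable[Tuple[float, float]],
                         angle_step_deg: float = 1.0,
                         max_skew_deg: float = 10.0,
                         rho_step: float = 1.0) -> Tuple[float, float]:
    """Return (skew_deg, rho) of the line that collects the most votes.

    Every point votes, for each candidate skew angle, for the one line of
    that angle passing through it. The votes form a histogram over the
    (angle, rho) parameter space, and its peak is taken as the reference
    line. skew_deg is the inclination of the line from the x axis, measured
    in the image coordinate system (y grows downward).
    """
    pts = list(points)
    votes: Counter = Counter()
    steps = int(round(max_skew_deg / angle_step_deg))
    for k in range(-steps, steps + 1):
        skew = k * angle_step_deg
        # A line inclined by `skew` has its normal at 90 degrees + skew.
        theta = math.radians(90.0 + skew)
        c, s = math.cos(theta), math.sin(theta)
        for x, y in pts:
            rho = x * c + y * s
            votes[(k, round(rho / rho_step))] += 1
    (k, rho_bin), _count = votes.most_common(1)[0]
    return k * angle_step_deg, rho_bin * rho_step


# Example: lower-side points of a row skewed by 3 degrees, plus two
# points from characters that extend below the row.
slope = math.tan(math.radians(3.0))
sample = [(x, 50.0 + x * slope) for x in range(0, 201, 20)]
sample += [(30.0, 60.0), (130.0, 66.0)]
skew_deg, rho = hough_reference_line(sample)
```

With these sample points the peak falls at a skew of 3 degrees, so the inclination of the row is recovered even though two of the points lie off the line. A finer `angle_step_deg` gives a finer skew estimate at the cost of more votes.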
15. A character recognition apparatus comprising:
document data input means for inputting document data representing at least one row of characters including tiny characters, standard-size characters, and nonstandard-size characters;
character separating means for extracting the row of characters from the document data input by said document data input means and separating the characters;
first histogram forming means for forming a first histogram pertaining to the row of characters which said character separating means has extracted from the document data;
reference line defining means including means for detecting two peaks of said histogram and defining two first reference lines from these peaks, means for detecting the tiny characters on the basis of the first reference lines, and second histogram forming means for forming a second histogram of all characters, except for the tiny characters, and means for defining two second reference lines from the peaks of this second histogram;
character sorting means for sorting the characters into a plurality of categories in accordance with the positional relation between the characters and the second reference lines; and
character recognizing means for recognizing the characters sorted by said character sorting means, using dictionary patterns of a category corresponding to the sorted characters.
(Dependent claims: 16)

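The two-stage reference line detection of claim 15 can be sketched in Python under several assumptions that the claim itself does not fix: the two peaks are read as the peak of the upper-side positions and the peak of the lower-side positions, a character is treated as tiny when its height is below `tiny_ratio` times the distance between the first reference lines, and each character is given as a `(top, bottom)` pair. All names are hypothetical.

```python
from collections import Counter
from typing import Iterable, List, Tuple


def peak(values: Iterable[int]) -> int:
    """Return the most frequent value, i.e. the peak of its histogram."""
    return Counter(values).most_common(1)[0][0]


def two_pass_reference_lines(boxes: List[Tuple[int, int]],
                             tiny_ratio: float = 0.5
                             ) -> Tuple[int, int, List[bool]]:
    """Return (upper, lower, tiny_flags) for one row of (top, bottom) boxes.

    Pass 1 builds the histograms from every character and yields two first
    reference lines, which are used only to detect tiny characters.
    Pass 2 rebuilds the histograms without the tiny characters and yields
    the two second reference lines used for sorting.
    Assumes the row contains at least one character that is not tiny.
    """
    upper1 = peak(top for top, _ in boxes)
    lower1 = peak(bottom for _, bottom in boxes)
    spacing = lower1 - upper1
    tiny = [(bottom - top) < tiny_ratio * spacing for top, bottom in boxes]
    rest = [box for box, is_tiny in zip(boxes, tiny) if not is_tiny]
    upper2 = peak(top for top, _ in rest)
    lower2 = peak(bottom for _, bottom in rest)
    return upper2, lower2, tiny


# Example row: five letters and two small marks whose upper sides sit at y = 10.
row = [(10, 30), (10, 30), (11, 30), (11, 30), (11, 30), (10, 14), (10, 14)]
upper, lower, tiny_flags = two_pass_reference_lines(row)
```

In this example the first pass puts the upper line at y = 10 because the two small marks add their votes there. Once they are excluded, the second pass moves it to y = 11, where most letters actually start, which shows why leaving tiny characters out of the second histogram can matter.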
Specification