Character extraction apparatus, dictionary production apparatus and character recognition apparatus, using both apparatuses
First Claim
1. A character recognition apparatus, comprising:
- a character image extracting means for extracting character images from a text image;
a candidate character selecting means for comparing the character images with standard characters for selecting a plurality of candidate characters with higher matching levels, and assigning first evaluation values to the candidate characters according to the matching levels;
a character rectangle data dictionary for prestoring shape data of circumscribed rectangles of the standard characters;
a character rectangle extracting means for extracting position data of the circumscribed rectangles of the character images extracted by the character image extracting means;
a character rectangle shape data extracting means for extracting normalized shape data from the position data of the circumscribed rectangles of the character images extracted by the character rectangle extracting means;
a character rectangle evaluating means for obtaining a second evaluation value for each candidate character by a certain computation using the normalized shape data extracted by the character rectangle shape data extracting means and the shape data stored in the character rectangle data dictionary; and
a character determining means for determining a character among the plurality of candidate characters selected by the candidate character selecting means based on the first evaluation values and the second evaluation values.
0 Assignments
0 Petitions
Accused Products
Abstract
A character extraction apparatus is provided for extracting character data for each character from a text image which is represented by first pixels corresponding to character images and second pixels corresponding to background images. The character extraction apparatus comprises a character row detecting means for detecting character rows from the text image and obtaining position data of each character row; a pixel array extracting means for extracting arrays of continuous first pixels in an area specified by the character row position data and computing position data of each of the arrays of continuous first pixels; a character array linking means for linking the arrays of continuous first pixels in the area based on the position data of the arrays of continuous first pixels; and a character extracting means for recognizing each set of arrays of continuous first pixels linked by the character array linking means as a character and outputting character data.
-
Citations
20 Claims
-
1. A character recognition apparatus, comprising:
-
a character image extracting means for extracting character images from a text image; a candidate character selecting means for comparing the character images with standard characters for selecting a plurality of candidate characters with higher matching levels, and assigning first evaluation values to the candidate characters according to the matching levels; a character rectangle data dictionary for prestoring shape data of circumscribed rectangles of the standard characters; a character rectangle extracting means for extracting position data of the circumscribed rectangles of the character images extracted by the character image extracting means; a character rectangle shape data extracting means for extracting normalized shape data from the position data of the circumscribed rectangles of the character images extracted by the character rectangle extracting means; a character rectangle evaluating means for obtaining a second evaluation value for each candidate character by a certain computation using the normalized shape data extracted by the character rectangle shape data extracting means and the shape data stored in the character rectangle data dictionary; and a character determining means for determining a character among the plurality of candidate characters selected by the candidate character selecting means based on the first evaluation values and the second evaluation values. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 12, 13, 14, 15, 16, 17, 18, 19)
-
- 9. The character recognition apparatus of claim 9, wherein the threshold value used in the part extracting unit is a half of a maximum value of heights of the circumscribed rectangles of the character images.
-
20. A computer-readable recording medium storing a program which allows a computer to function as a character recognition apparatus, the computer-readable recording medium storing:
-
a character rectangle data dictionary for prestoring shape data circumscribed rectangles of standard characters, wherein the character recognition apparatus comprises; a character image extracting means for extracting character images from a text image; a candidate character selecting means for comparing the character images with the standard characters for selecting a plurality of candidate characters with higher matching levels, and assigning first evaluation values to the candidate characters according to the matching levels; a character rectangle extracting means for extracting position data of the circumscribed rectangles of the character images extracted by the character image extracting means; a character rectangle shape data extracting means for extracting normalized shape data from the position data of the circumscribed rectangles of the character images extracted by the character rectangle extracting means; a character rectangle evaluating means for obtaining a second evaluation value for each candidate character by a certain computation using the normalized shape data extracted by the character rectangle shape data extracting means and the shape data stored in the character rectangle data dictionary; and a character determining means for determining a character among the plurality of candidate characters selected by the candidate character selecting means based on the first evaluation values and the second evaluation values.
-
Specification