Character recognition processing method and apparatus
First Claim
1. A character recognition processing method, comprising:
- extracting, by using a computer, a feature vector for an input character from a reading result of said input character;
calculating, by using the computer, distances between said feature vector for said input character and vectors including average vectors stored in a system dictionary storing, for each character, said average vector of said feature vectors and distribution information, and feature vectors stored in a user dictionary storing, for each character, said feature vector;
extracting, by using the computer, a top predetermined number of character codes in an ascending order of the calculated distances;
obtaining, by using the computer, second distribution information for said character codes, which are extracted from said user dictionary and included in said top predetermined number of character codes;
calculating, by using the computer, for each of said top predetermined number of character codes, a second distance with said feature vector for said input character, by using, for said character codes, which are extracted from said user dictionary and included in said top predetermined number of character codes, said second distribution information, and by using, for said character codes, which are extracted from said system dictionary and included in said top predetermined number of character codes, the corresponding distribution information in said system dictionary; and
identifying, by using the computer, a character code whose second distance is shortest.
1 Assignment
0 Petitions
Accused Products
Abstract
This method includes: extracting a feature vector for an input character from a reading result of the input character; calculating distances between the feature vector for the input character and vectors including average vectors stored in a system dictionary storing, for each character, the average vector and distribution information, and feature vectors stored in a user dictionary; extracting the top N character codes in an ascending order of the calculated distances; obtaining second distribution information for the character codes, which are included the user dictionary and in the top N character codes; calculating, for each of the top N character codes, a second distance with the feature vector for the input character, by using, for the character codes, which are included in the user dictionary and in the top N character codes, the second distribution information; and identifying a character code whose second distance is shortest.
9 Citations
11 Claims
-
1. A character recognition processing method, comprising:
-
extracting, by using a computer, a feature vector for an input character from a reading result of said input character; calculating, by using the computer, distances between said feature vector for said input character and vectors including average vectors stored in a system dictionary storing, for each character, said average vector of said feature vectors and distribution information, and feature vectors stored in a user dictionary storing, for each character, said feature vector; extracting, by using the computer, a top predetermined number of character codes in an ascending order of the calculated distances; obtaining, by using the computer, second distribution information for said character codes, which are extracted from said user dictionary and included in said top predetermined number of character codes; calculating, by using the computer, for each of said top predetermined number of character codes, a second distance with said feature vector for said input character, by using, for said character codes, which are extracted from said user dictionary and included in said top predetermined number of character codes, said second distribution information, and by using, for said character codes, which are extracted from said system dictionary and included in said top predetermined number of character codes, the corresponding distribution information in said system dictionary; and identifying, by using the computer, a character code whose second distance is shortest. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A computer-readable storage medium storing a program for causing a computer to execute a character recognition process, comprising:
-
extracting a feature vector for an input character from a reading result of said input character; calculating distances between said feature vector for said input character and vectors including average vectors stored in a system dictionary storing, for each character, said average vector of said feature vectors and distribution information, and feature vectors stored in a user dictionary storing, for each character, said feature vector; extracting a top predetermined number of character codes in an ascending order of the calculated distances; obtaining second distribution information for said character codes, which are extracted from said user dictionary and included in said top predetermined number of character codes; calculating, for each of said top predetermined number of character codes, a second distance with said feature vector for said input character, by using, for said character codes, which are extracted from said user dictionary and included in said top predetermined number of character codes, said second distribution information, and by using, for said character codes, which are extracted from said system dictionary and included in said top predetermined number of character codes, the corresponding distribution information in said system dictionary; and identifying a character code whose second distance is shortest.
-
-
11. A character recognition processing apparatus, comprising:
-
a first extracting unit to extract a feature vector for an input character from a reading result of said input character; a first calculating unit to calculate distances between said feature vector for said input character and vectors including average vectors stored in a system dictionary storing, for each character, said average vector of said feature vectors and distribution information, and feature vectors stored in a user dictionary storing, for each character, said feature vector; a second extracting unit to extract a top predetermined number of character codes in an ascending order of the calculated distances; an obtaining unit to obtain second distribution information for said character codes, which are extracted from said user dictionary and included in said top predetermined number of character codes; a second calculating unit to calculate, for each of said top predetermined number of character codes, a second distance with said feature vector for said input character, by using, for said character codes, which are extracted from said user dictionary and included in said top predetermined number of character codes, said second distribution information, and by using, for said character codes, which are extracted from said system dictionary and included in said top predetermined number of character codes, the corresponding distribution information in said system dictionary; and an identifying unit to identify a character code whose second distance is shortest.
-
Specification