Classifier combination for optical character recognition systems utilizing normalized weights and samples of characters
First Claim
Patent Images
1. A method for training an optical character recognition (OCR) system for a character pattern, the method comprising:
- comparing the character pattern to images of the same character, wherein the images are associated with a database of character images;
comparing the character pattern to images of different characters associated with the database of character images;
calculating a weight value for a first classifier for an image sample of the same character based on comparing the character pattern to images of the same character;
calculating a weight value for the first classifier for an image sample of a different character based on comparing the character pattern to images of the different characters;
calculating a weight value for a second classifier for an image sample of the same character based on comparing the character pattern to images of the same character;
calculating a weight value for the second classifier for an image sample of a different character based on comparing the character pattern to images of the different characters;
for each weight value for the first classifier, calculating a corresponding normalized weight for the first classifier, based on a correlation between same character and different character samples of characters which received a same weight value by comparing the said calculated character pattern weight values;
for each weight value for the second classifier, calculating a corresponding normalized weight for the second classifier, based on a correlation between same character and different character samples of characters which received a same weight value by comparing the said calculated character pattern weight value;
calculating corresponding normalized weight values; and
combining the normalized weight values for the first classifier with those normalized weight values associated with the second classifier.
4 Assignments
0 Petitions
Accused Products
Abstract
Techniques and methods are disclosed herein for combining and weighting of values from and associated with classifiers. Classifiers are used to recognize characters as part of an optical character recognition (OCR) system. Various methods of normalization facilitate combining of results of classifiers. For example, weight values may be entered into a weight table having two columns, one that includes weights from comparing patterns with images of correct characters, the other column includes weights from comparing patterns with images of incorrect characters.
-
Citations
19 Claims
-
1. A method for training an optical character recognition (OCR) system for a character pattern, the method comprising:
-
comparing the character pattern to images of the same character, wherein the images are associated with a database of character images; comparing the character pattern to images of different characters associated with the database of character images; calculating a weight value for a first classifier for an image sample of the same character based on comparing the character pattern to images of the same character; calculating a weight value for the first classifier for an image sample of a different character based on comparing the character pattern to images of the different characters; calculating a weight value for a second classifier for an image sample of the same character based on comparing the character pattern to images of the same character; calculating a weight value for the second classifier for an image sample of a different character based on comparing the character pattern to images of the different characters; for each weight value for the first classifier, calculating a corresponding normalized weight for the first classifier, based on a correlation between same character and different character samples of characters which received a same weight value by comparing the said calculated character pattern weight values; for each weight value for the second classifier, calculating a corresponding normalized weight for the second classifier, based on a correlation between same character and different character samples of characters which received a same weight value by comparing the said calculated character pattern weight value; calculating corresponding normalized weight values; and combining the normalized weight values for the first classifier with those normalized weight values associated with the second classifier. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. An electronic device for training an optical character recognition (OCR) classifier, the device comprising:
-
a processor; a memory in electronic communication with the processor, the memory configured with instructions that cause the electronic device to; compare a character pattern to images of the same character, wherein the images of the same character are part of a collection of character images; compare the character pattern to images of different characters, wherein the images of the different characters are part of a collection of character images; calculate a weight value for an image sample of the same character based on comparing the character pattern to images of the same character; calculate a weight value for an image sample of a different character based on comparing the character pattern to images of the different character; for each weight value, calculate a corresponding normalized weight, based on a correlation between same character samples and different character samples for the character samples that received a same weight value by comparing the said calculated character pattern weight values; and store in said memory said normalized weights for use by the OCR classifier. - View Dependent Claims (11, 12, 13, 14)
-
-
15. One or more physical computer-accessible media encoded with instructions for performing a method, the method comprising:
-
comparing a character pattern to images of the same character, wherein the images of the same character are part of a collection of character images; comparing the character pattern to images of different characters, wherein the images of the different characters are part of a collection of character images; calculating a weight value for an image sample of the same character based on comparing the character pattern to images of the same character; calculating a weight value for an image sample of a different character based on comparing the character pattern to images of the different characters; for each weight value, calculating a corresponding normalized weight, based on a correlation between same character samples and different character samples for the character samples that received a same weight value by comparing the said calculated character pattern weight values; and storing in a computer-accessible memory said normalized weights for use by the OCR classifier. - View Dependent Claims (16, 17, 18, 19)
-
Specification