Supervised training augmented polynomial method and apparatus for character recognition
First Claim
1. A method of character classification, comprising the steps of:
- (a) performing a polynomial discriminant weight matrix creation operation using a training character set of training character images;
(b) testing the weight matrix to determine classification accuracy;
(c) modifying the training character set responsive to the testing by increasing representation of a character that is incorrectly classified or distinguished in classification strength from an incorrect classification by less than a difference;
(d) repeating steps (a)-(c) for a number of iterations; and
(e) classifying an unclassified character in an image using the weight matrix.
10 Assignments
0 Petitions
Accused Products
Abstract
A system that classifies or identifies hand printed or machine printed alphanumeric characters using an iterated least squares polynomial discriminant method is disclosed. Super-sampling of labeled training characters is performed prior to feature extraction. Feature extraction subsamples individual character images. The feature vectors and associated labelings are used to create a weight matrix. During iteration, the weight matrix, to be subsequently used for identification, is modified by determining which characters are incorrectly classified, or classified with too small a confidence, and replicating those characters during training to strengthen the correct classification. The correct classification is also strengthened by using negative feedback to inhibit incorrect classification by using negative values in the target vectors. The speed of the learning process is enhanced by maintaining intermediate matrices and step wise increasing the amount of each feature vector and the size of the sample set used during training. The memory efficiency utilization is enhanced by maintaining and modifying intermediate matrices and compressing the sparse binary features vectors.
-
Citations
13 Claims
-
1. A method of character classification, comprising the steps of:
-
(a) performing a polynomial discriminant weight matrix creation operation using a training character set of training character images; (b) testing the weight matrix to determine classification accuracy; (c) modifying the training character set responsive to the testing by increasing representation of a character that is incorrectly classified or distinguished in classification strength from an incorrect classification by less than a difference; (d) repeating steps (a)-(c) for a number of iterations; and (e) classifying an unclassified character in an image using the weight matrix. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A method of character classification, comprising the steps of:
-
(a) creating, using a computer, a polynomial discriminant character weight matrix using a training exemplar feature vector set of features of captured character images; (b) testing the weight matrix to determine classification accuracy; (c) increasing representation of an exemplar in the feature vector set which is not classified correctly by repeating the exemplar; (d) adding a negative weight for an exemplar that is classified incorrectly; (e) repeating steps (a)-(d) for a predetermined number of iterations; and (f) classifying an unclassified character in using the weight matrix, thereby identifying the unidentified character.
-
-
9. A method, comprising the steps of:
-
(a) capturing classified printed character images using a printed character image capture device; and (b) using a computer for; (b1) normalizing the images; (b2) offset replicating the normalized images; (b3) subsampling the super-sampled normalized images to produce a set of subsampled images; (b4) extracting character features from the set of subsampled images to create a set of feature vectors for each of the images; (b5) selecting a subset of the feature vector set; (b6) selecting a prefix of each feature vector; (b7) creating intermediate matrices for using the subset of the feature vector set and the prefix of each feature vector; (b8) creating a weight matrix from the intermediate matrices; (b9) testing the weight matrix using a test feature vector set with test characters; (b10) increasing the representation of test characters by replicating the test characters in the intermediate matrices that are not correctly classified by said testing; (b11) representing incorrect classifications for test characters negatively in the intermediate matrices; (b12) adjusting the subset of the feature vector set selected; (b13) adjusting the prefix of each feature vector selected; and (b14) repeating steps (b7)-(b13) a predetermined number of times. - View Dependent Claims (10)
-
-
11. A system for classifying printed characters, comprising:
-
image capture means for capturing images of identified printed characters as a training set and for capturing unidentified printed characters; a computer coupled to said image capture means and comprising; weight matrix creation means for iteratively creating a weight matrix from the identified printed characters using polynomial discrimination during which the training set is modified by increasing representation of characters that are incorrectly classified; and identification means for identifying the unidentified printed characters using the weight matrix.
-
-
12. A system for classifying data, comprising:
-
first capture means for capturing identified data as a training set; a computer coupled to said first capture means and comprising weight matrix creation means for iteratively creating a weight matrix from the identified data using polynomial discrimination during which the training set is modified by increasing representation of identified data in the training set that is not correctly classified; second capture means for capturing unidentified data; and identification means for identifying the unidentified data using the weight matrix.
-
-
13. A method of data classification, comprising the steps of:
-
(a) performing, using a computer, a polynomial discriminant weight matrix creation operation using a training data set of training data; (b) testing the weight matrix using the computer to determine classification accuracy; (c) modifying the training data set using the computer responsive to the testing by increasing representation of data that is not classified correctly; (d) repeating steps (a)-(c) for a number of iterations; and (e) classifying unclassified data using the computer with the weight matrix.
-
Specification