System and process for constructing optimized prototypes for pattern recognition using competitive classification learning
First Claim
1. A method for optimizing prototypes of model characters for character recognition, said prototypes including at least one prototype feature value vector selected for each true class I into which inputted training character samples, of said model characters, are organized, said method comprising the step of:
- for each training character sample, iteratively;
using a processor, electronically determining the nearest class M containing the nearest prototype to said training character sample,using said processor, if said nearest class M is different from said true class I of said training character sample, electronically updating the value of at least some of said prototypes by a value which depends on;
##EQU30## where;
n is the iteration number,OM is the distance from said training character sample to said nearest prototype of said nearest class M,OI is the distance from said training character sample to the nearest prototype of said true class I, andA(n) is a monotonically decreasing bandwidth parameter such that 0<
A(n+1)<
A(n) for all n.
1 Assignment
0 Petitions
Accused Products
Abstract
A system and process for performing character recognition is disclosed wherein inputted characters are compared to prototypes maintained in a predetermined database of the system to determine the best matching character. To generate the prototype database for use in recognition, training character samples are inputted and classified and prototypes, having feature value vectors, are generated for each class. The prototypes are optimized to improve the recognition capabilities of the database. The prototype feature value vectors are updated by only small amounts for abnormal prototypes that are much closer to the nearest class M than to the true class I. In addition, the updating of the prototype feature value vectors is performed so as to minimize an error in the selection of the prototypes. Finally, the distance between a training character sample and a prototype is determined so that features which better distinguish one character from another have a greater weight in determining the distance than those features which do not distinguish one character from another as well.
-
Citations
10 Claims
-
1. A method for optimizing prototypes of model characters for character recognition, said prototypes including at least one prototype feature value vector selected for each true class I into which inputted training character samples, of said model characters, are organized, said method comprising the step of:
-
for each training character sample, iteratively; using a processor, electronically determining the nearest class M containing the nearest prototype to said training character sample, using said processor, if said nearest class M is different from said true class I of said training character sample, electronically updating the value of at least some of said prototypes by a value which depends on;
##EQU30## where;
n is the iteration number,OM is the distance from said training character sample to said nearest prototype of said nearest class M, OI is the distance from said training character sample to the nearest prototype of said true class I, and A(n) is a monotonically decreasing bandwidth parameter such that 0<
A(n+1)<
A(n) for all n. - View Dependent Claims (2, 3)
-
-
4. A method for optimizing prototypes of model characters for character recognition, said prototypes including at least one prototype feature value vector selected for each true class I into which inputted training character samples, of said model characters, are organized said method comprising the step of:
-
for each training character sample, iteratively; using a processor, electronically determining the nearest class M containing the nearest prototype to said training character sample, using said processor, if said nearest class M is different from said true class I of said training character sample, electronically updating the value of at least some of said prototypes by a value which depends on;
##EQU33## where;
n is the iteration number,OM is the distance from said training character sample to said nearest prototype of said nearest class M, OI is the distance from said training character sample to the nearest prototype of said true class I, and A(n) is a monotonically decreasing bandwidth parameter such that 0<
A(n+1)<
A(n) for all n,and further wherein each of said training character samples has a feature value vector x comprising feature values xi, and wherein said step of updating updates each ith feature value rqMMi of said feature value vector rqMM of said qMth prototype feature value vector of said class M by the value;
##EQU34## and updates each ith feature rqIIi value of said feature value vector rqII of said qIth nearest feature value vector of said class I by the value ##EQU35## where;
η
(n) is a monotonically decreasing function which depends on A(n),θ
i is a normalization factor associated with said ith feature.
-
-
5. A method for optimizing prototypes of model characters for character recognition, said prototypes including at least one prototype feature value vector selected for each true class I into which inputted training character samples, of said model characters, are organized, said method comprising the step of:
-
for each training character sample, iteratively; using a processor, electronically determining the nearest class M containing the nearest prototype to said training character sample, using said processor, if said nearest class M is different from said true class I of said training character sample, electronically updating the value of at least some of said prototypes by a value which depends on;
##EQU36## where;
n is the iteration number,OM is the distance from said training character sample to said nearest prototype of said nearest class M, OI is the distance from said training character sample to the nearest prototype of said true class I, and A(n) is a monotonically decreasing bandwidth parameter such that 0<
A(n+1)<
A(n) for all n,and wherein each of said training character samples has a feature value vector x comprising feature values xi where i is an index that takes on a value from 1 to the maximum feature index N, said step of determining further comprising the step of; using said processor, electronically determining the distance yjk between said training character sample feature value vector x and each jth prototype feature value vector rjk of each kth class, where k takes on a value from 1 to the maximum class index K, by evaluating;
##EQU37## where θ
i is a normalization factor associated with said ith feature, and further wherein said step of determining the nearest class M further comprises the step of;using said processor, electronically selecting, for each class k, the smallest distance zk of said distances yjk wherein each class k comprises Bk prototypes, by evaluating;
##EQU38## - View Dependent Claims (6)
-
-
7. A system for optimizing prototypes of model characters for character recognition, said prototypes including at least one prototype feature value vector selected for each true class I;
- said each true class I comprising inputted training character samples, of said model characters, said training character samples, of said model characters being organized into said each true class I;
said system comprising;a processor for, for each training character sample, iteratively, electronically determining the nearest class M containing the nearest prototype to said training character sample, and, if said nearest class M is different from said true class I of said training character sample, electronically updating the value of at least some of said prototypes by a value which depends on;
##EQU40## where;
n is the iteration number,OM is the distance from said training character sample to said nearest prototype of said nearest class M, OI is the distance from said training character sample to the nearest prototype of said true class I, and A(n) is a monotonically decreasing bandwidth parameter such that 0<
A(n+1)<
A(n) for all n.
- said each true class I comprising inputted training character samples, of said model characters, said training character samples, of said model characters being organized into said each true class I;
-
8. A system for optimizing prototypes of model characters for character recognition, said prototypes including at least one prototype feature value vector selected for each true class I into which inputted training character samples, of said model characters, are organized, which training character samples each comprise a feature value vector, said system comprising:
-
a neural network processor comprising a plurality of interconnected, hierarchically organized layers of processing nodes for iteratively processing each training character sample, one at a time, said neural network processor comprising; a first hidden layer, comprising a plurality of processing nodes including one node corresponding to each jth prototype of a class k for receiving each feature value xi of an inputted training character sample feature value vector x and determining a distance yjk from said training character sample feature value vector x to a corresponding jth prototype feature value vector rjk of a kth class, having prototype feature values rjki, wherein each of said distances is determined according to;
##EQU41## where N is the total number of features, 1≦
I≦
N, and θ
i is a normalization factor associated with said ith feature, and wherein a change in θ
i is determined in part by scaling θ
i by a monotonically decreasing feature normalization factor weighting function.
-
-
9. A system for optimizing prototypes of model characters for character recognition, said prototypes including at least one prototype feature value vector selected for each true class I into which inputted training character samples, of said model characters, are organized, which training character samples each comprise a feature value vector, said system comprising:
-
a neural network processor comprising a plurality of interconnected, hierarchically organized layers of processing nodes for iteratively processing each training character sample, one at a time, said neural network processor comprising; a first hidden layer, comprising a plurality of processing nodes including one node corresponding to each jth prototype of a class k for receiving each feature value xi of an inputted training character sample feature value vector x and determining a distance yjk from said training character sample feature value vector x to a corresponding jth prototype feature value vector rjk of a kth class, having prototype feature values rjki, wherein each of said distances is determined according to;
##EQU42## where N is the total number of features, 1<
i<
N, and θ
i is a normalization factor associated with said ith feature, and wherein said neural network processor further comprises;a second hidden layer, comprising a second plurality of processing nodes including one node corresponding to each kth class for receiving only those Bk distances yjk to prototypes of the same kth class as said class to which said kth node of said second hidden layer corresponds, and determining the minimum one zk of said Bk distances yjk, and an output layer comprising a third plurality of nodes including a first node corresponding to a nearest class M, for determining a distance OM from said training character sample to the nearest prototype of said nearest class M by determining the minimum distance of said distances zk, and a second node for determining a distance OI from said training character sample to the nearest prototype of said true class I by selecting said distance zk for k=I, wherein Bk is the total number of prototypes per class K. - View Dependent Claims (10)
-
Specification