Classification system and method using combined information testing
First Claim
1. An information classification system combining training data and test data to classify a source which comprises:
- a vector processor formulating a plurality of feature vectors depicting characteristics of said test data and for receiving said plurality of feature vectors wherein each of said plurality of feature vectors is quantized to one of M symbols and for generating a quantized test vector y having as components the number of occurrences of each of the M symbols in said plurality of quantized feature vectors received;
means for storing quantized training data, said quantized training data having one quantized training vector xc for each class of a plurality of output classes; and
a classification processor, responsive to said quantized test vector and said quantized training data, for estimating sysmbol probabilities for each output class and for classifying the quantized test vector y unto one of said plurity output classes, wherein said classification processor estimates the symbol probabilities for each output class and classifies the quantized test vector using a combined Bayes test given by ##EQU15## wherein Xk,i is the number of occurrences of the ith symbol in a quantized training vector for class k;
xl,i is the number of occurrences of the ith symbol in a quantized training vector for class l;
yi is the number of occurrences of the ith symbol in the quantized test vector;
##EQU16## is the total number of occurrences of the M symbols in the training data for class k;
##EQU17## is the total number of occurrences of the M symbols in the training data for class l; and
##EQU18## is the total number of occurrences of the M symbols in the quantized feature vectors.
4 Assignments
0 Petitions
Accused Products
Abstract
A classification system uses sensors to obtain information from which feaes which characterized a source or object to classified can be extracted. The features are extracted from the information and compiled into a feature vector which is then quantized to one of one of M discrete symbols. After N feature vectors have been quantized, a test vector having components which are defined by the number of occurrences of each of the M symbols in N the quantized vectors is built. The system combines the test vector with training data to simultaneously estimate symbol probabilities for each class and classify the test vector using a decision rule that depends only on the training and test data. The system classifies the test vector using either a Combined Bayes test or a Combined Generalized likelihood ratio test.
-
Citations
2 Claims
-
1. An information classification system combining training data and test data to classify a source which comprises:
-
a vector processor formulating a plurality of feature vectors depicting characteristics of said test data and for receiving said plurality of feature vectors wherein each of said plurality of feature vectors is quantized to one of M symbols and for generating a quantized test vector y having as components the number of occurrences of each of the M symbols in said plurality of quantized feature vectors received; means for storing quantized training data, said quantized training data having one quantized training vector xc for each class of a plurality of output classes; and a classification processor, responsive to said quantized test vector and said quantized training data, for estimating sysmbol probabilities for each output class and for classifying the quantized test vector y unto one of said plurity output classes, wherein said classification processor estimates the symbol probabilities for each output class and classifies the quantized test vector using a combined Bayes test given by ##EQU15## wherein Xk,i is the number of occurrences of the ith symbol in a quantized training vector for class k;
xl,i is the number of occurrences of the ith symbol in a quantized training vector for class l;
yi is the number of occurrences of the ith symbol in the quantized test vector;
##EQU16## is the total number of occurrences of the M symbols in the training data for class k;
##EQU17## is the total number of occurrences of the M symbols in the training data for class l; and
##EQU18## is the total number of occurrences of the M symbols in the quantized feature vectors.
-
-
2. An information classification system combining training data and test data to classify a source which comprises:
-
a vector processor formulating a plurality of feature vectors depicting characteristics of said test data and for receiving said plurality of feature vectors wherein each of said plurality of feature vectors is quantized to one of M symbols and for generating a quantized test vector y having as components the number of occurrences of each of the M symbols in said plurality of quantized feature vectors received; means for storing quantized training data, said quantized training data having one quantized training vector xc for each class of a plurality of output classes; and a classification processor, responsive to said quantized test vector and said quantized training data, for estimating sysmbol probabilities for each output class and for classifying the quantized test vector y unto one of said plurity output classes, wherein said classification processor estimates the symbol probabilities for each one of C output classes and classifies the quantized test vector into one of the C output classes using a combined Bayes test given by ##EQU19## in which xk,i is the number of occurrences of the ith symbol in a quantized training vector for class k, yi is the number of occurrences of the ith symbol in the quantized test vector, Nk is the total number of occurrences of the M symbols in the training data for class k, Nl is the total number of occurrences of the M symbols in the training data for class l, and is the total number of occurrences of the M symbols in the quantized feature vectors.
-
Specification