Method and system for selecting pattern recognition training vectors
First Claim
Patent Images
1. In a computer, a method of selecting a plurality of training vectors for a pattern recognition system, the method comprising the following steps:
- receiving a plurality of example signals representing a plurality of classes within an example space, each of the plurality of example signals being associated with a respective class;
defining a plurality of clusters, each of the plurality of clusters being associated with one of the plurality of classes;
assigning the plurality of example signals to the plurality of clusters as a function of a plurality of cluster-example distances;
determining whether at least one decision boundary needs greater definition, if so, increasing in number the plurality of clusters and repeating the step of assigning;
selecting the plurality of training vectors by sampling of the plurality of example signals from each of the plurality of clusters; and
fitting a polynomial expansion to the plurality of training vectors using a least squares estimate, wherein the polynomial expansion has a form ##EQU3## wherein x represents at least one element of a training vector, i, j, and n are integers, y represents a discriminant signal, a0 represents a zero-order coefficient, bi represents a first-order coefficient, and cij represents a second-order coefficient.
2 Assignments
0 Petitions
Accused Products
Abstract
A computer-based method and system selects a plurality of training vectors for a pattern recognition system by creating a plurality of clusters and then uniformly sampling the clusters. Each of the clusters is associated with a particular class and includes a plurality of example signals. The example signals are assigned to the clusters as a function of cluster-example distances. If a cluster includes one or more overlapping example signals, the number of clusters associated with the overlapping cluster is increased and the example signals are re-assigned to the clusters.
-
Citations
32 Claims
-
1. In a computer, a method of selecting a plurality of training vectors for a pattern recognition system, the method comprising the following steps:
-
receiving a plurality of example signals representing a plurality of classes within an example space, each of the plurality of example signals being associated with a respective class; defining a plurality of clusters, each of the plurality of clusters being associated with one of the plurality of classes; assigning the plurality of example signals to the plurality of clusters as a function of a plurality of cluster-example distances; determining whether at least one decision boundary needs greater definition, if so, increasing in number the plurality of clusters and repeating the step of assigning; selecting the plurality of training vectors by sampling of the plurality of example signals from each of the plurality of clusters; and fitting a polynomial expansion to the plurality of training vectors using a least squares estimate, wherein the polynomial expansion has a form ##EQU3## wherein x represents at least one element of a training vector, i, j, and n are integers, y represents a discriminant signal, a0 represents a zero-order coefficient, bi represents a first-order coefficient, and cij represents a second-order coefficient. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. In a computer, a method of selecting a plurality of training vectors for a pattern recognition system, the method comprising the following steps:
-
receiving a plurality of example signals representing a plurality of classes within an example space, each of the plurality of example signals being associated with a respective class; generating a plurality of cluster centroids by applying a K-means clustering algorithm to the example signals, each of the cluster centroids corresponding to one of the classes; assigning each of the plurality of example signals to a closest one of the plurality of cluster centroids by performing a nearest-neighbor search between the plurality of cluster centroids and the plurality of example signals, whereby producing a plurality of clusters; determining whether at least one of the clusters includes a quantity of overlapping example signals that exceeds an overlap threshold, if so, increasing in number the plurality of cluster centroids and repeating the step of assigning; selecting the plurality of training vectors by a uniform sampling of the plurality of example signals from each of the plurality of clusters; and fitting a polynomial expansion to the plurality of training vectors using a least squares estimate wherein the polynomial expansion has a form ##EQU4## wherein x represents at least one element of a training vector, i, j, and n are integers, y represents a discriminant signal, a0 represents a zero-order coefficient, bi represents a first-order coefficient, and cij represents a second-order coefficient. - View Dependent Claims (12, 13, 14, 15)
-
-
16. A computer system for selecting a plurality of training vectors for a pattern recognition system, which comprises:
-
an input interface for receiving a plurality of example signals representing a plurality of classes within an example space, each of the plurality of example signals being associated with a respective class; a cluster processor, operatively associated with the input interface, for defining a plurality of clusters, each of the plurality of clusters being associated with one of the plurality of classes; a sorter, operatively associated with the cluster processor, for assigning the plurality of example signals to the plurality of clusters as a function of a plurality of cluster-example distances; a comparator, operatively associated with the sorter, for determining whether at least one of the clusters includes a quantity of overlapping example signals that exceeds an overlap threshold, if so, the comparator increasing in number the plurality of clusters associated with the overlapping example signals; a selector, operatively associated with the comparator, for selecting the plurality of training vectors by a uniform sampling of the plurality of example signals from each of the plurality of clusters; wherein the sorter re-assigns the plurality of example signals to the plurality of clusters in response to the comparator increasing in number the plurality of clusters; and regression means for fitting a polynomial expansion to the plurality of training vectors using a least squares estimate, wherein the polynomial expansion has a form ##EQU5## wherein x represents at least one element of a training vector, i, j, and n are integers, y represents a discriminant signal, a0 represents a zero-order coefficient, bi represents a first-order coefficient, and cij represents a second-order coefficient. - View Dependent Claims (17, 18, 19, 20, 21)
-
-
22. An article of manufacture, which comprises:
-
a computer-readable memory usable for causing a computer to select a plurality of training vectors for a pattern recognition system, the computer-readable memory having a structure defined by storing a computer program in the computer-readable memory; wherein the computer program includes a method for selecting the plurality of training vectors, the method comprising the following steps; receiving a plurality of example signals representing a plurality of classes within an example space, each of the plurality of example signals being associated with a respective class; defining a plurality of clusters, each of the plurality of clusters being associated with one of the plurality of classes; assigning the plurality of example signals to the plurality of clusters as a function of a plurality of cluster-example distances; determining whether at least one of the clusters includes a quantity of overlapping example signals that exceeds an overlap threshold, if so, increasing in number the plurality of clusters and repeating the step of assigning; selecting the plurality of training vectors by a uniform sampling of the plurality of example signals from each of the plurality of clusters; and fitting a polynomial expansion to the plurality of training vectors using a least squares estimate. - View Dependent Claims (23, 24, 25, 26, 27, 28)
-
-
29. A computer system for selecting a plurality of training vectors for a pattern recognition system, which comprises:
-
an input interface for receiving a plurality of example signals representing a plurality of classes within an example space, each of the plurality of example signals being associated with a respective class; a cluster processor, operatively associated with the input interface, for defining a plurality of clusters, each of the plurality of clusters being associated with one of the plurality of classes; a sorter, operatively associated with the cluster processor, for assigning the plurality of example signals to the plurality of clusters as a function of a plurality of cluster-example distances; a comparator, operatively associated with the sorter, for determining whether at least one of the clusters includes a quantity of overlapping example signals that exceeds an overlap threshold; a selector, operatively associated with the comparator, for selecting the plurality of training vectors by a uniform sampling of the plurality of example signals from each of the plurality of clusters; wherein the sorter re-assigns the plurality of example signals to the plurality of clusters in response to the comparator increasing in number the plurality of clusters; and a regression fitting module for fitting a polynomial expansion to the plurality of training vectors, wherein the polynomial expansion is a function of at least one element of a training vector and a plurality of coefficients, at least one of the plurality of coefficients comprising a second-order coefficient. - View Dependent Claims (30, 31, 32)
-
Specification