Kernels for identifying patterns in datasets containing noise or transformation invariances
First Claim
Patent Images
1. A computer-implemented method for analyzing data containing a noise component, the method comprising:
- inputting the data into a computing environment comprising one or more pre-processing program modules and one or more support vector machine modules stored on a drive or a system memory of a computer or computer network;
dividing the data into a training dataset and a test dataset;
associating each datapoint within the training dataset with a tangent vector by applying a local transformation by the noise component to the datapoint;
mapping the training dataset and the tangent vectors into feature space;
in feature space, training a support vector machine comprising a kernel function to calculate a hyperplane for separating the training dataset into two or more classes, wherein the hyperplane has a normal vector that is orthogonal to the tangent vectors;
testing the trained support vector machine using the test dataset to determine whether an optimal solution has been achieved;
if the optimal solution has been achieved, inputting a new dataset having unknown classifications into the support vector machine; and
generating an output comprising an identification of patterns identified in the new dataset to one or more of the system memory, the drive, an external memory, and a display device.
3 Assignments
0 Petitions
Accused Products
Abstract
Learning machines, such as support vector machines, are used to analyze datasets to recognize patterns within the dataset using kernels that are selected according to the nature of the data to be analyzed. Where the datasets include an invariance transformation or noise, tangent vectors are defined to identify relationships between the invariance or noise and the training data points. A covariance matrix is formed using the tangent vectors, then used in generation of the kernel, which may be based on a kernel PCA map.
22 Citations
16 Claims
-
1. A computer-implemented method for analyzing data containing a noise component, the method comprising:
-
inputting the data into a computing environment comprising one or more pre-processing program modules and one or more support vector machine modules stored on a drive or a system memory of a computer or computer network; dividing the data into a training dataset and a test dataset; associating each datapoint within the training dataset with a tangent vector by applying a local transformation by the noise component to the datapoint; mapping the training dataset and the tangent vectors into feature space; in feature space, training a support vector machine comprising a kernel function to calculate a hyperplane for separating the training dataset into two or more classes, wherein the hyperplane has a normal vector that is orthogonal to the tangent vectors; testing the trained support vector machine using the test dataset to determine whether an optimal solution has been achieved; if the optimal solution has been achieved, inputting a new dataset having unknown classifications into the support vector machine; and generating an output comprising an identification of patterns identified in the new dataset to one or more of the system memory, the drive, an external memory, and a display device. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A computer-implemented method for analyzing data containing a noise component, the method comprising:
-
generating a training dataset comprising measurements taken on a sample; generating p additional measurements on the sample; inputting the training dataset and the p additional measurements into a computing environment comprising one or more pre-processing program modules and one or more support vector machine modules stored on a drive or a system memory of a computer or computer network; constructing p noise vectors; using a kernel PCA map, mapping the training dataset and the p noise vectors to span a subset in feature space; training a support vector machine comprising a kernel function to compute a decision boundary for separating the training dataset wherein substantially all of the p noise vectors are orthogonal to the decision boundary; testing the trained support vector machine using the test dataset to determine whether an optimal solution has been achieved; if the optimal solution has been achieved, inputting a new dataset having unknown classifications into the support vector machine; and generating an output comprising an identification of patterns identified in the new dataset to one or more of the system memory, the drive, an external memory, and a display device. - View Dependent Claims (8, 9, 10, 11)
-
-
12. A computer program product embodied on a computer readable medium for predicting patterns in data containing a noise component, the computer program product comprising instructions for executing support vector machine classifiers and further for causing a computer processor to:
-
receive each of a training dataset comprising measurements taken on a sample and p additional measurements on the sample; construct p noise vectors; use a kernel PCA map to map the training dataset and the p noise vectors to span a subset in feature space; train a support vector machine comprising a kernel function to compute a decision boundary for separating the training dataset wherein substantially all of the p noise vectors are orthogonal to the decision boundary; test the trained support vector machine using a test dataset to determine whether an optimal solution has been achieved; if the optimal solution has been achieved, receive a new dataset having unknown classifications into the support vector machine; and generate an output comprising an identification of patterns identified in the new dataset to one or more of the system memory, the drive, an external memory, and a display device. - View Dependent Claims (13, 14, 15, 16)
-
Specification