KERNELS AND METHODS FOR SELECTING KERNELS FOR USE IN LEARNING MACHINES
3 Assignments
0 Petitions
Accused Products
Abstract
Learning machines, such as support vector machines, are used to analyze datasets to recognize patterns within the dataset using kernels that are selected according to the nature of the data to be analyzed. Where the datasets possesses structural characteristics, locational kernels can be utilized to provide measures of similarity among data points within the dataset. The locational kernels are then combined to generate a decision function, or kernel, that can be used to analyze the dataset. Where an invariance transformation or noise is present, tangent vectors are defined to identify relationships between the invariance or noise and the data points. A covariance matrix is formed using the tangent vectors, then used in generation of the kernel.
26 Citations
29 Claims
-
1-20. -20. (canceled)
-
21. A computer-implemented method for analyzing data comprising a text document to identify patterns in words or characters within the document, the method comprising:
-
inputting the data into a computing environment comprising one or more pre-processing program modules and one or more support vector machine modules stored on a drive or a system memory of a computer or computer network by; dividing the data into a training dataset and a test dataset; defining a kernel for structured data for execution by the one or more support vector machine modules by representing the training dataset as a collection of sequences of words or characters and an index set within the document structure, wherein the indices within the index set correspond to locations of words or characters within the document; applying a vicinity function to the collection of sequences of words or characters to define a plurality of sequences of words or characters centered at different words or characters; measuring similarity of pairs of sequences of words or characters centered at the different indices to define a locational kernel having a value corresponding to each of the different pairs of sequences of words or characters; creating additional locational kernels by performing an operation selected from addition, scalar multiplication, multiplication, pointwise limits, transformation and convolution on the locational kernels; combining the locational kernels and the additional locational kernels for the different sequences of words or characters by performing an operation to produce a kernel on a set of sequences of words or characters; testing the kernel on the test data set having a known set of sequences of words or characters to determine whether an optimal solution has been achieved; if the optimal solution has been achieved, applying the kernel on a set of sequences of words or characters to a document having an unknown structure to identify patterns within, and thereby extract knowledge from, the document; and generating a display of the identified patterns of words or characters within the document having an unknown structure. - View Dependent Claims (22, 23, 24, 25, 26)
-
-
27. A computer-implemented method for analyzing data comprising a text document to identify patterns in words or characters within the document, the method comprising:
-
inputting the data into a memory and a computer processor programmed for executing one or more support vector machines; dividing the data into a training dataset and a test dataset; defining a kernel for structured data for execution by the one or more support vector machines by representing the training dataset as a collection of word or character strings and an index set within the document structure, wherein the indices within the index set correspond to locations of word or character strings within the document; applying a vicinity function to the collection of word or character strings to define a plurality of word or character strings centered at different words or characters; measuring similarity of pairs of word or character strings centered at the different indices to define a locational kernel having a value corresponding to each of the different pairs of word or character strings; creating additional locational kernels by performing an operation selected from addition, scalar multiplication, multiplication, pointwise limits, transformation and convolution on the locational kernels; combining the locational kernels and the additional locational kernels for the different word or character strings by performing an operation to produce a kernel on a set of word or character strings; testing the kernel on the test data set having a known set of word or character strings to determine whether an optimal solution has been achieved; if the optimal solution has been achieved, applying the kernel on a set of word or character strings to a document having an unknown structure to identify patterns within, and thereby extract knowledge from, the document; and generating a display of the identified patterns of words or characters within the document having an unknown structure. - View Dependent Claims (28, 29)
-
Specification