METHODS AND APPARATUS FOR PERFORMING TRANSFORMATION TECHNIQUES FOR DATA CLUSTERING AND/OR CLASSIFICATION
2 Assignments
0 Petitions
Accused Products
Abstract
Some aspects include transforming data, at least a portion of which has been processed to determine frequency information associated with features in the data. Techniques include determining a first transformation based, at least in part, on the frequency information, applying at least the first transformation to the data to obtain transformed data, and fitting a plurality of clusters to the transformed data to obtain a plurality of established clusters. Some aspects include classifying input data by transforming the input data using at least the first transformation and comparing the transformed input data to the established clusters.
-
Citations
72 Claims
-
1-33. -33. (canceled)
-
34. A method of classifying input data as belonging to one of a plurality of classifications, the plurality of classifications associated with a respective plurality of clusters that were fit to training data, the method comprising:
-
obtaining a first transformation used to transform the training data when the plurality of clusters were fit to the training data, the first transformation based, at least in part, on frequency information associated with features that were represented in the training data; transforming the input data using at least the first transformation to obtain transformed input data; comparing the transformed input data to the plurality of clusters to determine which cluster of the plurality of clusters the input data should be associated with; and classifying the input data according to a classification of the plurality of classifications associated with the cluster that the input data was determined to be associated with. - View Dependent Claims (35, 36, 37, 38, 39, 40, 41, 42, 43, 44)
-
-
47. At least one computer readable storage medium storing instructions that, when executed by at least one processor, perform a method of classifying input data as belonging to one of a plurality of classifications, the plurality of classifications associated with a respective plurality of clusters that were fit to training data, the method comprising:
-
obtaining a first transformation used to transform the training data when the plurality of clusters were fit to the training data, the first transformation based, at least in part, on frequency information associated with features that were represented in the training data; transforming the input data using at least the first transformation to obtain transformed input data; comparing the transformed input data to the plurality of clusters to determine which cluster of the plurality of clusters the input data should be associated with; and classifying the input data according to a classification of the plurality of classifications associated with the cluster that the input data was determined to be associated with. - View Dependent Claims (45, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59)
-
-
60. A system for classifying input data as belonging to one of a plurality of classifications, the plurality of classifications associated with a respective plurality of clusters that were fit to training data, the system comprising:
-
at least one computer readable storage medium for storing the input data and for storing a first transformation used to transform the training data when the plurality of clusters were fit to the training data, the first transformation based, at least in part, on frequency information associated with features represented in the training data; and at least one processor capable of accessing the at least one computer readable storage medium, the at least one processor configured to; transform the input data using at least the first transformation to obtain transformed input data; compare the transformed input data to the plurality of clusters to determine which cluster of the plurality of clusters the input data should be associated with; and classify the input data according to a classification of the plurality of classifications associated with the cluster that the input data was determined to be associated with. - View Dependent Claims (61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72)
-
Specification