Methods and apparatus for performing transformation techniques for data clustering and/or classification
First Claim
1. A method of classifying input data representing speech input by a user of a speech application as belonging to one of a plurality of classifications, the plurality of classifications associated with a respective plurality of clusters that were fit to training data representing a plurality of speech utterances associated with the speech application, the method comprising:
- using at least one processor to perform;
obtaining a first transformation generated based, at least in part, on a plurality of representative vectors determined from the training data, the plurality of representative vectors including at least one representative vector determined for each of the plurality of classifications, wherein the plurality of clusters were fit to the training data at least in part by fitting the plurality of clusters to transformed training data obtained by applying the first transformation to the training data;
transforming the input data using at least the first transformation to obtain transformed input data representing the speech input by the user of the speech application;
comparing the transformed input data to the plurality of clusters to determine which cluster of the plurality of clusters the input data should be associated with; and
classifying the input data according to a classification of the plurality of classifications associated with the determined cluster.
2 Assignments
0 Petitions
Accused Products
Abstract
Some aspects include transforming data, at least a portion of which has been processed to determine at least one representative vector associated with each of a plurality of classifications associated with the data to obtain a plurality of representative vectors. Techniques comprise determining a first transformation based, at least in part, on the plurality of representative vectors, applying at least the first transformation to the data to obtain transformed data, and fitting a plurality of clusters to the transformed data to obtain a plurality of established clusters. Some aspects include classifying input data by transforming the input data using at least the first transformation and comparing the transformed input data to the established clusters.
51 Citations
24 Claims
-
1. A method of classifying input data representing speech input by a user of a speech application as belonging to one of a plurality of classifications, the plurality of classifications associated with a respective plurality of clusters that were fit to training data representing a plurality of speech utterances associated with the speech application, the method comprising:
using at least one processor to perform; obtaining a first transformation generated based, at least in part, on a plurality of representative vectors determined from the training data, the plurality of representative vectors including at least one representative vector determined for each of the plurality of classifications, wherein the plurality of clusters were fit to the training data at least in part by fitting the plurality of clusters to transformed training data obtained by applying the first transformation to the training data; transforming the input data using at least the first transformation to obtain transformed input data representing the speech input by the user of the speech application; comparing the transformed input data to the plurality of clusters to determine which cluster of the plurality of clusters the input data should be associated with; and classifying the input data according to a classification of the plurality of classifications associated with the determined cluster. - View Dependent Claims (2, 3, 4, 5, 6, 7, 22)
-
8. At least one non-transitory computer readable storage medium for storing instructions that, when executed on at least one processor, perform a method of classifying input data representing speech input by a user of a speech application as belonging to one of a plurality of classifications, the plurality of classifications associated with a respective plurality of clusters that were fit to training data representing a plurality of speech utterances associated with the speech application, the method comprising:
-
obtaining a first transformation generated based, at least in part, on a plurality of representative vectors determined from the training data, the plurality of representative vectors including at least one representative vector determined for each of the plurality of classifications, wherein the plurality of clusters were fit to the training data at least in part by fitting the plurality of clusters to transformed training data obtained by applying the first transformation to the training data; transforming the input data using at least the first transformation to obtain transformed input data representing the speech input by the user of the speech application; comparing the transformed input data to the plurality of clusters to determine which cluster of the plurality of clusters the input data should be associated with; and classifying the input data according to a classification of the plurality of classifications associated with the determined cluster. - View Dependent Claims (9, 10, 11, 12, 13, 14, 23)
-
-
15. A system for classifying input data representing speech input by a user of a speech application as belonging to one of a plurality of classifications, the plurality of classifications associated with a respective plurality of clusters that were fit to training data representing a plurality of speech utterances associated with the speech application, the method comprising:
-
at least one computer readable storage medium for storing the input data and for storing a first transformation generated based, at least in part, on a plurality of representative vectors determined from the training data, the plurality of representative vectors including at least one representative vector determined for each of the plurality of classifications, wherein the plurality of clusters were fit to the training data at least in part by fitting the plurality of clusters to transformed training data obtained by applying the first transformation to the training data; and at least one processor capable of accessing the at least one computer readable storage medium, the at least one processor configured to; transform the input data using at least the first transformation to obtain transformed input data representing the speech input by the user of the speech application; compare the transformed input data to the plurality of clusters to determine which cluster of the plurality of clusters the input data should be associated with; and classify the input data according to a classification of the plurality of classifications associated with the determined cluster. - View Dependent Claims (16, 17, 18, 19, 20, 21, 24)
-
Specification