AUDIO CLASSIFICATION FOR INFORMATION RETRIEVAL USING SPARSE FEATURES
First Claim
1. A computer-implemented method comprising:
- generating a collection of auditory images, each auditory image being generated from respective audio files according to an auditory model;
extracting sparse features from each auditory image in the collection to generate a sparse feature vector representing the corresponding audio file; and
ranking the audio files in response to a query including one or more words using the sparse feature vectors and a matching function relating sparse feature vectors to words in the query.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods, systems, and apparatus, including computer programs encoded on computer storage media, are provided for using audio features to classify audio for information retrieval. In general, one aspect of the subject matter described in this specification can be embodied in methods that include the actions of generating a collection of auditory images, each auditory image being generated from respective audio files according to an auditory model; extracting sparse features from each auditory image in the collection to generate a sparse feature vector representing the corresponding audio file; and ranking the audio files in response to a query including one or more words using the sparse feature vectors and a matching function relating sparse feature vectors to words in the query.
55 Citations
33 Claims
-
1. A computer-implemented method comprising:
-
generating a collection of auditory images, each auditory image being generated from respective audio files according to an auditory model; extracting sparse features from each auditory image in the collection to generate a sparse feature vector representing the corresponding audio file; and ranking the audio files in response to a query including one or more words using the sparse feature vectors and a matching function relating sparse feature vectors to words in the query. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A computer-implemented method comprising:
-
receiving a text query, the query including one or more query terms; retrieving a matching function that relates keywords and sparse feature vectors, each sparse feature vector being derived from a particular audio file; identifying one or more keywords from the query terms; identifying one or more audio files responsive to the query using the matching function; and presenting search results identifying the one or more audio files.
-
-
12. A computer storage medium encoded with a computer program, the program comprising instructions that when executed by data processing apparatus cause the data processing apparatus to perform operations comprising:
-
generating a collection of auditory images, each auditory image being generated from respective audio files according to an auditory model; extracting sparse features from each auditory image in the collection to generate a sparse feature vector representing the corresponding audio file; and ranking the audio files in response to a query including one or more words using the sparse feature vectors and a matching function relating sparse feature vectors to words in the query. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20, 21)
-
-
22. A computer storage medium encoded with a computer program, the program comprising instructions that when executed by data processing apparatus cause the data processing apparatus to perform operations comprising:
-
receiving a text query, the query including one or more query terms; retrieving a matching function that relates keywords and sparse feature vectors, each sparse feature vector being derived from a particular audio file; identifying one or more keywords from the query terms; identifying one or more audio files responsive to the query using the matching function; and presenting search results identifying the one or more audio files.
-
-
23. A system comprising:
one or more computers configured to perform operations including; generating a collection of auditory images, each auditory image being generated from respective audio files according to an auditory model; extracting sparse features from each auditory image in the collection to generate a sparse feature vector representing the corresponding audio file; and ranking the audio files in response to a query including one or more words using the sparse feature vectors and a matching function relating sparse feature vectors to words in the query. - View Dependent Claims (24, 25, 26, 27, 28, 29, 30, 31, 32)
-
33. A system comprising:
one or more computers configured to perform operations including; receiving a text query, the query including one or more query terms; retrieving a matching function that relates keywords and sparse feature vectors, each sparse feature vector being derived from a particular audio file; identifying one or more keywords from the query terms; identifying one or more audio files responsive to the query using the matching function; and presenting search results identifying the one or more audio files.
Specification