Systems and methods for recognizing sound and music signals in high noise and distortion
Abstract
A method for recognizing an audio sample locates an audio file that matches the sample from a database indexing original recordings. Each indexed file is represented in the database index by a set of landmark timepoints and associated fingerprints. Landmarks occur at reproducible locations within the file, while fingerprints represent features of the signal at or near the landmark timepoints, and the fingerprints are invariant with respect to time stretching of the sample. To perform recognition, landmarks and fingerprints are computed for the unknown sample and used to retrieve matching fingerprints from the database. For each file containing matching fingerprints, the landmarks are compared with landmarks of the sample at which the same fingerprints were computed. If a large number of corresponding landmarks are linearly related, i.e., if equivalent fingerprints of the sample and retrieved file have the same time evolution, then the file is identified with the sample.
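The matching step described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: fingerprints are treated as opaque hashable values (their computation is out of scope here), the function names `build_index`, `count_linear_matches`, and `identify` are hypothetical, and the exhaustive pairwise line fit with tolerance `tol` and threshold `min_matches` is a simple stand-in chosen for clarity. Because the sample may be time stretched, the sketch accepts any positive slope in the relation t_file = slope * t_sample + offset rather than assuming a slope of 1.

```python
from collections import defaultdict
from itertools import combinations


def build_index(files):
    """Map each fingerprint to the (file_id, landmark_time) pairs where it occurs."""
    index = defaultdict(list)
    for file_id, pairs in files.items():
        for landmark, fingerprint in pairs:
            index[fingerprint].append((file_id, landmark))
    return index


def count_linear_matches(points, tol=0.05):
    """Return the largest number of (t_sample, t_file) points lying on one line.

    Each pair of points proposes a line t_file = slope * t_sample + offset;
    the line supported by the most points wins.
    """
    best = min(len(points), 1)
    for (s1, f1), (s2, f2) in combinations(points, 2):
        if s1 == s2:
            continue  # same sample time twice: no slope can be computed
        slope = (f2 - f1) / (s2 - s1)
        if slope <= 0:
            continue  # time must run forward in both signals
        offset = f1 - slope * s1
        inliers = sum(1 for s, f in points if abs(slope * s + offset - f) <= tol)
        best = max(best, inliers)
    return best


def identify(sample, index, min_matches=4):
    """Return the best-matching file_id, or None if no file has enough support."""
    candidates = defaultdict(list)
    for t_sample, fingerprint in sample:
        for file_id, t_file in index.get(fingerprint, ()):
            candidates[file_id].append((t_sample, t_file))
    scores = {fid: count_linear_matches(pts) for fid, pts in candidates.items()}
    if not scores:
        return None
    best_file = max(scores, key=scores.get)
    return best_file if scores[best_file] >= min_matches else None


# Made-up data: two indexed files, and a sample taken from "song_a" starting
# at 1.0 s and played 1.25x faster than the original recording speed.
files = {
    "song_a": [(0.0, "f1"), (1.0, "f2"), (2.0, "f3"), (3.0, "f4"), (4.0, "f5")],
    "song_b": [(0.0, "f3"), (1.0, "f9"), (2.0, "f1"), (3.0, "f7"), (5.0, "f2")],
}
index = build_index(files)
sample = [(0.0, "f2"), (0.8, "f3"), (1.6, "f4"), (2.4, "f5")]
result = identify(sample, index)
```

In this toy data, "song_b" shares two fingerprints with the sample, but their landmarks do not fall on a forward-running line, so only "song_a" accumulates enough linearly related landmarks to be reported. The pairwise fit is cubic in the number of matches; a production system would likely use a cheaper voting or histogram scheme.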
20 Claims
1. A method performed by a computing device, the method comprising:
receiving a media sample of media subject to time stretching, wherein the time stretching of the media occurs based on the media being played at a speed different than an original recording speed;
generating a fingerprint of the media sample for content identification that is invariant with respect to time stretching of the media sample; and
performing a content identification of the media sample using the fingerprint by comparing the fingerprint to stored fingerprints in memory of a database index, wherein the stored fingerprints are based on media played at original recording speeds.
Dependent claims: 2-12
13. A system, comprising:
a computing device; and
a database index for storing fingerprints of known media based on media played at original recording speeds;
a memory, the memory configured to store instructions that when executed by the computing device cause the computing device to;
receiving a media sample of media subject to time stretching, wherein the time stretching of the media occurs based on the media being played at a speed different than an original recording speed;
generating a fingerprint of the media sample for content identification that is invariant with respect to time stretching of the media sample; and
performing a content identification of the media sample using the fingerprint by comparing the fingerprint to stored fingerprints in the database index.
Dependent claims: 14-20
Specification