System and methods for recognizing sound and music signals in high noise and distortion
Abstract
A method for recognizing an audio sample locates an audio file that most closely matches the audio sample from a database indexing a large set of original recordings. Each indexed audio file is represented in the database index by a set of landmark timepoints and associated fingerprints. Landmarks occur at reproducible locations within the file, while fingerprints represent features of the signal at or near the landmark timepoints. To perform recognition, landmarks and fingerprints are computed for the unknown sample and used to retrieve matching fingerprints from the database. For each file containing matching fingerprints, the landmarks are compared with landmarks of the sample at which the same fingerprints were computed. If a large number of corresponding landmarks are linearly related, i.e., if equivalent fingerprints of the sample and retrieved file have the same time evolution, then the file is identified with the sample. The method can be used for any type of sound or music, and is particularly effective for audio signals subject to linear and nonlinear distortion such as background noise, compression artifacts, or transmission dropouts. The sample can be identified in a time proportional to the logarithm of the number of entries in the database; given sufficient computational power, recognition can be performed in nearly real time as the sound is being sampled.
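The matching step described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation. It assumes integer landmark times, integer fingerprints, and a linear relation of slope one (no time stretching), so that corresponding landmarks differ by a constant offset. The names `build_index`, `best_match`, and `min_votes` are hypothetical, and a hash table stands in for the sorted index that would give the logarithmic lookup time mentioned in the abstract.

```python
from collections import Counter, defaultdict


def build_index(files):
    """Map each fingerprint to the (file_id, landmark) pairs where it occurs.

    files: {file_id: [(landmark_time, fingerprint), ...]}
    """
    index = defaultdict(list)
    for file_id, pairs in files.items():
        for landmark, fingerprint in pairs:
            index[fingerprint].append((file_id, landmark))
    return index


def best_match(index, sample_pairs, min_votes=3):
    """Return (file_id, votes) for the file whose landmarks line up best.

    For every fingerprint shared by the sample and a file, the offset
    file_landmark - sample_landmark is recorded. Many matches with the
    same offset mean the landmarks are linearly related (slope one).
    """
    votes = defaultdict(Counter)
    for sample_landmark, fingerprint in sample_pairs:
        for file_id, file_landmark in index.get(fingerprint, ()):
            votes[file_id][file_landmark - sample_landmark] += 1

    best_id, best_score = None, 0
    for file_id, histogram in votes.items():
        _, count = histogram.most_common(1)[0]  # tallest offset bin
        if count > best_score:
            best_id, best_score = file_id, count
    if best_score < min_votes:
        return None, best_score
    return best_id, best_score


# Toy data (illustrative only): the sample is an excerpt of "song_a"
# starting at time 20, so all three matches share the offset 20.
files = {
    "song_a": [(10, 111), (20, 222), (30, 333), (40, 444)],
    "song_b": [(5, 222), (50, 111), (90, 333)],
}
sample = [(0, 222), (10, 333), (20, 444)]
result = best_match(build_index(files), sample)
```

In this toy data, "song_b" also contains two of the sample's fingerprints, but at inconsistent offsets, so it collects only one vote per offset and is not selected.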
20 Claims
1. A method for characterizing a media sample, the method being implemented in a computer and comprising:
determining a first landmark and a second landmark via the computer, each landmark corresponding to a reproducible timepoint in the media sample;
determining a fingerprint, the fingerprint comprising a value corresponding to a vector between the first landmark and the second landmark; and
associating the fingerprint with the first landmark to form a landmark/fingerprint pair.
Dependent claims: 2, 3, 4, 5, 6, 7.
8. A method for characterizing a media sample, the method being implemented in a computer and comprising:
landmarking the media sample via the computer to generate a list of reproducible timepoints;
determining a first vector between a first landmark and a second landmark;
determining a second vector between the first landmark and a third landmark;
constructing a fingerprint, the fingerprint comprising the first vector and the second vector; and
associating the fingerprint with the first landmark to form a landmark/fingerprint pair.
Dependent claims: 9, 10, 11, 12, 13.
14. A method of characterizing a media sample, the method being implemented in a computer and comprising:
generating a spectrogram via the computer of the media sample;
identifying salient points, each salient point comprising at least a time coordinate and a frequency coordinate;
assigning at least one of the salient points as an anchor point;
determining a landmark from the time coordinate of the anchor point; and
determining a fingerprint.
Dependent claims: 15, 16, 17, 18, 19, 20.
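The characterization steps recited in claims 1 and 14 can be sketched as follows. This is an illustrative sketch under stated assumptions, not the claimed method itself. It assumes the spectrogram is already available as a list of time frames, each a list of magnitudes per frequency bin. It takes salient points to be strict local maxima, and takes the fingerprint to be the (time difference, frequency difference) vector from an anchor point to a later salient point. The names `salient_points`, `landmark_fingerprint_pairs`, `max_dt`, and `fan_out` are hypothetical.

```python
def salient_points(spectrogram, threshold=0.0):
    """Return (time, freq) cells that are strict local maxima.

    spectrogram[t][f] is a magnitude. A cell qualifies when it exceeds
    the threshold and every one of its (up to 8) neighbours.
    """
    n_t = len(spectrogram)
    n_f = len(spectrogram[0]) if n_t else 0
    points = []
    for t in range(n_t):
        for f in range(n_f):
            value = spectrogram[t][f]
            if value <= threshold:
                continue
            neighbours = [
                spectrogram[tt][ff]
                for tt in range(max(0, t - 1), min(n_t, t + 2))
                for ff in range(max(0, f - 1), min(n_f, f + 2))
                if (tt, ff) != (t, f)
            ]
            if all(value > n for n in neighbours):
                points.append((t, f))
    return points


def landmark_fingerprint_pairs(points, max_dt=3, fan_out=2):
    """Pair each anchor point with up to fan_out later salient points.

    The landmark is the anchor's time coordinate. The fingerprint is
    the vector (dt, df) from the anchor to the second point (an
    assumption made for this sketch).
    """
    ordered = sorted(points)
    pairs = []
    for i, (t1, f1) in enumerate(ordered):
        targets = [
            (t2, f2) for t2, f2 in ordered[i + 1:] if 0 < t2 - t1 <= max_dt
        ][:fan_out]
        for t2, f2 in targets:
            pairs.append((t1, (t2 - t1, f2 - f1)))
    return pairs


# Toy spectrogram (5 time frames x 4 frequency bins) with two peaks.
spec = [
    [0, 0, 0, 0],
    [0, 9, 0, 0],
    [0, 0, 0, 0],
    [0, 0, 0, 8],
    [0, 0, 0, 0],
]
peaks = salient_points(spec)
pairs = landmark_fingerprint_pairs(peaks)
```

A fingerprint in the sense of claim 8 could be formed in the same way by combining the vectors from one anchor to two different later points into a single value.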
Specification