System and methods for recognizing sound and music signals in high noise and distortion
4 Assignments
0 Petitions
Accused Products
Abstract
A method for recognizing an audio sample locates an audio file that most closely matches the audio sample from a database indexing a large set of original recordings. Each indexed audio file is represented in the database index by a set of landmark timepoints and associated fingerprints. Landmarks occur at reproducible locations within the file, while fingerprints represent features of the signal at or near the landmark timepoints. To perform recognition, landmarks and fingerprints are computed for the unknown sample and used to retrieve matching fingerprints from the database. For each file containing matching fingerprints, the landmarks are compared with landmarks of the sample at which the same fingerprints were computed. If a large number of corresponding landmarks are linearly related, i.e., if equivalent fingerprints of the sample and retrieved file have the same time evolution, then the file is identified with the sample. The method can be used for any type of sound or music, and is particularly effective for audio signals subject to linear and nonlinear distortion such as background noise, compression artifacts, or transmission dropouts. The sample can be identified in a time proportional to the logarithm of the number of entries in the database; given sufficient computational power, recognition can be performed in nearly real time as the sound is being sampled.
-
Citations
125 Claims
-
1-106. -106. (canceled)
-
107. A method for constructing database index for a database of media samples, comprising:
-
landmarking each media sample to generate a list of timepoints;
computing a fingerprint at or near each landmark, the finger print and corresponding landmark forming a landmark/fingerprint pair; and
generating an index set for the media sample, the index set including a list of at least one of the landmark/fingerprint pairs. - View Dependent Claims (108, 109, 110, 111, 112, 113, 114, 115, 116, 117)
-
-
118. A method for recognizing a media entity from a media sample, comprising:
-
generating correspondences between landmarks of the media sample and corresponding landmarks in a database index, the database index including landmarks and fingerprints for a plurality of media entities, wherein the landmarks of the media sample and the corresponding landmarks of database index have equivalent fingerprints; and
identifying a particular media entity from the plurality of media entities which matches the media sample, if a plurality of said correspondences between the media sample and the particular media entity have a relationship. - View Dependent Claims (119, 120, 121)
-
-
122. A method of recognizing a media sample, comprising:
-
identifying media files that have file landmarks that are substantially linearly related to sample landmarks of the media sample;
wherein the file landmarks and the sample landmarks have equivalent fingerprints; and
wherein the file landmarks the said sample landmarks have a correspondence. - View Dependent Claims (123, 124, 125)
-
Specification