System and methods for continuous audio matching
First Claim
Patent Images
1. A non-transitory computer readable medium storing code that, when executed by one or more processors, causes the one or more processors to:
- send an audio query to a server;
responsive to the server matching the audio query with a reference item in a database, receive, from the server, an audio fingerprint sequence and an audio identifier associated with a predicted reference audio item;
update a watching cache with the audio fingerprint sequence and the associated audio identifier;
extract an input audio fingerprint from an audio signal; and
match the input audio fingerprint extracted from the audio signal to the audio fingerprint sequence stored in the watching cache and associated with the predicted reference audio item to identify the predicted reference audio item from the audio signal.
8 Assignments
0 Petitions
Accused Products
Abstract
The present invention relates to the continuous monitoring of an audio signal and identification of audio items within an audio signal. The technology disclosed utilizes predictive caching of fingerprints to improve efficiency. Fingerprints are cached for tracking an audio signal with known alignment and for watching an audio signal without known alignment, based on already identified fingerprints extracted from the audio signal. Software running on a smart phone or other battery-powered device cooperates with software running on an audio identification server.
225 Citations
10 Claims
-
1. A non-transitory computer readable medium storing code that, when executed by one or more processors, causes the one or more processors to:
-
send an audio query to a server; responsive to the server matching the audio query with a reference item in a database, receive, from the server, an audio fingerprint sequence and an audio identifier associated with a predicted reference audio item; update a watching cache with the audio fingerprint sequence and the associated audio identifier; extract an input audio fingerprint from an audio signal; and match the input audio fingerprint extracted from the audio signal to the audio fingerprint sequence stored in the watching cache and associated with the predicted reference audio item to identify the predicted reference audio item from the audio signal. - View Dependent Claims (2)
-
-
3. A non-transitory computer readable medium storing code that, when executed by one or more processors, causes the one or more processors to:
-
receive a plurality of reference audio fingerprint sequences into a tracking cache; select, from the plurality of received reference audio fingerprint sequences, a first candidate reference audio fingerprint sequence as a first potential match to an audio signal; select, from the plurality of received reference audio fingerprint sequences, a second candidate reference audio fingerprint sequence as a second potential match to the audio signal; maintain a first tracking alignment between a fingerprint sequence extracted from the audio signal and the first candidate reference audio fingerprint sequence; maintain a second tracking alignment between the fingerprint sequence extracted from the audio signal and the second candidate reference audio fingerprint sequence; and responsive to a failure of the first tracking alignment, resolving ambiguity by confirming that the audio signal comprises the second candidate reference audio fingerprint sequence. - View Dependent Claims (4)
-
-
5. A method of using a user device to monitor an audio signal and identify audio items within the audio signal, the method including:
-
responsive to the user device having sent initial audio fingerprints extracted from the audio signal, identifying an initial audio item in the initial audio fingerprints; responsive to the identification of the initial audio item, (i) updating a cache with one or more audio fingerprint sequences received from a server, the one or more audio fingerprint sequences being from one or more audio items predicted to follow the identified initial audio item, and (ii) updating the cache with respective audio item identifiers for the one or more audio items predicted to follow the identified initial audio item; and matching additional audio fingerprints extracted from the audio signal to the cached one or more audio fingerprint sequences from the one or more audio items predicted to follow the identified initial audio item, to identify an audio item within the audio signal as one of the one or more audio items predicted to follow the identified initial audio item. - View Dependent Claims (6, 7, 8, 9, 10)
-
Specification