Content recognition employing fingerprinting at different resolutions
First Claim
Patent Images
1. A method of content recognition comprising:
- receiving an audio signal captured from a microphone;
sampling the audio signal at a first resolution to provide a first sampled audio signal;
with a processor, computing at least a first audio fingerprint from the first sampled audio signal;
transferring the first audio fingerprint to a remote server, and in response, receiving metadata identifying a TV show, advertisement, movie or song from which the audio signal has been captured;
sampling the audio signal at a second resolution to provide a second sampled audio signal;
with a processor, computing at least a second audio fingerprint from the second sampled audio signal;
transferring the second audio fingerprint to a remote server, and in response to transferring the second audio fingerprint, receiving metadata to distinguish between distinct versions of the TV show, advertisement, movie or song from which the audio signal has been captured;
wherein the first and second fingerprints are derived from first and second different resolutions of the audio signal, corresponding to fingerprint databases corresponding to the first and second resolutions.
0 Assignments
0 Petitions
Accused Products
Abstract
Content fingerprints and watermarks are combined in various ways for content identification applications. Fingerprints are used to identify content generally while watermarks provide more detailed localization of parts within the content, and vice versa. Fingerprint techniques are further used for signal synchronization and other pre-processing steps to assist in digital watermark decoding. A variety of fingerprint/watermark techniques identify characteristics of the channel of content from content samples.
-
Citations
19 Claims
-
1. A method of content recognition comprising:
-
receiving an audio signal captured from a microphone; sampling the audio signal at a first resolution to provide a first sampled audio signal; with a processor, computing at least a first audio fingerprint from the first sampled audio signal; transferring the first audio fingerprint to a remote server, and in response, receiving metadata identifying a TV show, advertisement, movie or song from which the audio signal has been captured; sampling the audio signal at a second resolution to provide a second sampled audio signal; with a processor, computing at least a second audio fingerprint from the second sampled audio signal; transferring the second audio fingerprint to a remote server, and in response to transferring the second audio fingerprint, receiving metadata to distinguish between distinct versions of the TV show, advertisement, movie or song from which the audio signal has been captured;
wherein the first and second fingerprints are derived from first and second different resolutions of the audio signal, corresponding to fingerprint databases corresponding to the first and second resolutions. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A method of content recognition comprising:
-
receiving an audio signal captured from a microphone; sampling the audio signal at a first resolution to provide a first sampled audio signal; with a processor, computing at least a first audio fingerprint from the first sampled audio signal; transferring the first audio fingerprint to a remote server, and in response to transferring the first audio fingerprint, receiving metadata identifying a TV show, advertisement, movie or song from which the audio signal has been captured; sampling the audio signal at a second resolution to provide a second sampled audio signal; with a processor, computing at least a second audio fingerprint from the second sampled audio signal; transferring the second audio fingerprint to a remote server, and in response to transferring the second audio fingerprint, receiving metadata to distinguish between distinct versions of the TV show, advertisement, movie or song from which the audio signal has been captured; wherein at least the second fingerprint corresponds to a distinct pre-distorted version of the TV show, advertisement, movie or song from which the audio signal has been captured. - View Dependent Claims (7, 8, 9, 10, 11)
-
-
12. A system for content recognition comprising:
-
a microphone for capturing an audio signal; one or more processors programmed to; sample the audio signal at a first resolution to provide a first sampled audio signal; compute at least a first audio fingerprint from the first sampled audio signal; transfer the at least first audio fingerprint to a remote server, and in response to the transfer of the first audio fingerprint, receive metadata identifying a TV show, advertisement, movie or song from which the audio signal has been captured; sample the audio signal at a second resolution to provide a second sampled audio signal; compute at least a second audio fingerprint from the second sampled audio signal; and transfer the second audio fingerprint to a remote server, and in response to the transfer of the second audio fingerprint, receive metadata to distinguish between distinct versions of the TV show, advertisement, movie or song from which the audio signal has been captured;
the one or more processors being programmed to derive the first and second fingerprints from first and second different resolutions of the audio signal, the first and second fingerprints having corresponding databases of fingerprints for the first and second resolutions. - View Dependent Claims (13, 14, 15)
-
-
16. A system for content recognition comprising:
-
a microphone for capturing an audio signal; one or more processors programmed to; sample the audio signal at a first resolution to provide a first sampled audio signal; compute at least a first audio fingerprint from the first sampled audio signal; transfer the at least first audio fingerprint to a remote server, and in response to the transfer of the first audio fingerprint, receive metadata identifying a TV show, advertisement, movie or song from which the audio signal has been captured; sample the audio signal at a second resolution to provide a second sampled audio signal; compute at least a second audio fingerprint from the second sampled audio signal; and transfer the second audio fingerprint to a remote server, and in response to the transfer of the second audio fingerprint, receive metadata to distinguish between distinct versions of the TV show, advertisement, movie or song from which the audio signal has been captured;
wherein at least the second fingerprint corresponds to a distinct pre-distorted version of the TV show, advertisement, movie or song from which the audio signal has been captured. - View Dependent Claims (17, 18, 19)
-
Specification