System for selling a product utilizing audio content identification
First Claim
1. A method for selling products containing or relating to audio content, said method comprising the steps of:
- receiving a recorded audio content image;
generating audio identifying information for the audio content image based on detected events in the audio content image;
determining whether the audio identifying information generated for the audio content image matches audio identifying information in an audio content database; and
if the audio identifying information generated for the audio content image matches audio identifying information in the audio content database, charging a fee for at least one product containing or relating to audio content that corresponds to the matching audio identifying information, wherein the generating step includes the sub-step of;
detecting a plurality of events in the audio content image, each of the events being a crossing of the value of a first running average and the value of a second running average, wherein the first running average is an average over a first averaging period of a plurality of time dependent frequency components of the audio content image, and the second running average is an average over a second averaging period, which is different than the first averaging period, of the time dependent frequency components of the audio content image.
1 Assignment
0 Petitions
Accused Products
Abstract
It is determined whether audio identifying information generated for an audio content image matches audio identifying information in an audio content database. If the audio identifying information generated for the audio content image matches audio identifying information in the audio content database, at least one product containing or relating to audio content that corresponds to the matching audio identifying information is identified. In one embodiment, the audio content image is received, and the audio identifying information is generated for the audio content image. In another embodiment, the audio identifying information for the audio content image is received. Also provided is a system for selling products.
242 Citations
34 Claims
-
1. A method for selling products containing or relating to audio content, said method comprising the steps of:
-
receiving a recorded audio content image;
generating audio identifying information for the audio content image based on detected events in the audio content image;
determining whether the audio identifying information generated for the audio content image matches audio identifying information in an audio content database; and
if the audio identifying information generated for the audio content image matches audio identifying information in the audio content database, charging a fee for at least one product containing or relating to audio content that corresponds to the matching audio identifying information, wherein the generating step includes the sub-step of;
detecting a plurality of events in the audio content image, each of the events being a crossing of the value of a first running average and the value of a second running average, wherein the first running average is an average over a first averaging period of a plurality of time dependent frequency components of the audio content image, and the second running average is an average over a second averaging period, which is different than the first averaging period, of the time dependent frequency components of the audio content image. - View Dependent Claims (2)
wherein the receiving step includes the sub-step of receiving a transmitted audio content image of at least a portion of a song, and the charging step includes the sub-step of transmitting a recording of at least a song that corresponds to the matching audio identifying information.
-
-
3. A method for selling products containing audio content, said method comprising the steps of:
-
receiving a recorded audio content image;
generating audio identifying information for the audio content image based on detected events in the audio content image;
determining whether the audio identifying information generated for the audio content image matches audio identifying information in an audio content database; and
if the audio identifying information generated for the audio content image matches audio identifying information in the audio content database, generating a product containing audio content that corresponds to the matching audio identifying information, wherein the generating step includes the sub-step of;
detecting a plurality of events in the audio content image, each of the events being a crossing of the value of a first running average and the value of a second running average, wherein the first running average is an average over a first averaging period of a plurality of time dependent frequency components of the audio content image, and the second running average is an average over a second averaging period, which is different than the first averaging period, of the time dependent frequency components of the audio content image. - View Dependent Claims (4)
-
-
5. A method for selling products containing or relating to audio content, said method comprising the steps of:
-
determining whether audio identifying information generated for an audio content image matches audio identifying information in an audio content database; and
if the audio identifying information generated for the audio content image matches audio identifying information in the audio content database, identifying at least one product containing or relating to audio content that corresponds to the matching audio identifying information, wherein the audio identifying information for the audio content image was generated based on a plurality of events detected in the audio content image, each of the events being a crossing of the value of a first running average and the value of a second running average, the first running average is an average over a first averaging period of a plurality of time dependent frequency components of the audio content image, and the second running average is an average over a second averaging period, which is different than the first averaging period, of the time dependent frequency components of the audio content image. - View Dependent Claims (6, 7, 8, 9, 10, 11, 12, 13, 14, 15)
receiving the audio content image; and
generating the audio identifying information for the audio content image based on the events detected in the audio content image.
-
-
7. The method according to claim 6, wherein the generating step includes the sub-steps of:
-
obtaining an audio signal characterized by a time dependent power spectrum;
analyzing the spectrum to obtain the time dependent frequency components;
detecting a plurality of the events in the time dependent frequency components; and
producing the audio identifying information for the audio content image based on the events detected.
-
-
8. The method according to claim 7 wherein the sub-step of analyzing the spectrum includes:
-
sampling the audio signal to obtain a plurality of audio signal samples;
taking a plurality of subsets from the plurality of audio signal samples; and
performing a Fourier transform on each of the plurality of subsets to obtain a set of Fourier frequency components.
-
-
9. The method according to claim 6, wherein the generating step includes the sub-steps of:
-
performing a Fourier transformation of an audio signal into a time series of audio power dissipated over a first plurality of frequencies;
grouping the frequencies into a smaller second plurality of bands that each include a range of neighboring frequencies;
detecting power dissipation events in each of the bands; and
grouping together the power dissipation events from mutually adjacent bands at a selected moment so as to form the audio identifying information.
-
-
10. The method according to claim 5, further comprising the step of receiving the audio identifying information for the audio content image.
-
11. The method according to claim 5, further comprising the step of charging a fee for the identified product.
-
12. The method according to claim 5, wherein the audio identifying information is an audio feature signature that is based on the events detected in the audio content image.
-
13. The method according to claim 12, wherein the determining step includes the of comparing the audio feature signature generated for the audio content image with the audio feature signatures stored in the audio content database.
-
14. The method according to claim 5, further comprising the steps of:
-
generating audio identifying information corresponding to predetermined audio content; and
storing the audio identifying information corresponding to the predetermined audio content in the audio content database.
-
-
15. The method according to claim 5, further comprising the step of charging a fee for identifying the audio content that corresponds to the matching audio identifying information.
-
16. A computer-readable medium encoded with a program for selling products containing audio content, said program containing instructions for performing the steps of:
-
receiving a recorded audio content image;
generating audio identifying information for the audio content image based on detected events in the audio content image;
determining whether the audio identifying information generated for the audio content image matches audio identifying information in an audio content database; and
if the audio identifying information generated for the audio content image matches audio identifying information in the audio content database, generating a product containing audio content that corresponds to the matching audio identifying information, wherein the generating step includes the sub-step of;
detecting a plurality of events in the audio content image, each of the events being a crossing of the value of a first running average and the value of a second running average, wherein the first running average is an average over a first averaging period of a plurality of time dependent frequency components of the audio content image, and the second running average is an average over a second averaging period, which is different than the first averaging period, of the time dependent frequency components of the audio content image. - View Dependent Claims (17)
-
-
18. A computer-readable medium encoded with a program for selling products containing or relating to audio content, said program containing instructions for performing the steps of:
-
determining whether audio identifying information generated for an audio content image matches audio identifying information in an audio content database; and
if the audio identifying information generated for the audio content image matches audio identifying information in the audio content database, identifying at least one product containing or relating to audio content that corresponds to the matching audio identifying information, wherein the audio identifying information for the audio content image was generated based on a plurality of events detected in the audio content image, each of the events being a crossing of the value of a first running average and the value of a second running average, the first running average is an average over a first averaging period of a plurality of time dependent frequency components of the audio content image, and the second running average is an average over a second averaging period, which is different than the first averaging period, of the time dependent frequency components of the audio content image. - View Dependent Claims (19, 20, 21, 22, 23, 24, 25, 26)
receiving the audio content image; and
generating the audio identifying information for the audio content image based on the events detected in the audio content image.
-
-
20. The computer-readable medium according to claim 19, wherein said program further contains instructions for performing the steps of:
-
generating audio identifying information corresponding to predetermined audio content; and
storing the audio identifying information corresponding to the predetermined audio content in the audio content database.
-
-
21. The computer-readable medium according to claim 19, wherein the generating step includes the sub-steps of:
-
obtaining an audio signal characterized by a time dependent power spectrum;
analyzing the spectrum to obtain the time dependent frequency components;
detecting a plurality of the events in the time dependent frequency components; and
producing the audio identifying information for the audio content image based on the events detected.
-
-
22. The computer-readable medium according to claim 21, wherein the sub-step of analyzing the spectrum includes:
-
sampling the audio signal to obtain a plurality of audio signal samples;
taking a plurality of subsets from the plurality of audio signal samples; and
performing a Fourier transform on each of the plurality of subsets to obtain a set of Fourier frequency components.
-
-
23. The computer-readable medium according to claim 19, wherein the generating step includes the sub-steps of:
-
performing a Fourier transformation of an audio signal into a time series of audio power dissipated over a first plurality of frequencies;
grouping the frequencies into a smaller second plurality of bands that each include a range of neighboring frequencies;
detecting power dissipation events in each of the bands; and
grouping together the power dissipation events from mutually adjacent bands at a selected moment so as to form the audio identifying information.
-
-
24. The computer-readable medium according to claim 18, wherein said program further contains instructions for performing the step of receiving the audio identifying information for the audio content image.
-
25. The computer-readable medium according to claim 18, further comprising:
-
receiving a transmitted audio content image of at least a portion of a song; and
transmitting a recording of at least a song that corresponds to the matching audio identifying information.
-
-
26. The computer-readable medium according to claim 18,
wherein the audio identifying information is an audio feature signature that is based on the events detected in the audio content image, and the determining step includes the sub-step of comparing the audio feature signature generated for the audio content image with the audio feature signatures stored in the audio content database.
-
27. A system comprising:
-
an input interface for receiving a recorded audio content image;
an identifying information generator for generating audio identifying information for the audio content image based on detected events in the audio content image;
a match detector for determining whether the audio identifying information generated for the audio content image matches audio identifying information in an audio content database; and
a product generator for generating a product containing audio content that corresponds to the matching audio identifying information, if the audio identifying information generated for the audio content image matches audio identifying information in the audio content database, wherein the identifying information generator detects a plurality of events in the audio content image, each of the events being a crossing of the value of a first running average and the value of a second running average, the first running average is an average over a first averaging period of a plurality of time dependent frequency components of the audio content image, and the second running average is an average over a second averaging period, which is different than the first averaging period, of the time dependent frequency components of the audio content image.
-
-
28. A system comprising:
-
a match detector for determining whether audio identifying information generated for an audio content image matches audio identifying information in an audio content database; and
a product identifier for identifying at least one product containing or relating to audio content that corresponds to the matching audio identifying information, if the audio identifying information generated for the audio content image matches audio identifying information in the audio content database, wherein the audio identifying information for the audio content image was generated based on a plurality of events detected in the audio content image, each of the events being a crossing of the value of a first running average and the value of a second running average, the first running average is an average over a first averaging period of a plurality of time dependent frequency components of the audio content image, and the second running average is an average over a second averaging period, which is different than the first averaging period, of the time dependent frequency components of the audio content image. - View Dependent Claims (29, 30, 31, 32, 33, 34)
an input interface for receiving the audio content image; and
an identifying information generator for generating the audio identifying information for the audio content image based on the events detected in the audio content image.
-
-
30. The system according to claim 28, further comprising an input interface for receiving the audio identifying information for the audio content image.
-
31. The system according to claim 28, further comprising:
-
an output interface for transmitting a recording of at least a song that corresponds to the matching audio identifying information, wherein the input interface is adapted for receiving a transmitted audio content image of at least a portion of a song.
-
-
32. The system according to claim 28, wherein the audio identifying information is an audio feature signature that is based on the events detected in the audio content image.
-
33. The system according to claim 32, wherein the match detector compares the audio feature signature generated for the audio content image with the audio feature signatures stored in the audio content database.
-
34. The system according to claim 28, wherein the audio content database stores audio identifying information for predetermined audio content.
Specification