Device, method, and medium for generating audio fingerprint and retrieving audio data
First Claim
1. A device for generating an audio fingerprint, the device comprising:
- a coefficient extracting section to partially decode audio data in a compression area and to extract MDCT (Modified Discrete Cosine Transform) coefficients;
a coefficient selecting section to select an MDCT coefficient robust to noises from the extracted MDCT coefficients;
a modulation spectrum generating section to process the selected MDCT coefficient and to generate a modulation spectrum; and
a bit conversion section to quantize the generated modulation spectrum and to generate an audio fingerprint.
1 Assignment
0 Petitions
Accused Products
Abstract
Provided are device, method, and medium for generating an audio fingerprint and retrieving audio data. The device for generating an audio fingerprint includes: a coefficient extracting section partially decoding audio data in a compression area and extracting MDCT (Modified Discrete Cosine Transform) coefficients; a coefficient selecting section selecting an MDCT coefficient robust to noises from the extracted MDCT coefficients; a modulation spectrum generating section transforming the selected MDCT coefficient by the use of a Fourier transform method and generating a modulation spectrum; and a bit conversion section quantizing the generated modulation spectrum and generating an audio fingerprint. As a result, it is possible to accurately and rapidly retrieve the audio data recorded in a variety of environments. Since elements based on MP3 are used, it is possible to apply to MP3 applications in various manners. In addition, it is possible to apply to classification of audio data such as classification of music moods and classification of music genres and various other fields such as extraction of a specific event from moving images of sports.
52 Citations
43 Claims
-
1. A device for generating an audio fingerprint, the device comprising:
-
a coefficient extracting section to partially decode audio data in a compression area and to extract MDCT (Modified Discrete Cosine Transform) coefficients;
a coefficient selecting section to select an MDCT coefficient robust to noises from the extracted MDCT coefficients;
a modulation spectrum generating section to process the selected MDCT coefficient and to generate a modulation spectrum; and
a bit conversion section to quantize the generated modulation spectrum and to generate an audio fingerprint. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A device for generating an audio fingerprint, the device comprising:
-
a coefficient extracting section to partially decode audio data in a compression area and to extract MDCT (Modified Discrete Cosine Transform) coefficients;
a synthesization section to synthesize PPF (PolyPhase Filter) sub bands from the extracted MDCT coefficients;
a band selecting section to select a PPF sub band robust to noises from the synthesized PPF sub bands;
a modulation spectrum generating section to process the selected PPF sub band and to generate a modulation spectrum; and
a bit conversion section to quantize the generated modulation spectrum and to generate an audio fingerprint. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A device for retrieving audio data, which corresponds to an audio fingerprint generated by partially decoding the audio data in a compression area, selecting an MDCT (Modified Discrete Cosine Coefficient) coefficient or a PPF (PolyPhase Filter) sub band, generating a modulation spectrum by transforming the selected MDCT coefficient or PPF sub band by the use of a Fourier transform method, and quantizing the generated modulation spectrum, the device comprising:
-
an audio fingerprint storage section to convert the audio data into an audio fingerprint and to store the audio fingerprint;
a hashing table to store a hashing value corresponding to the stored audio fingerprint;
a hashing value generating section to extract an indexing bit from the generated audio fingerprint and to generate a hashing value;
a table searching section to search the hashing table for the hashing value matched with the generated hashing value;
a BER calculating section to calculate a BER (Bit Error Ratio) between the generated audio fingerprint and the audio fingerprint corresponding to the searched hashing value;
a comparison section to compare the calculated BER with a threshold value; and
an audio fingerprint detecting section to detect the audio fingerprint in accordance with the comparison result. - View Dependent Claims (16, 17, 18)
-
-
19. A method of generating an audio fingerprint, the method comprising:
-
partially decoding audio data in a compression area and extracting MDCT (Modified Discrete Cosine Transform) coefficients;
selecting an MDCT coefficient robust to noises from the extracted MDCT coefficients;
processing the selected MDCT coefficient and generating a modulation spectrum; and
quantizing the generated modulation spectrum and generating an audio fingerprint. - View Dependent Claims (20, 21, 22, 23, 24, 25, 37)
-
-
26. A method of generating an audio fingerprint, the method comprising:
-
partially decoding audio data in a compression area and extracting MDCT (Modified Discrete Cosine Transform) coefficients;
synthesizing PPF (PolyPhase Filter) sub bands from the extracted MDCT coefficients;
selecting a PPF sub band robust to noises from the synthesized PPF sub bands;
processing the selected PPF sub band and generating a modulation spectrum; and
quantizing the generated modulation spectrum and generating an audio fingerprint. - View Dependent Claims (27, 28, 29, 30, 31, 32, 38)
-
-
33. A method of retrieving audio data, which corresponds to an audio fingerprint generated by partially decoding the audio data in a compression area, selecting an MDCT (Modified Discrete Cosine Coefficient) coefficient or a PPF (PolyPhase Filter) sub band, generating a modulation spectrum by transforming the selected MDCT coefficient or PPF sub band by the use of a Fourier transform method, and quantizing the generated modulation spectrum, the method comprising:
-
extracting an indexing bit from the generated audio fingerprint and generating a hashing value;
retrieving a hashing value matched with the generated hashing value from a hashing table storing a hashing value corresponding to an audio fingerprint stored in a storage medium;
calculating a BER (Bit Error Ratio) between the generated audio fingerprint and the audio fingerprint corresponding to the retrieved hashing value;
comparing the calculated BER with a threshold value; and
detecting the audio fingerprint in accordance with the comparison result. - View Dependent Claims (34, 35, 36, 39)
-
-
40. A method of generating an audio fingerprint, the method comprising:
-
selecting an MDCT (Modified Discrete Cosine Transform) coefficient robust to noises from MDCT coefficients extracted from audio data;
processing the selected MDCT coefficient and generating a modulation spectrum; and
quantizing the generated modulation spectrum and generating an audio fingerprint. - View Dependent Claims (41)
-
-
42. A method of generating an audio fingerprint, the method comprising:
-
synthesizing PPF (PolyPhase Filter) sub bands from MDCT (Modified Discrete Cosine Transform) coefficients extracted from audio data;
selecting a PPF sub band robust to noises from the synthesized PPF sub bands;
processing the selected PPF sub band and generating a modulation spectrum; and
quantizing the generated modulation spectrum and generating an audio fingerprint. - View Dependent Claims (43)
-
Specification