FEATURE EXTRACTION FOR IDENTIFICATION AND CLASSIFICATION OF AUDIO SIGNALS
First Claim
Patent Images
1. A method for extracting audio features from an audio frame, the method comprising:
- Pre-process the audio;
Transform the audio into a plurality of frequency sub-bands;
Store the said transformation results as sub-band coefficients in S, each sub-band coefficient correlates to the audio signal at a time slice with a certain frequency band.For every sub-band coefficients in S, subtract the mean value which corresponding to the same time slice, store the results in Z.For every sub-band in Z, weight and sum a plurality sequential chunk of sub-band coefficients;
For every sub-band, weight the above said result and differential encode with a sub-band coefficient in the same sub-band, store the results in B.Calculate the sum of all sub-band coefficients in the same time slice as the total band coefficients;
Weight and sum a plurality sequential chunk of total band coefficients;
Quantize the sub-band coefficients with at least one quantization threshold, which is derived from the said above operation, store the results in A.Calculate audio fingerprints from A and B.
0 Assignments
0 Petitions
Accused Products
Abstract
Characteristic features are extracted from an audio sample based on its acoustic content. The features can be coded as fingerprints, which can be used to identify the audio from a fingerprints database. The features can also be used as parameters to separate the audio into different categories.
162 Citations
20 Claims
-
1. A method for extracting audio features from an audio frame, the method comprising:
-
Pre-process the audio; Transform the audio into a plurality of frequency sub-bands; Store the said transformation results as sub-band coefficients in S, each sub-band coefficient correlates to the audio signal at a time slice with a certain frequency band. For every sub-band coefficients in S, subtract the mean value which corresponding to the same time slice, store the results in Z. For every sub-band in Z, weight and sum a plurality sequential chunk of sub-band coefficients; For every sub-band, weight the above said result and differential encode with a sub-band coefficient in the same sub-band, store the results in B. Calculate the sum of all sub-band coefficients in the same time slice as the total band coefficients; Weight and sum a plurality sequential chunk of total band coefficients; Quantize the sub-band coefficients with at least one quantization threshold, which is derived from the said above operation, store the results in A. Calculate audio fingerprints from A and B. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 20)
-
-
9. A method for identifying an unknown audio signal by employing audio fingerprints of a plurality of audio signals, comprising the steps of:
-
Calculating audio fingerprints for the said audio signal or for part of the audio signal, including; Pre-process the audio; Transform the audio into a plurality of frequency sub-bands; Store the said transformation results as sub-band coefficients in S, each sub-band coefficient correlates to the audio signal at a time slice with a certain frequency band. For every sub-band coefficients in S, subtract the mean value which corresponding to the same time slice, store the results in Z. For every sub-band in Z, weight and sum a plurality sequential chunk of sub-band coefficients; For every sub-band, weight the above said result and differential encode with a sub-band coefficient in the same sub-band, store the results in B. Calculate the sum of all sub-band coefficients in the same time slice as the total band coefficients; Weight and sum a plurality sequential chunk of total band coefficients; Quantize the sub-band coefficients with at least one quantization threshold, which is derived from the said above operation, store the results in A. Obtain audio fingerprints from A and B. Obtain same length fingerprints from the reference audio fingerprints; Compare the said two fingerprints to determine the similarity. - View Dependent Claims (10, 11, 12)
-
-
13. A system for extracting audio features from an audio frame, the system comprising:
-
Pre-process the audio; Transform the audio into a plurality of frequency sub-bands; Store the said transformation results as sub-band coefficients in S, each sub-band coefficient correlates to the audio signal at a time slice with a certain frequency band. For every sub-band coefficients in S, subtract the mean value which corresponding to the same time slice, store the results in Z. For every sub-band in Z, weight and sum a plurality sequential chunk of sub-band coefficients; For every sub-band, weight the above said result and differential encode with a sub-band coefficient in the same sub-band, store the results in B. Calculate the sum of all sub-band coefficients in the same time slice as the total band coefficients; Weight and sum a plurality sequential chunk of total band coefficients; Quantize the sub-band coefficients with at least one quantization threshold, which is derived from the said above operation, store the results in A. Calculate audio fingerprints from A and B. - View Dependent Claims (14, 15, 16, 17, 18, 19)
-
Specification