Parameterized temporal feature analysis
First Claim
1. A method for classifying at least one audio signal into at least one audio class, the method comprising the steps of:
- analyzing said audio signal to extract at least one predetermined audio feature;
performing a frequency analysis on a set of values of said extracted predetermined audio feature at different time instances resulting in a power spectrum of said extracted predetermined audio feature;
deriving at least one further audio feature representing a temporal behavior of said extracted predetermined audio feature by parameterizing said power spectrum, wherein parameterizing said power spectrum comprises (a) summarizing a frequency axis of the power spectrum by summing energy within at least one predetermined frequency band and (b) dividing (b)(i) the summed energy within the at least one predetermined frequency band by (b)(ii) an average of subsequent values alone of said extracted predetermined audio feature, not including preceding values, to (c) yield a relative modulation depth representing an amount of envelope modulation in the at least one predetermined frequency band; and
classifying said audio signal based on said further audio feature.
1 Assignment
0 Petitions
Accused Products
Abstract
A method (1) for classifying at least one audio signal (A) into at least one audio class (AC), the method (1) comprising the steps of analyzing (10) said audio signal to extract at least one predetermined audio feature, performing (12) a frequency analysis on a set of values of said audio feature at different time instances, deriving (12) at least one further audio feature representing a temporal behavior of said audio feature based on said frequency analysis, and classifying (14) said audio signal based on said further audio feature. With the further audio feature, information is obtained about the temporal fluctuation of an audio feature, which may be advantageous for a classification of audio.
-
Citations
11 Claims
-
1. A method for classifying at least one audio signal into at least one audio class, the method comprising the steps of:
-
analyzing said audio signal to extract at least one predetermined audio feature; performing a frequency analysis on a set of values of said extracted predetermined audio feature at different time instances resulting in a power spectrum of said extracted predetermined audio feature; deriving at least one further audio feature representing a temporal behavior of said extracted predetermined audio feature by parameterizing said power spectrum, wherein parameterizing said power spectrum comprises (a) summarizing a frequency axis of the power spectrum by summing energy within at least one predetermined frequency band and (b) dividing (b)(i) the summed energy within the at least one predetermined frequency band by (b)(ii) an average of subsequent values alone of said extracted predetermined audio feature, not including preceding values, to (c) yield a relative modulation depth representing an amount of envelope modulation in the at least one predetermined frequency band; and classifying said audio signal based on said further audio feature. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A system for classifying at least one audio signal into at least one audio class, the system comprising:
-
means for analyzing said audio signal to extract at least one predetermined audio feature; means for performing a frequency analysis on a set of values of said extracted predetermined audio feature at different time instances resulting in a power spectrum of said extracted predetermined audio feature; means for deriving at least one further audio feature representing a temporal behavior of said extracted predetermined audio feature by parameterizing said power spectrum, wherein parameterizing said power spectrum comprises (a) summarizing a frequency axis of the power spectrum by summing energy within at least one predetermined frequency band and (b) dividing (b)(i) the summed energy within the at least one predetermined frequency band by (b)(ii) an average of subsequent values alone of said extracted predetermined audio feature, not including preceding values, to (c) yield a relative modulation depth representing an amount of envelope modulation in the at least one predetermined frequency band; and means for classifying said audio signal based on said further audio feature. - View Dependent Claims (10, 11)
-
Specification