Audio Signal Classification Method and Apparatus
First Claim
1. An audio signal classification method, comprising:
determining, according to voice activity of a current audio frame, whether to obtain a current frequency spectrum fluctuation parameter of the current audio frame and store the current frequency spectrum fluctuation parameter, wherein a frequency spectrum fluctuation parameter denotes an energy fluctuation of a frequency spectrum of an audio signal;
updating, according to whether the audio frame is percussive music, stored one or more frequency spectrum fluctuation parameters; and
classifying the current audio frame as a speech frame or a music frame according to statistics of a part or all of effective data of the stored frequency spectrum fluctuation parameters.
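The claim only requires that the frequency spectrum fluctuation parameter denote an energy fluctuation of the frequency spectrum; it does not fix a formula here. As a minimal sketch, assuming the parameter is taken as the mean absolute change in log energy per frequency bin between two consecutive frames, it could be computed as follows. The function name `spectral_fluctuation`, the `floor` argument, and the formula itself are illustrative assumptions, not the patented definition.

```python
import math


def spectral_fluctuation(prev_spectrum, cur_spectrum, floor=1e-12):
    """Mean absolute per-bin change in log10 energy between two frames.

    Assumption: this formula is illustrative only. The claim states just
    that the parameter denotes an energy fluctuation of the spectrum.
    """
    if len(prev_spectrum) != len(cur_spectrum) or not cur_spectrum:
        raise ValueError("spectra must be non-empty and of equal length")
    total = 0.0
    for prev_energy, cur_energy in zip(prev_spectrum, cur_spectrum):
        # Clamp to a small floor so log10 stays defined for silent bins.
        cur_log = math.log10(max(cur_energy, floor))
        prev_log = math.log10(max(prev_energy, floor))
        total += abs(cur_log - prev_log)
    return total / len(cur_spectrum)
```

With this definition, two identical spectra give 0, and larger values indicate that energy moved more between frames.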
Abstract
An audio signal classification method and apparatus, where the method includes: determining, according to voice activity of a current audio frame, whether to obtain a frequency spectrum fluctuation of the current audio frame and store the frequency spectrum fluctuation in a frequency spectrum fluctuation memory; updating, according to whether the audio frame is percussive music or according to activity of a historical audio frame, the frequency spectrum fluctuations stored in the frequency spectrum fluctuation memory; and classifying the current audio frame as a speech frame or a music frame according to statistics of a part or all of effective data of the frequency spectrum fluctuations stored in the frequency spectrum fluctuation memory.
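The three steps in the abstract (gate on voice activity, update the memory, classify from statistics) can be sketched as a small class. This is a sketch under stated assumptions, not the patented implementation: the class name `FluctuationClassifier`, the memory capacity, the threshold value, the minimum frame count, the `update` hook, and the rule "low mean fluctuation means music" are all hypothetical choices made for illustration.

```python
from collections import deque
from statistics import mean


class FluctuationClassifier:
    """Gate, store, update, classify. All defaults are assumptions."""

    def __init__(self, capacity=60, music_threshold=3.0, min_frames=3,
                 update=None):
        self.memory = deque(maxlen=capacity)  # fluctuation memory
        self.music_threshold = music_threshold
        self.min_frames = min_frames
        self.update = update  # optional hook: list of values -> list

    def process(self, flux, is_active):
        # Step 1: voice activity decides whether the value is stored.
        if is_active:
            self.memory.append(flux)
        # Step 2: optionally rewrite the stored values (hypothetical hook).
        if self.update is not None:
            updated = self.update(list(self.memory))
            self.memory = deque(updated, maxlen=self.memory.maxlen)
        # Step 3: classify from a statistic of the stored values.
        if len(self.memory) < self.min_frames:
            return None  # not enough data to decide
        # Assumption: a low average fluctuation is treated as music.
        if mean(self.memory) < self.music_threshold:
            return "music"
        return "speech"
```

A caller would invoke `process` once per frame; frames flagged inactive leave the memory unchanged, so silence or background noise does not influence the statistic.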
30 Claims
1. An audio signal classification method, comprising:
determining, according to voice activity of a current audio frame, whether to obtain a current frequency spectrum fluctuation parameter of the current audio frame and store the current frequency spectrum fluctuation parameter, wherein a frequency spectrum fluctuation parameter denotes an energy fluctuation of a frequency spectrum of an audio signal;
updating, according to whether the audio frame is percussive music, stored one or more frequency spectrum fluctuation parameters; and
classifying the current audio frame as a speech frame or a music frame according to statistics of a part or all of effective data of the stored frequency spectrum fluctuation parameters.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
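The updating step of claim 1 depends on whether the frame is percussive music, but the claim does not say how the stored values change. One plausible reading, offered here only as an assumption, is that percussive onsets produce large fluctuation values that resemble speech, so the stored values are capped when percussion is detected. The function name `update_for_percussion` and the `cap` value are hypothetical.

```python
def update_for_percussion(stored, is_percussive, cap=3.0):
    """Return the stored fluctuation values after the percussive update.

    Assumption: on a percussive frame, every stored value is limited to
    `cap` so later statistics lean toward music. The claim itself does
    not specify the modification.
    """
    if not is_percussive:
        return list(stored)  # nothing to change for other frames
    return [min(value, cap) for value in stored]
```

For example, `update_for_percussion([1.0, 8.0, 5.0], True)` yields `[1.0, 3.0, 3.0]`, while a non-percussive frame leaves the values untouched.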
10. An audio signal classification method, comprising:
determining, according to voice activity of a current audio frame, whether to obtain a current frequency spectrum fluctuation parameter of the current audio frame and store the current frequency spectrum fluctuation parameter, wherein a frequency spectrum fluctuation parameter denotes an energy fluctuation of a frequency spectrum of an audio signal;
updating, according to activity of a historical audio frame, stored one or more frequency spectrum fluctuation parameters; and
classifying the current audio frame as a speech frame or a music frame according to statistics of a part or all of effective data of the stored frequency spectrum fluctuation parameters.
Dependent claims: 11, 12, 13, 14, 15, 16, 17, 18, 19, 20
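Claim 10 differs from claim 1 in that the update is driven by the activity of a historical audio frame, and the classification uses only "effective data". A minimal sketch of one way this could work, assuming that an inactive previous frame marks all older stored values as ineffective while the newest value is kept, follows. The names `update_for_history`, `effective_data`, and the `INEFFECTIVE` marker are hypothetical, and the rule itself is an assumption rather than the claimed behavior.

```python
INEFFECTIVE = None  # hypothetical marker for data excluded from statistics


def update_for_history(stored, prev_frame_active):
    """Invalidate older values when the previous frame was inactive.

    Assumption: values stored before a pause may describe a different
    segment, so only the newest value (last in the list) stays effective.
    """
    if prev_frame_active or not stored:
        return list(stored)
    return [INEFFECTIVE] * (len(stored) - 1) + [stored[-1]]


def effective_data(stored):
    """Return only the values that still count toward the statistics."""
    return [value for value in stored if value is not INEFFECTIVE]
```

The classifier would then compute its statistic over `effective_data(stored)` rather than over the whole memory.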
21. An audio signal classification apparatus configured to classify an input audio signal, comprising:
a memory; and
a processor coupled to the memory,
wherein the processor is configured to determine, according to voice activity of a current audio frame, whether to obtain and store a current frequency spectrum fluctuation parameter of the current audio frame, wherein the current frequency spectrum fluctuation parameter denotes an energy fluctuation of a frequency spectrum of an audio signal,
wherein the memory is configured to store one or more frequency spectrum fluctuation parameters when the processor outputs a result that the frequency spectrum fluctuation parameter needs to be stored;
wherein the processor is further configured to:
update, according to whether the audio frame is percussive music or activity of a historical audio frame, the frequency spectrum fluctuation parameters stored in the memory; and
classify the current audio frame as a speech frame or a music frame according to statistics of a part or all of effective data of the frequency spectrum fluctuation parameters stored in the memory.
Dependent claims: 22, 23, 24, 25, 26, 27, 28, 29, 30
Specification