Low-complexity music detection algorithm and system
Abstract
A method for detecting music in a speech signal having a plurality of frames. The method comprises defining a music threshold value for a first parameter extracted from a frame of the speech signal, defining a background noise threshold value for the first parameter, and defining an unsure threshold value for the first parameter. The unsure threshold value falls between the music threshold value and the background noise threshold value. If the first parameter falls between the music threshold value and the background noise threshold value, the speech signal is classified as music or background noise based on analyzing a plurality of first parameters extracted from the plurality of frames.
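The per-frame decision described in the abstract can be sketched as follows. This is a minimal illustration, not the patented implementation: the function name `classify_frame`, the `Label` enum, and the numeric threshold values are hypothetical, and the source does not say which parameter is extracted or whether the music threshold lies above or below the background noise threshold, so the sketch accepts either ordering.

```python
from enum import Enum


class Label(Enum):
    MUSIC = "music"
    NOISE = "background noise"
    UNSURE = "unsure"  # hypothetical marker: defer to multi-frame analysis


def classify_frame(param, music_thr, noise_thr):
    """Classify one frame from a single extracted parameter.

    Assumption: the two thresholds bound an ambiguous band; a value
    strictly inside the band is deferred, a value outside it is
    assigned to the class whose threshold it lies nearer to.
    """
    low, high = sorted((music_thr, noise_thr))
    if low < param < high:
        # Inside the band: the abstract calls for analysis over several frames.
        return Label.UNSURE
    # Outside the band: pick the nearer threshold's class.
    if abs(param - music_thr) <= abs(param - noise_thr):
        return Label.MUSIC
    return Label.NOISE
```

With illustrative thresholds of 0.8 (music) and 0.2 (background noise), a parameter of 0.9 is labeled music, 0.1 is labeled background noise, and 0.5 is deferred.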
36 Claims
1. A method for detecting music in a speech signal having a plurality of frames, said method comprising:
defining a music threshold value for a first parameter extracted from a frame of said speech signal;
defining a background noise threshold value for said first parameter;
defining an unsure threshold value for said first parameter, wherein said unsure threshold value falls between said music threshold value and said background noise threshold value;
wherein if said first parameter does not fall between said music threshold value and said background noise threshold value, classifying said speech signal as music if said first parameter is in closer range of said music threshold value than said unsure threshold value; and
classifying said speech signal as background noise if said first parameter is in closer range of said background noise threshold value than said unsure threshold value;
wherein if said first parameter falls between said music threshold value and said background noise threshold value, classifying said speech signal as music or background noise based on analyzing a plurality of first parameters extracted from said plurality of frames.
Dependent claims: 2-12
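Claim 1 does not state how the plurality of first parameters is analyzed when a frame lands between the two thresholds. One possible reading, shown here purely as an assumption, is a majority vote in which each recent frame's parameter is compared against the unsure threshold. The function name `resolve_unsure`, the voting rule, and the tie-breaking toward background noise are all hypothetical.

```python
def resolve_unsure(params, music_thr, noise_thr, unsure_thr):
    """Resolve an ambiguous frame from parameters of several frames.

    Assumption (not from the claim): each parameter votes for music
    when it lies on the music side of the unsure threshold, and the
    majority wins. Ties go to background noise.
    """
    if not params:
        raise ValueError("need at least one frame parameter")
    low, high = sorted((music_thr, noise_thr))
    if not (low < unsure_thr < high):
        raise ValueError("unsure threshold must lie between the other two")

    if music_thr > noise_thr:
        # Larger values indicate music.
        music_votes = sum(1 for p in params if p > unsure_thr)
    else:
        # Smaller values indicate music.
        music_votes = sum(1 for p in params if p < unsure_thr)

    return "music" if 2 * music_votes > len(params) else "background noise"
```

For example, with thresholds 0.8 (music), 0.2 (background noise), and 0.5 (unsure), the frame parameters 0.6, 0.7, 0.4 give two music votes out of three, so the signal is classified as music.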
13. A system for detecting music in a speech signal having a plurality of frames, said system comprising:
a module for defining a music threshold value for a first parameter extracted from a frame of said speech signal;
a module for defining a background noise threshold value for said first parameter;
a module for defining an unsure threshold value for said first parameter, wherein said unsure threshold value falls between said music threshold value and said background noise threshold value;
a module for classifying said speech signal as music if said first parameter is in closer range of said music threshold value than said unsure threshold value, if said first parameter does not fall between said music threshold value and said background noise threshold value;
a module for classifying said speech signal as background noise if said first parameter is in closer range of said background noise threshold value than said unsure threshold value, if said first parameter does not fall between said music threshold value and said background noise threshold value;
a module for classifying said speech signal as music or background noise based on analyzing a plurality of first parameters extracted from said plurality of frames, if said first parameter falls between said music threshold value and said background noise threshold value.
Dependent claims: 14-24
25. A computer readable medium including computer software program executable by a processor for implementing a method of detecting music in a speech signal having a plurality of frames, said computer software program comprising:
code for defining a music threshold value for a first parameter extracted from a frame of said speech signal;
code for defining a background noise threshold value for said first parameter;
code for defining an unsure threshold value for said first parameter, wherein said unsure threshold value falls between said music threshold value and said background noise threshold value;
code for classifying said speech signal as music if said first parameter is in closer range of said music threshold value than said unsure threshold value, if said first parameter does not fall between said music threshold value and said background noise threshold value;
code for classifying said speech signal as background noise if said first parameter is in closer range of said background noise threshold value than said unsure threshold value, if said first parameter does not fall between said music threshold value and said background noise threshold value;
code for classifying said speech signal as music or background noise based on analyzing a plurality of first parameters extracted from said plurality of frames, if said first parameter falls between said music threshold value and said background noise threshold value.
Dependent claims: 26-36
Specification