Apparatus for discriminating an audio signal as an ordinary vocal sound or musical sound
First Claim
1. An apparatus for discriminating an audio signal as one of vocal sound and musical sound, said apparatus comprising:
- pre-processing means for providing a vocal frequency band signal and a musical frequency band signal by separating said audio signal;
intermediate decision means, connected to said pre-processing means, for producing a plurality of decision signals respectively indicating whether the audio signal is one of said vocal sound and said musical sound, in response to detection of properties of said audio signal, said intermediate decision means comprising;
a first decision unit for producing a first decision signal by discriminating said audio signal as said vocal sound when said audio signal is monophonic;
a second decision unit for producing a second decision signal by desciminating said audio signal as said musical sound when said musical frequency band signal is detected having a sound pressure higher than a predetermined sound pressure;
a third decision unit for producing a third decision signal by discriminating said audio signal as said vocal sound when an envelope of said vocal frequency band signal is detected having an intermittence lower than a predetermined intermittence; and
a fourth decision unit for producing a fourth decision signal by discriminating said audio signal as said musical sound when said musical frequency band signal comprises a predetermined bandwidth; and
final decision means for producing a final decision signal indicating whether said audio signal is said one of said vocal sound and said musical sound by analyzing and comparing said first, second, third and fourth decision signals.
1 Assignment
0 Petitions
Accused Products
Abstract
An apparatus for discriminating a received audio signal as vocal sound or musical sound includes a pre-processing circuit 100 for separating the audio signal into a vocal frequency band signal and a musical frequency band signal, an intermediate decision circuit having a plurality of decision units for producing a plurality of vocal and musical decision signals, each decision unit distinguishing whether vocal or musical frequency band signal includes properties of voice or music, and a final decision circuit 600 for systematically analyzing the vocal and musical decision signals to produce a final decision signal for discriminating the audio signal as the vocal or musical sound.
-
Citations
30 Claims
-
1. An apparatus for discriminating an audio signal as one of vocal sound and musical sound, said apparatus comprising:
-
pre-processing means for providing a vocal frequency band signal and a musical frequency band signal by separating said audio signal; intermediate decision means, connected to said pre-processing means, for producing a plurality of decision signals respectively indicating whether the audio signal is one of said vocal sound and said musical sound, in response to detection of properties of said audio signal, said intermediate decision means comprising; a first decision unit for producing a first decision signal by discriminating said audio signal as said vocal sound when said audio signal is monophonic; a second decision unit for producing a second decision signal by desciminating said audio signal as said musical sound when said musical frequency band signal is detected having a sound pressure higher than a predetermined sound pressure; a third decision unit for producing a third decision signal by discriminating said audio signal as said vocal sound when an envelope of said vocal frequency band signal is detected having an intermittence lower than a predetermined intermittence; and a fourth decision unit for producing a fourth decision signal by discriminating said audio signal as said musical sound when said musical frequency band signal comprises a predetermined bandwidth; and final decision means for producing a final decision signal indicating whether said audio signal is said one of said vocal sound and said musical sound by analyzing and comparing said first, second, third and fourth decision signals. - View Dependent Claims (2, 3, 4, 7, 8, 9, 10)
-
-
5. An apparatus for discriminating an audio signal as one of vocal sound and musical sound, said apparatus comprising:
-
pre-processing means for generating a vocal frequency band signal and a musical frequency band signal by separating said audio signal; first decision means for producing a first decision signal discriminating said audio signal as said vocal sound when said audio signal is monophonic; second decision means for producing a second decision signal discriminating said audio signal as said musical sound when said musical frequency band signal is detected having a musical frequency band comprising a low frequency band component and a high frequency band component, said musical sound of said musical frequency band having a sound pressure higher than a predetermined sound pressure; third decision means for producing a third decision signal discriminating said audio signal as said vocal sound when an envelope of said vocal frequency band signal is detected having an indicator of non-continuity being lower than a predetermined parameter of non-continuity; fourth decision means for producing a fourth decision signal discriminating said audio signal as said musical sound when said musical frequency band signal comprises a predetermined bandwidth; and final decision means for producing a final decision signal discriminating said audio signal as said one of said vocal sound and said musical sound by analyzing and comparing said first, second, third and fourth decision signals. - View Dependent Claims (6)
-
-
11. A method for discriminating an audio signal as one of vocal sound and musical sound, comprising the steps of:
-
generating a vocal frequency band signal and a musical frequency band signal by separating said audio signal; producing a plurality of decision signals by detecting a corresponding plurality of predefined properties of said audio signal, each of said plurality of predefined properties corresponding to one of said vocal sound and said musical sound; and producing a final decision signal indicating whether said audio signal is said one of said vocal sound and said musical sound by analyzing and comparing said plurality of decision signals. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18)
-
-
19. A detector for detecting a vocal sound and a musical sound of an audio signal, said detector comprising:
-
a frequency band separator separating said audio signal into a vocal component and a musical component by separating the audio signal into a vocal frequency band and a musical frequency band; a processor, connected to said frequency band separator, comprising a plurality of decision circuits for producing a plurality of corresponding decision signals, each of said plurality of decision signals indicating that the audio signal is one of said vocal sound and said musical sound; and a final decision circuit producing a final decision signal indicating whether said audio signal is said one of said vocal sound and said musical sound by analyzing and comparing said plurality of decision signals. - View Dependent Claims (20, 21, 22, 23)
-
-
24. A signal processing apparatus for identifying an audio signal as one of a voice audio signal and a non-voice audio signal, comprising:
-
pre-processor means for processing said audio signal to generate first and second processed signals; first detector means for generating a first detected signal by detecting whether said audio signal is one of stereophonic and monophonic signals; second detector means, coupled to receive said first and second processed signals, for generating a second detected signal by detecting an intensity of high and low frequency components of said audio signal; third detector means, coupled to receive a first one of said first and second processed signals, for generating a third detected signal by detecting whether the intensity of the high and low components of said audio signal is continuous or intermittent; fourth detector means, coupled to receive a second one of said first and second processed signals, for generating a fourth detected signal by detecting peak frequency changes in a spectrum of said audio signal; and decision means for generating a final decision signal identifying whether the input audio signal is one of said voice audio signal and said non-voice audio signal in dependence upon a determination of the majority of the first, second, third and fourth detected signal. - View Dependent Claims (25, 26, 27, 28, 29, 30)
-
Specification