Speech detection device for the detection of speech end points based on variance of frequency band limited energy
First Claim
Patent Images
1. A device for detecting speech in an input signal comprising:
- first determining means for determining a plurality of values representative of a plurality of frequency band limited energy within the signal, wherein the signal is sampled at a predetermined sampling rate in a single frequency band over a first plurality of frames, wherein each frame comprises a plurality of samples;
second determining means for receiving the plurality of values from said first determining means, and determining a variance of the frequency band limited energy of the signal in the single frequency band over a second plurality of frames;
third determining means for determining beginning and ending points of speech within the signal using the variance of the frequency band limited energy; and
a signal recording device including;
means for receiving the signal;
means for storing the most recent m seconds of the received signal; and
means for selecting the portion of the stored signal that corresponds to the start and the end points determined by said third determining means.
2 Assignments
0 Petitions
Accused Products
Abstract
The device detects the beginning and ending portions of speech contained within an input signal based on the variance of frequency band limited energy within the signal. The use of the variance allows detection which is relatively independent of an absolute signal-to-noise ratio with the signal, and allows accurate detection within a wide variety of backgrounds such as music, motor noise, and background noise, such as other speakers. The device can be easily implemented using off-the-shelf hardware along with a high-speed special purpose digital signal processor integrated circuit.
-
Citations
7 Claims
-
1. A device for detecting speech in an input signal comprising:
-
first determining means for determining a plurality of values representative of a plurality of frequency band limited energy within the signal, wherein the signal is sampled at a predetermined sampling rate in a single frequency band over a first plurality of frames, wherein each frame comprises a plurality of samples; second determining means for receiving the plurality of values from said first determining means, and determining a variance of the frequency band limited energy of the signal in the single frequency band over a second plurality of frames; third determining means for determining beginning and ending points of speech within the signal using the variance of the frequency band limited energy; and a signal recording device including; means for receiving the signal; means for storing the most recent m seconds of the received signal; and means for selecting the portion of the stored signal that corresponds to the start and the end points determined by said third determining means. - View Dependent Claims (2, 3)
-
-
4. A device for detecting speech in an input signal comprising:
first determining means for determining a plurality of values representative of a plurality of frequency band limited energy within the signal, wherein the signal is sampled at a predetermined sampling rate in a single frequency band over a first plurality of frames, wherein each frame comprises a plurality of samples, said first determining means including; means for calculating the energy of the frequency band limited signal; and means for applying a smoothing function to energy of the frequency band limited signal to generate the frequency band limited energy; second determining means for receiving the plurality of values from said first determining means, and determining a variance of the frequency band limited energy of the signal in the single frequency band over a second plurality of frames; and third determining means for determining beginning and ending points of speech within the signal using the variance of the frequency band limited energy. - View Dependent Claims (5, 6, 7)
Specification