Normalization of speech signals
First Claim
1. A method for overcoming the distortions in the spectrum-of-silence in a system for accepting words presented in a stream of continuous speech, processing the stream into amplitude histograms for respective frequencies, and carrying out recognition processes, characterized by(a) collecting amplitude histograms as a function of frequency;
- (b) extracting, for each frequency, the amplitude P(05) defining the 5th percentile and the amplitude P(95) defining the 95th percentile; and
(c) normalizing, for each frequency, the input amplitude A(in) of the speech signal to obtain an output signal amplitude ##EQU6##
1 Assignment
0 Petitions
Accused Products
Abstract
A method and a system are disclosed for normalizing a speech signal prior to a speech recognition process. In a preparatory procedure, a sample interval of speech is separated into thirty-one frequency bands and an amplitude histogram is generated for each band. From these histograms, the 5% percentile amplitude value P(05) and the 95% percentile amplitude value P(95) are extracted for each band and these values are stored for later reference. For actual normalization, the current speech signal is also divided into the same frequency band as in the preparatory procedure, and consecutive input amplitude values A(in) of each frequency band are modified, using the percentile values of the respective band, to obtain output values according to ##EQU1## The essential effect of this normalizing treatment is that the resulting long-term spectrum is given by the P(95) values and the spectrum of silence is given by the P(05) values. After normalization, all speech has the same silence spectrum and long-term spectrum.
-
Citations
10 Claims
-
1. A method for overcoming the distortions in the spectrum-of-silence in a system for accepting words presented in a stream of continuous speech, processing the stream into amplitude histograms for respective frequencies, and carrying out recognition processes, characterized by
(a) collecting amplitude histograms as a function of frequency; -
(b) extracting, for each frequency, the amplitude P(05) defining the 5th percentile and the amplitude P(95) defining the 95th percentile; and (c) normalizing, for each frequency, the input amplitude A(in) of the speech signal to obtain an output signal amplitude ##EQU6##
-
-
2. System for normalizing a speech signal for a subsequent analysis or recognition process, characterized by
means for generating an amplitude histogram for each of several frequency bands, comprising for each of k consecutive amplitude ranges in each frequency band, a histogram count of speech signal amplitudes falling into the respective amplitude range; -
means for extracting, for each frequency from its histogram, two percentile amplitude values P(05) and P(95) indicative of the 5th and 95th percentile of the histogram counts, respectively. means for storing said percentile amplitude values; and means responsive to consecutive amplitude values A(in) of a current speech signal and to the percentile amplitude values available in said storing means, for generating normalized speech signal amplitude values ##EQU7## for each said frequency band.
-
-
3. A method for normalization a speech signal for subsequent speech recognition, the method comprising the steps of:
-
(a) filtering an input speech signal into a plurality of frequency bands; (b) sampling the input speech signal at a prescribed sampling rate; (c) for one sample after another in an analysis interval of the input speech signal, detecting amplitudes values of the input speech signal in each frequency band; (d) generating, based on the amplitude values detected in step (c), a respective low percentile amplitude value P(lo) and a respective high percentile amplitude value P(hi) for each frequency band and storing the generated percentile amplitude values; and (e) for an amplitude value A(in) of the input speech signal in a given frequency band f, adapting the value A(in) to;
##EQU8## - View Dependent Claims (4, 5, 6, 7, 8, 9, 10)
-
Specification