System for adaptive enhancement of speech signals
First Claim
1. A computer-implemented method of enhancing a frequency response of a received speech signal, the method comprising:
performing, through the use of a processor, a frequency sub-band analysis on successive overlapping windowed buffers of the received speech signal to generate a compressed dB spectrum of the received speech signal for each successive overlapping windowed buffer;
adapting a running average of a spectral shape of speech based on a current compressed dB spectrum corresponding to one of the successive overlapping windowed buffers;
subtracting, through the use of the processor, the adapted running average of the spectral shape of speech from a target spectral shape, the difference between the target spectral shape and the adapted running average of the spectral shape of speech comprising a spectral shape correction factor; and
adding, through the use of the processor, the spectral shape correction factor to the current compressed dB spectrum.
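The four steps of claim 1 can be sketched in plain Python as follows. This is a minimal illustration, not the patented implementation: the Hann window, the naive DFT, the frame length, the hop size, and the smoothing constant `alpha` are all assumptions, and the plain dB magnitude spectrum stands in here for the "compressed dB spectrum" named in the claim.

```python
import cmath
import math


def db_spectrum(frame):
    """Hann-window one buffer and return its magnitude spectrum in dB."""
    n = len(frame)
    windowed = [x * 0.5 * (1.0 - math.cos(2.0 * math.pi * i / n))
                for i, x in enumerate(frame)]
    spectrum = []
    for k in range(n // 2 + 1):
        # Naive DFT bin k (assumption: a real system would use an FFT).
        acc = sum(windowed[i] * cmath.exp(-2j * math.pi * k * i / n)
                  for i in range(n))
        # Small floor avoids log10(0) for silent bins.
        spectrum.append(20.0 * math.log10(abs(acc) + 1e-12))
    return spectrum


def enhance(signal, target_db, frame_len=64, hop=32, alpha=0.1):
    """Return corrected dB spectra, one per overlapping buffer.

    target_db must hold frame_len // 2 + 1 values (one per bin).
    """
    avg_db = None
    corrected_frames = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        # Step 1: sub-band analysis of the current windowed buffer.
        cur = db_spectrum(signal[start:start + frame_len])
        # Step 2: adapt the running average of the speech spectral shape.
        if avg_db is None:
            avg_db = list(cur)  # assumption: seed with the first frame
        else:
            avg_db = [a + alpha * (c - a) for a, c in zip(avg_db, cur)]
        # Step 3: correction factor = target shape minus running average.
        correction = [t - a for t, a in zip(target_db, avg_db)]
        # Step 4: add the correction factor to the current dB spectrum.
        corrected_frames.append([c + k for c, k in zip(cur, correction)])
    return corrected_frames
```

Because everything is done in dB, the correction is an addition per sub-band, which corresponds to a multiplicative gain on the linear magnitude spectrum. Re-synthesis of the time-domain signal is not shown.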
Abstract
A method and system for enhancing the frequency response of speech signals are provided. An average speech spectral shape estimate is calculated over time based on the input speech signal. The estimate may be calculated in the frequency domain using first-order IIR filters, or “leaky integrators.” The average speech spectral shape estimate therefore adapts over time to changes in the acoustic characteristics of the voice path, or to any changes in the electrical audio path that may affect the frequency response of the system. A spectral correction factor may be determined by comparing the average speech spectral shape estimate to a desired target spectral shape. The spectral correction factor may be added (in units of dB) to the spectrum of the input speech signal in order to adjust that spectrum toward the desired spectral shape, and an enhanced speech signal may then be re-synthesized from the corrected spectrum.
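A first-order IIR "leaky integrator" of the kind the abstract mentions can be sketched as below. The function name `update_average` and the value of `alpha` are illustrative assumptions; the abstract does not specify a smoothing constant or an initial state.

```python
def update_average(avg_db, frame_db, alpha):
    """One leaky-integrator step per sub-band.

    Equivalent to avg = (1 - alpha) * avg + alpha * frame, a first-order
    IIR low-pass filter applied independently to each sub-band's dB value.
    """
    return [a + alpha * (x - a) for a, x in zip(avg_db, frame_db)]


# Hypothetical scenario: the voice path changes so that the second
# sub-band becomes 6 dB louder. The average drifts toward the new shape.
avg = [0.0, 0.0]
new_shape = [0.0, 6.0]
first_step = update_average(avg, new_shape, alpha=0.05)
for _ in range(200):
    avg = update_average(avg, new_shape, alpha=0.05)
```

A smaller `alpha` makes the estimate adapt more slowly and smooths out frame-to-frame variation in the speech; a larger `alpha` tracks changes in the audio path faster at the cost of a noisier estimate.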
18 Claims
1. A computer-implemented method of enhancing a frequency response of a received speech signal, the method comprising:
performing, through the use of a processor, a frequency sub-band analysis on successive overlapping windowed buffers of the received speech signal to generate a compressed dB spectrum of the received speech signal for each successive overlapping windowed buffer;
adapting a running average of a spectral shape of speech based on a current compressed dB spectrum corresponding to one of the successive overlapping windowed buffers;
subtracting, through the use of the processor, the adapted running average of the spectral shape of speech from a target spectral shape, the difference between the target spectral shape and the adapted running average of the spectral shape of speech comprising a spectral shape correction factor; and
adding, through the use of the processor, the spectral shape correction factor to the current compressed dB spectrum.
Dependent claims: 2, 3, 4, 5, 6, 7, 9
8. The method of claim 1 where the target spectral shape corresponds to an ideal spectral shape of a speech signal input to a voice recognition system.
10. A system for enhancing the frequency response of a speech signal comprising:
a microphone for capturing a speech signal;
an A/D converter for converting the speech signal into a digital speech signal; and
a processor adapted to continuously update a running average of a spectral shape of the speech signal received at the microphone, to subtract the continuously updated running average of the spectral shape of the speech signal from a target spectral shape, the difference between the target spectral shape and the adapted running average of the spectral shape of speech comprising a speech spectral shape correction factor, and to adjust the speech signal using the speech spectral shape correction factor.
Dependent claims: 11, 12, 13
14. A computer-implemented method of enhancing a frequency response of a speech signal comprising:
performing, through the use of a processor, a frequency sub-band analysis on successive overlapping windowed buffers of the speech signal to generate a compressed dB spectrum of the received speech signal for each successive overlapped windowed buffer;
generating, through the use of the processor, a background noise estimate across the frequency sub-bands;
generating, through the use of the processor, a background noise spectral shape correction factor by subtracting the background noise estimate from a target background noise spectral shape; and
adding, through the use of the processor, the background noise spectral shape correction factor to a spectrum corresponding to one of the successive overlapping windowed buffers.
Dependent claims: 15, 16, 17
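The background noise steps of claim 14 can be illustrated with the sketch below. The claim does not say how the noise estimate is generated, so the slow-rise, fast-fall tracker in `update_noise_estimate`, along with its `rise_db` and `fall_db` step sizes, is purely an assumption made for illustration; only the subtraction and addition steps come from the claim text.

```python
def update_noise_estimate(noise_db, frame_db, rise_db=0.5, fall_db=3.0):
    """Assumed noise tracker: creep up slowly, drop quickly, per sub-band.

    Speech bursts raise the estimate by at most rise_db per frame, while
    quiet frames pull it down by up to fall_db per frame.
    """
    updated = []
    for n, x in zip(noise_db, frame_db):
        if x > n:
            updated.append(min(x, n + rise_db))
        else:
            updated.append(max(x, n - fall_db))
    return updated


def noise_correction(noise_db, target_noise_db):
    """Correction factor = target noise shape minus the noise estimate."""
    return [t - n for t, n in zip(target_noise_db, noise_db)]


def apply_correction(frame_db, correction_db):
    """Add the correction factor (in dB) to the frame's dB spectrum."""
    return [x + c for x, c in zip(frame_db, correction_db)]


# Hypothetical two-band example.
noise = update_noise_estimate([-60.0, -60.0], [-50.0, -70.0])
correction = noise_correction(noise, [-65.0, -65.0])
shaped = apply_correction([-50.0, -70.0], correction)
```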
18. A computer-implemented method of enhancing a frequency response of a speech signal comprising:
performing, through the use of a processor, a frequency sub-band analysis on successive overlapping windowed buffers of said speech signal to generate a compressed dB spectrum of the received speech signal for each successive overlapped windowed buffer;
adapting a running average of a spectral shape of speech based on a current compressed dB spectrum corresponding to one of the successive overlapping windowed buffers;
subtracting, through the use of the processor, the adapted running average of the spectral shape of speech from a target spectral shape, the difference between the target spectral shape and the adapted running average of the spectral shape of speech comprising a spectral shape correction factor;
generating, through the use of the processor, a background noise estimate across the frequency sub-bands;
calculating, through the use of the processor, a background noise spectral shape correction factor corresponding to a difference between the background noise estimate and a target background noise spectral shape;
calculating, through the use of the processor, an overall spectral shape correction factor based on the speech spectral shape correction factor and the background noise spectral shape correction factor; and
adding, through the use of the processor, the overall spectral shape correction factor to a spectrum corresponding to one of the successive overlapping windowed buffers,
where the step of calculating, through the use of the processor, an overall spectral correction factor comprises inversely weighting the speech spectral shape correction factor and the background noise spectral shape correction factor according to a long term SNR estimate.
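One possible reading of the inverse weighting in claim 18 is sketched below: as the long-term SNR rises, the speech correction gets more weight and the noise correction correspondingly less. The linear mapping, the `low_db` and `high_db` thresholds, and the function names are assumptions for illustration; the claim does not state a specific weighting formula.

```python
def speech_weight(snr_db, low_db=0.0, high_db=20.0):
    """Map a long-term SNR estimate to a weight in [0, 1] (assumed linear)."""
    w = (snr_db - low_db) / (high_db - low_db)
    return min(1.0, max(0.0, w))


def overall_correction(speech_corr_db, noise_corr_db, snr_db):
    """Blend the two correction factors with complementary weights.

    High SNR favors the speech correction; low SNR favors the noise
    correction. The weights always sum to one.
    """
    w = speech_weight(snr_db)
    return [w * s + (1.0 - w) * n
            for s, n in zip(speech_corr_db, noise_corr_db)]


# Hypothetical two-band correction factors, in dB.
speech_corr = [4.0, -2.0]
noise_corr = [0.0, 2.0]
clean = overall_correction(speech_corr, noise_corr, snr_db=30.0)
noisy = overall_correction(speech_corr, noise_corr, snr_db=-5.0)
mixed = overall_correction(speech_corr, noise_corr, snr_db=10.0)
```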
Specification