Reduction of background noise for speech enhancement
First Claim
1. An apparatus for the perceived real-time suppression of background noise in an input audio signal having components of noise comprising:
- a framer for dividing the input audio signal into a sequence of audio signal frames;
a windower for producing a current windowed audio signal frame, combining all of one audio signal frame with some of the audio signal frame immediately preceding in time the one audio signal frame;
a transformer for obtaining a group of frequency spectrum components from the current windowed audio signal frame;
a noise estimator using the frequency spectrum components to produce a noise estimate of an amount of noise in the frequency spectrum components;
a noise suppression spectral modifier for producing current gain multiplicative factors based on the noise estimate and the frequency spectrum components;
a delayer for delaying, by a fixed number of frames, the frequency spectrum components of said sequence of audio signal frames to produce delayed frequency spectrum components;
a controlled attenuator for attenuating the delayed frequency spectrum components of a previous frame of said sequence based on the current gain multiplicative factors produced using said current frame to produce noise-reduced frequency components; and
an inverse transformer for converting the noise-reduced frequency components to the time domain.
6 Assignments
0 Petitions
Accused Products
Abstract
Properties of human audio perception are used to perform spectral and time masking to reduce perceived loudness of noise added to the speech signal. A signal is divided temporally into blocks which are then passed through notch filters to remove narrow frequency band components of the noise. Each block is then appended to part of the previous block in a manner which avoids block boundary discontinuities. An FFT is then performed on the resulting larger block, after which the spectral components of the signal are fed to a background noise estimator. Each frequency component of the signal is analyzed with respect to the background noise to determine, within various confidence levels, whether it is pure noise or a noise-and-signal combination. The frequency band'"'"'s gain function is determined, based on the confidence levels. A spectral valley finder detects and fills in spectral valleys in the frequency component gain function, after which the function is used to modify the magnitude components of the FFT. An inverse FFT then maps the signal back from the frequency domain to the time domain to give a frame of noise-reduced signal. This signal is then multiplied by a temporal window and joined to the previous frame'"'"'s signal to derive the output.
264 Citations
21 Claims
-
1. An apparatus for the perceived real-time suppression of background noise in an input audio signal having components of noise comprising:
-
a framer for dividing the input audio signal into a sequence of audio signal frames; a windower for producing a current windowed audio signal frame, combining all of one audio signal frame with some of the audio signal frame immediately preceding in time the one audio signal frame; a transformer for obtaining a group of frequency spectrum components from the current windowed audio signal frame; a noise estimator using the frequency spectrum components to produce a noise estimate of an amount of noise in the frequency spectrum components; a noise suppression spectral modifier for producing current gain multiplicative factors based on the noise estimate and the frequency spectrum components; a delayer for delaying, by a fixed number of frames, the frequency spectrum components of said sequence of audio signal frames to produce delayed frequency spectrum components; a controlled attenuator for attenuating the delayed frequency spectrum components of a previous frame of said sequence based on the current gain multiplicative factors produced using said current frame to produce noise-reduced frequency components; and an inverse transformer for converting the noise-reduced frequency components to the time domain. - View Dependent Claims (2, 3, 4, 5)
-
-
6. An apparatus for the perceived suppression of background noise in an input audio signal having components of noise comprising:
-
a framer for dividing the input audio signal into a sequence of audio signal frames; a windower for producing a current windowed audio signal frame, combining all of one audio signal frame with some of the audio signal frame immediately preceding in time the one audio signal frame; a transformer for obtaining a group of frequency spectrum components from the windowed audio signal frame; a noise estimator using the frequency spectrum components to produce a noise estimate of an amount of noise in the frequency spectrum components; a noise suppression spectral modifier for producing current gain multiplicative factors based on the noise estimate and the frequency spectrum components; a delayer for delaying the frequency spectrum components of said sequence of audio signal frames to produce delayed frequency spectrum components;
a controlled attenuator for attenuating the delayed frequency spectrum components of a previous frame of said sequence based on the current gain multiplicative factors produced using said current frame to produce noise-reduced frequency components; andan inverse transformer for converting the noise-reduced frequency components to the time domain, and wherein the noise suppression spectral modifier further comprises; means for using previously produced gain multiplicative factors for determining current gain multiplicative factors; a global decision mechanism for making, for a group of the frequency spectrum components of a frame a determination as to whether that group is noise; a local noise decision mechanism for deriving, for each frequency component of the frequency spectrum components, a confidence level for each component, whether that frequency component is a noise component; a detector for determining, based on the confidence levels, initial gain multiplicative factors for each frequency component; and a spreading mechanism for spectrally and temporally effecting adjustment of the initial gain multiplicative factors. - View Dependent Claims (7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. An apparatus for the perceived suppression of background noise in an input audio signal having components of noise comprising:
-
a transformer for obtaining frequency spectrum components from an audio signal frame derived from the input audio signal; a detector for determining multiplicative gain factors for each frequency component; a spreading mechanism for adjusting the multiplicative gain factors for effecting temporal and spectral spreading; and a controlled attenuator for attenuating the frequency components in accordance with said adjusted multiplicative gain factors to derive a noise-modified spectral signal.
-
-
16. A method for reducing the perception of background noise of an input audio signal comprising the steps of:
-
dividing the input audio signal into a sequence of audio signal frames; producing a windowed audio signal frame; combining all of one audio signal frame with some of the audio signal frame immediately preceding in time the one audio signal frame to obtain a current frame; obtaining a group of frequency spectrum components from the windowed audio signal frame; using the frequency spectrum components to produce a noise estimate of an amount of noise in the frequency spectrum components; producing current gain multiplicative factors based on the noise estimate and the frequency spectrum components; delaying, by a fixed number of frames, the frequency spectrum components of said sequence to produce delayed frequency spectrum components; attenuating the delayed frequency spectrum components of a previous frame based on the current gain multiplicative factors produced using said current frame to produce noise-reduced frequency components; and converting the noise-reduced frequency components to the time domain. - View Dependent Claims (17, 18)
-
-
19. A method for reducing the perception of background noise of an input audio signal comprising the steps of:
-
dividing the input audio signal into a sequence of audio signal frames; producing a windowed audio signal frame; combining all of one audio signal frame with some of the audio signal frame immediately preceding in time the one audio signal frame to obtain a current frame; obtaining a group frequency spectrum components from the current windowed audio signal frame; using the frequency spectrum components to produce a noise estimate of an amount of noise in the frequency spectrum components; producing current gain multiplicative factors based on the noise estimate and the frequency spectrum components; delaying the frequency spectrum components of said sequence to produce delayed frequency spectrum components; attenuating the delayed frequency spectrum components of a previous frame based on the current gain multiplicative factors produced using said current frame to produce noise-reduced frequency components; converting the noise-reduced frequency components to the time domain; and wherein the step of producing current gain multiplicative factors further comprises the steps of; using previously produced gain multiplicative factors for determining said current gain multiplicative factors; making, for a group of the frequency spectrum components of a frame, a determination as to whether that group is noise; deriving, for each frequency component of the frequency spectrum components, a confidence level for each component, whether that frequency component is a noise component; determining, based on the confidence levels, initial gain multiplicative factors for each frequency component; and spectrally and temporally effecting adjustment of the initial gain multiplicative factors. - View Dependent Claims (20)
-
-
21. A method for reducing the perception of background noise in an input audio signal having components of noise comprising the steps of:
-
obtaining frequency spectrum components from an audio signal frame derived from the input audio signal; determining multiplicative gain factors for each frequency component; adjusting the multiplicative gain factors for effecting temporal and spectral spreading; and attenuating the frequency components in accordance with said adjusted multiplicative gain factors to derive a noise-modified spectral signal.
-
Specification