Integrated frequency-domain voice coding using an adaptive spectral enhancement filter
First Claim
1. A system for encoding voice with integrated noise suppression, comprising:
- a sampler which converts an analog audio signal into frames of time-domain audio samples;
a voice activity detector operatively coupled to the sampler for determining presence or absence of speech in a current frame;
a transformer operatively coupled to the sampler for transforming the frame of time-domain audio samples to a frequency-domain representation;
a noise model adapter operatively associated with the voice activity detector and the transformer for updating a noise model using a current frame if the voice activity detector determines there is an absence of speech;
a transformer and filter creator operatively coupled to the transformer and the noise model adaptor to create a noise suppression filter; and
a spectral estimator operatively coupled to the transformer and the transformer and filter creator to remove noise characteristics from the frequency-domain representation of the current frame using the noise suppression filter and to develop a set of spectral magnitudes.
1 Assignment
0 Petitions
Accused Products
Abstract
A system for encoding voice while suppressing acoustic background noise and a method for suppressing acoustic background noise in a voice encoder are described herein. The voice encoder includes a sampler that captures frames of time-domain samples of an audio signal. A voice activity detector operatively coupled to the sampler determines presence or absence of speech in the current frame. A transformer is operatively coupled to the sampler for transforming the frame of time-domain audio samples into an estimate of the power spectrum of that frame. A noise model adapter operatively associated with the transformer updates a frequency-domain noise model based on the power spectrum estimate of the current frame if the voice activity detector indicates an absence of speech in this frame. A filter computation block operatively coupled to the noise model adapter and the transform computes a spectral enhancement (noise suppression) filter based on the current power spectrum estimate and the adapted noise model. A spectral enhancement block operatively coupled to the transformer and the filter computation block applies the spectral enhancement filter to the current power spectrum estimate. A quantizer and encoder block transforms the voice encoder model parameters, including the enhanced spectral magnitudes, into a frame of encoded bits.
-
Citations
43 Claims
-
1. A system for encoding voice with integrated noise suppression, comprising:
-
a sampler which converts an analog audio signal into frames of time-domain audio samples; a voice activity detector operatively coupled to the sampler for determining presence or absence of speech in a current frame; a transformer operatively coupled to the sampler for transforming the frame of time-domain audio samples to a frequency-domain representation; a noise model adapter operatively associated with the voice activity detector and the transformer for updating a noise model using a current frame if the voice activity detector determines there is an absence of speech; a transformer and filter creator operatively coupled to the transformer and the noise model adaptor to create a noise suppression filter; and a spectral estimator operatively coupled to the transformer and the transformer and filter creator to remove noise characteristics from the frequency-domain representation of the current frame using the noise suppression filter and to develop a set of spectral magnitudes. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A system for encoding voice with integrated noise suppression, comprising:
-
a sampler which converts an analog audio signal into frames of time-domain audio samples; a voice activity detector operatively coupled to the sampler for determining presence or absence of speech in a current frame; a transformer operatively coupled to the sampler for transforming the frame of time-domain audio samples to a frequency-domain representation; a noise model adapter operatively associated with the voice activity detector and the transformer for updating a noise model using a current frame if the voice activity detector determines there is an absence of speech; a transformer and filter creator operatively coupled to the transformer and the noise model adaptor to create a noise suppression filter; a spectral estimator operatively coupled to the transformer and the noise model adaptor to remove noise characteristics from the frequency-domain representation of the current frame and to develop a set of spectral magnitudes; and a quantizer and encoder for transforming the developed spectral magnitudes into a frame of encoded bits.
-
-
20. A system for encoding voice with integrated noise suppression, comprising:
-
a sampler which converts an analog audio signal into frames of time-domain audio samples; a voice activity detector operatively coupled to the sampler for determining presence or absence of speech in a current frame; a transformer operatively coupled to the sampler for transforming the frame of time-domain audio samples to a frequency-domain representation; a noise model adapter operatively associated with the voice activity detector and the transformer for updating a noise model using a current frame if the voice activity detector determines there is an absence of speech; a transformer and filter creator operatively coupled to the transformer and the noise model adaptor to create a noise suppression filter; and a spectral estimator operatively coupled to the transformer and the noise model adaptor to remove noise characteristics from the frequency-domain representation of the current frame and to develop a set of spectral magnitudes, wherein the system comprises a multi-band excitation voice encoder.
-
-
21. A system for encoding voice with integrated noise suppression, comprising:
-
a sampler which converts an analog audio signal into frames of time-domain audio samples; a voice activity detector operatively coupled to the sampler for determining presence or absence of speech in a current frame; a transformer operatively coupled to the sampler for transforming the frame of time-domain audio samples to a frequency-domain representation; a noise model adapter operatively associated with the voice activity detector and the transformer for updating a noise model using a current frame if the voice activity detector determines there is an absence of speech; a transformer and filter creator operatively coupled to the transformer and the noise model adaptor to create a noise suppression filter; and a spectral estimator operatively coupled to the transformer and the noise model adaptor to remove noise characteristics from the frequency-domain representation of the current frame using the noise suppression filter and to develop a set of spectral magnitudes, wherein the system comprises a sinusoidal transform voice encoder.
-
-
22. A system for encoding voice with integrated noise suppression, comprising:
-
a sampler which converts an analog audio signal into frames of time-domain audio samples; a voice activity detector operatively coupled to the sampler for determining presence or absence of speech in a current frame; a transformer operatively coupled to the sampler for transforming the frame of time-domain audio samples to a frequency-domain representation; a noise model adapter operatively associated with the voice activity detector and the transformer for updating a noise model using a current frame if the voice activity detector determines there is an absence of speech, the noise model adapter storing a vector of noise model parameters; a transformer and filter creator operatively coupled to the transformer and the noise model adaptor to create a noise suppression filter; and a spectral estimator operatively coupled to the transformer and the noise model adaptor to remove noise characteristics from the frequency-domain representation of the current frame and to develop a set of spectral magnitudes, wherein the voice encoder comprises a multi-band excitation (MBE) voice encoder and wherein the noise model is stored in the same format as the spectral magnitudes of the MBE model.
-
-
23. A system for encoding voice with integrated noise suppression, comprising:
-
a sampler which converts an analog audio signal into frames of time-domain audio samples; a detector operatively coupled to the sampler for determining presence or absence of speech in a current frame; a transformer operatively coupled to the sampler for transforming the frame of time-domain audio samples to a frequency-domain representation; a noise model adapter operatively associated with the voice activity detetor and the transformer for updating a noise model using a current frame if the voice activity detector determines there is an absence of speech; a transformer and filter creator operatively coupled to the transformer and the noise model adapter to convert between a domain of the noise model adapter and the frequency-domain representation and to create a noise suppression filter; a spectral estimator operatively coupled to the transformer and the noise model adaptor to remove noise characteristics from the frequency-domain representation of the current frame using the noise suppression filter; and an encoder transformer coupled to the spectral estimator for transforming the frequency-domain representation of the current frame, having noise characteristics removed, into a frame of encoded bits.
-
-
24. A method of suppressing noise in a voice encoder, comprising the steps of:
-
converting a received analog audio signal into frames of time-domain audio samples; determining presence or absence of speech in a current frame of the time-domain audio samples; transforming the frame time-domain audio samples to a frequency-domain representation; updating a noise model using the transformed current frame if there is an absence of speech creating a noise suppression filter from the frequency-domain representation; and removing noise characteristics from the frequency-domain representation of the current frame using the noise suppression filter and developing a set of spectral magnitudes. - View Dependent Claims (25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41)
-
-
42. Method of suppressing noise in a voice encoder, comprising the steps of:
-
converting a received analog audio signal into frames of time-domain audio samples; determining presence or absence of speech in a current frame of the time-domain audio samples; transforming the frame time-domain audio samples to a frequency-domain representation; updating a noise model using the transformed current frame if there is an absence of speech; creating a noise suppression filter from the frequency-domain representation; removing noise characteristics from the frequency-domain representation of the current frame and developing a set of spectral magnitudes; and transforming the developed spectral magnitudes into a frame of encoded bits.
-
-
43. A method of suppressing noise in a voice encoder, comprising the steps of:
-
converting a received analog audio signal into frames of time-domain audio samples; determining presence or absence of speech in a current frame of the time-domain audio samples; transforming the frame time-domain audio samples to a frequency-domain representation; updating a noise model using the transformed current frame if there is an absence of speech, wherein the updating step stores a vector of noise model parameters; creating a noise suppression filter from the frequency-domain representation; and removing noise characteristics from the frequency-domain representation of the current frame and developing a set of spectral magnitudes, wherein the voice encoder comprises a multi-band excitation (MBE) voice encoder and wherein the noise model is stored in the same format as the spectral magnitudes of the MBE model.
-
Specification