Weighted frequency-channel background noise suppressor
Abstract
A method for implementing a noise suppressor in a speech recognition system comprises a filter bank for separating source speech data into discrete frequency sub-bands to generate filtered channel energy, and a noise suppressor for weighting the frequency sub-bands to improve the signal-to-noise ratio of the resultant noise-suppressed channel energy. The noise suppressor preferably includes a noise calculator for calculating background noise values, a speech energy calculator for calculating speech energy values for each channel of the filter bank, and a weighting module for applying calculated weighting values to the projected channel energy to generate the noise-suppressed channel energy.
42 Claims
1. A system for suppressing background noise in audio data, comprising:
a detector configured to perform a manipulation process on said audio data, said detector including a filter bank that generates filtered channel energy by separating said audio data into discrete frequency channels, said detector including a weighting module that weights selected components of said audio data to suppress said background noise, said weighting module generating noise-suppressed channel energy by applying separate weighting values directly to each of said discrete frequency channels of said filtered channel energy, said separate weighting values being related to background noise values of said discrete frequency channels; and
a processor coupled to said system to control said detector for suppressing said background noise.
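As a rough illustration of the structure recited in claim 1, the following Python sketch splits one frame of audio into discrete frequency channels, computes the energy in each channel, and applies a separate weight to each channel. The naive DFT filter bank, the equal-width band layout, and the function names `channel_energies` and `apply_channel_weights` are assumptions made for this example; the claim does not specify them. The weights are taken as inputs because the claim states only that they are related to the background noise values of the channels.

```python
import cmath
import math


def channel_energies(frame, num_channels):
    """Return the energy of one frame in each of `num_channels` frequency channels.

    Hypothetical filter bank: a naive DFT whose non-negative-frequency bins
    are grouped into contiguous, roughly equal-width bands.
    """
    n = len(frame)
    half = n // 2  # bins 0 .. half-1 cover the non-negative frequencies
    if num_channels < 1 or num_channels > half:
        raise ValueError("num_channels must be between 1 and len(frame) // 2")

    # Power of each DFT bin.
    power = []
    for k in range(half):
        acc = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                  for t, x in enumerate(frame))
        power.append(abs(acc) ** 2 / n)

    # Group bins into channels and sum the power inside each channel.
    energies = []
    for m in range(num_channels):
        lo = m * half // num_channels
        hi = (m + 1) * half // num_channels
        energies.append(sum(power[lo:hi]))
    return energies


def apply_channel_weights(energies, weights):
    """Multiply each channel energy by its own weight (one weight per channel)."""
    if len(energies) != len(weights):
        raise ValueError("one weight is required per channel")
    return [y * w for y, w in zip(energies, weights)]
```

A caller would compute `channel_energies(frame, M)` for each frame and pass the result to `apply_channel_weights` together with M weights derived from its own noise estimate; channels with a smaller weight contribute less to any later energy-based decision.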
9. A system for suppressing background noise in audio data, comprising:
a detector configured to perform a manipulation process on said audio data that includes digital source speech data provided to said speech detector by an analog sound sensor and an analog-to-digital converter, said detector including a filter bank that generates filtered channel energy by separating said digital source speech data into discrete frequency channels, said detector including a speech detector with program instructions that are stored in a memory device, said speech detector including a noise suppressor with a noise calculator, a speech energy calculator, and a weighting module, said speech detector weighting selected components of said audio data to suppress said background noise, said noise calculator deriving a channel average background noise value “Ni(m)” for a channel m at a frame i by using an iterative equation
10. A system for suppressing background noise in audio data, comprising:
a detector configured to perform a manipulation process on said audio data that includes digital source speech data provided to said speech detector by an analog sound sensor and an analog-to-digital converter, said detector including a filter bank that generates filtered channel energy by separating said digital source speech data into discrete frequency channels, said detector including a speech detector with program instructions that are stored in a memory device, said speech detector including a noise suppressor with a noise calculator, a speech energy calculator, and a weighting module, said speech detector weighting selected components of said audio data to suppress said background noise, said noise calculator utilizing a non-linear spectrum subtraction procedure that removes a mean value and produces a channel average background noise variance value “Vi(m)” for a channel m at a frame i, said channel average background noise variance value “Vi(m)” for said channel m at said frame i being calculated using an iterative equation
11. The system of claim 10 wherein said a is equal to 0.985 which is equivalent to a window size of 145 frames.
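The iterative equations referred to in claims 9 through 11 are not reproduced in this listing. Purely to illustrate what an iterative per-channel estimate with a smoothing constant can look like, the sketch below uses a first-order recursive average. That form is a common choice but is an assumption here, not necessarily the claimed equation. The names `update_noise_estimate` and `alpha` are hypothetical; the default of 0.985 is the value stated in claim 11.

```python
def update_noise_estimate(prev_estimate, channel_energy, alpha=0.985):
    """Advance a per-channel running average by one frame.

    Assumed form (NOT taken from the patent text):
        N_i(m) = alpha * N_{i-1}(m) + (1 - alpha) * y_i(m)
    where y_i(m) is the energy of channel m at frame i.
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    if len(prev_estimate) != len(channel_energy):
        raise ValueError("both inputs need one value per channel")
    return [alpha * n + (1.0 - alpha) * y
            for n, y in zip(prev_estimate, channel_energy)]
```

With a constant close to 1, each new frame changes the estimate only slightly, so the value tracks slow changes in the background and is barely affected by a single loud frame.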
6. A system for suppressing background noise in audio data, comprising:
a detector configured to perform a manipulation process on said audio data that includes digital source speech data provided to said speech detector by an analog sound sensor and an analog-to-digital converter, said detector including a filter bank that generates filtered channel energy by separating said digital source speech data into discrete frequency channels, said detector including a speech detector with program instructions that are stored in a memory device, said speech detector including a noise suppressor with a noise calculator, a speech energy calculator, and a weighting module, said speech detector weighting selected components of said audio data to suppress said background noise, said noise calculator calculating background noise values during a silent segment of said audio data, said silent segment being located below an ending noise-calculation threshold that is expressed by the formula;
7. A system for suppressing background noise in audio data, comprising:
a detector configured to perform a manipulation process on said audio data that includes digital source speech data provided to said speech detector by an analog sound sensor and an analog-to-digital converter, said detector including a filter bank that generates filtered channel energy by separating said digital source speech data into discrete frequency channels, said detector including a speech detector with program instructions that are stored in a memory device, said speech detector including a noise suppressor with a noise calculator, a speech energy calculator, and a weighting module, said speech detector weighting selected components of said audio data to suppress said background noise, said noise calculator calculating background noise values during a silent segment of said audio data, said silent segment being located below a beginning noise-calculation threshold that is expressed by the formula;
12. A system for suppressing background noise in audio data, comprising:
a detector configured to perform a manipulation process on said audio data that includes digital source speech data provided to said speech detector by an analog sound sensor and an analog-to-digital converter, said detector including a filter bank that generates filtered channel energy by separating said digital source speech data into discrete frequency channels, said detector including a speech detector with program instructions that are stored in a memory device, said speech detector including a noise suppressor with a noise calculator, a speech energy calculator, and a weighting module, said speech detector weighting selected components of said audio data to suppress said background noise, said weighting module generating noise-suppressed channel energy by applying separate weighting values to each of said discrete frequency channels of said filtered channel energy, said separate weighting values being related to background noise values of said discrete frequency channels; and
a processor coupled to said system to control said detector for suppressing said background noise.
15. The system of claim 12 wherein said weighting module calculates a weighting value “wi(m)” for said channel “i” using a formula
16. The system of claim 12 wherein said weighting module calculates a weighting value “wi(m)” for said channel “i” using a formula
17. The system of claim 16 wherein said MINV is equal to one of a value between 0.0001 and 0.0002, and a value equal to 0.00013.
18. The system of claim 12 wherein an endpoint detector analyzes said noise-suppressed channel energy to generate an endpoint signal.
19. The system of claim 18 wherein said endpoint detector calculates endpoint detection parameters according to a formula
$$(i) = \sum_{m=0}^{M-1} y_i(m)\, w_i(m)$$
where said wi(m) is a respective weighting value, said yi(m) is a channel signal energy value of said channel m at said frame i, and said M is a total number of said channels of said filter bank.
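The weighted sum recited in claim 19 can be sketched in a few lines of Python. The function name `endpoint_parameter` is hypothetical, and because the symbol on the left-hand side of the formula does not survive in this listing, the result is referred to only generically as the endpoint detection parameter for frame i.

```python
def endpoint_parameter(channel_energy, weights):
    """Sum y_i(m) * w_i(m) over the M channels of one frame.

    channel_energy: y_i(m) for m = 0 .. M-1
    weights:        w_i(m) for m = 0 .. M-1
    """
    if len(channel_energy) != len(weights):
        raise ValueError("one weight is required per channel")
    return sum(y * w for y, w in zip(channel_energy, weights))
```

For example, `endpoint_parameter([2.0, 4.0, 6.0], [0.5, 0.25, 0.0])` evaluates to 2.0: the third channel is weighted to zero and contributes nothing to the parameter.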
20. The system of claim 19 wherein a recognizer analyzes said endpoint signals and feature vectors from a feature extractor to generate a speech detection result for said speech detector.
21. A method for suppressing background noise in audio data, comprising:
performing a manipulation process on said audio data using a detector that includes a filter bank that generates filtered channel energy by separating said audio data into discrete frequency channels, said detector including a weighting module that weights selected components of said audio data to suppress said background noise, said weighting module generating noise-suppressed channel energy by applying separate weighting values directly to each of said discrete frequency channels of said filtered channel energy, said separate weighting values being related to background noise values of said discrete frequency channels; and
controlling said detector with a processor to thereby suppress said background noise.
performing a manipulation process on said audio data using a detector, said audio data including digital source speech data provided to said speech detector by an analog sound sensor and an analog-to-digital converter, said detector including a filter bank that generates filtered channel energy by separating said digital source speech data into discrete frequency channels, said detector including a speech detector with program instructions that are stored in a memory device, said speech detector including a noise suppressor with a noise calculator, a speech energy calculator, and a weighting module, said speech detector weighting selected components of said audio data to suppress said background noise, said noise calculator calculating background noise values during a silent segment of said audio data, said silent segment being located below an ending noise-calculation threshold that is expressed by the formula;
28. The method of claim 25 wherein said noise calculator derives a channel average background noise value “Ni(m)” for a channel m at a frame i by using an iterative equation
27. A method for suppressing background noise in audio data, comprising:
performing a manipulation process on said audio data using a detector, said audio data including digital source speech data provided to said speech detector by an analog sound sensor and an analog-to-digital converter, said detector including a filter bank that generates filtered channel energy by separating said digital source speech data into discrete frequency channels, said detector including a speech detector with program instructions that are stored in a memory device, said speech detector including a noise suppressor with a noise calculator, a speech energy calculator, and a weighting module, said speech detector weighting selected components of said audio data to suppress said background noise, said noise calculator calculating background noise values during a silent segment of said audio data, said silent segment being located below an ending noise-calculation threshold that is expressed by the formula;
29. A method for suppressing background noise in audio data, comprising:
performing a manipulation process on said audio data using a detector, said audio data including digital source speech data provided to said speech detector by an analog sound sensor and an analog-to-digital converter, said detector including a filter bank that generates filtered channel energy by separating said digital source speech data into discrete frequency channels, said detector including a speech detector with program instructions that are stored in a memory device, said speech detector including a noise suppressor with a noise calculator, a speech energy calculator, and a weighting module, said speech detector weighting selected components of said audio data to suppress said background noise, said noise calculator deriving a channel average background noise value “Ni(m)” for a channel m at a frame i by using an iterative equation
30. A method for suppressing background noise in audio data, comprising:
performing a manipulation process on said audio data using a detector, said audio data including digital source speech data provided to said speech detector by an analog sound sensor and an analog-to-digital converter, said detector including a filter bank that generates filtered channel energy by separating said digital source speech data into discrete frequency channels, said detector including a speech detector with program instructions that are stored in a memory device, said speech detector including a noise suppressor with a noise calculator, a speech energy calculator, and a weighting module, said speech detector weighting selected components of said audio data to suppress said background noise, said noise calculator utilizing a non-linear spectrum subtraction procedure that removes a mean value and produces a channel average background noise variance value “Vi(m)” for a channel m at a frame i, said channel average background noise variance value “Vi(m)” for said channel m at said frame i being calculated using an iterative equation
32. A method for suppressing background noise in audio data, comprising:
performing a manipulation process on said audio data using a detector, said audio data including digital source speech data provided to said speech detector by an analog sound sensor and an analog-to-digital converter, said detector including a filter bank that generates filtered channel energy by separating said digital source speech data into discrete frequency channels, said detector including a speech detector with program instructions that are stored in a memory device, said speech detector including a noise suppressor with a noise calculator, a speech energy calculator, and a weighting module, said speech detector weighting selected components of said audio data to suppress said background noise, said weighting module generating noise-suppressed channel energy by applying separate weighting values to each of said discrete frequency channels of said filtered channel energy, said separate weighting values being related to background noise values of said discrete frequency channels; and
controlling said detector with a processor to thereby suppress said background noise.
35. The method of claim 32 wherein said weighting module calculates a weighting value “wi(m)” for said channel “i” using a formula
36. The method of claim 32 wherein said weighting module calculates a weighting value “wi(m)” for said channel “i” using a formula
37. The method of claim 36 wherein said MINV is equal to one of a value between 0.0001 and 0.0002, and a value equal to 0.00013.
38. The method of claim 32 wherein an endpoint detector analyzes said noise-suppressed channel energy to generate an endpoint signal.
39. The method of claim 38 wherein said endpoint detector calculates endpoint detection parameters according to a formula
$$(i) = \sum_{m=0}^{M-1} y_i(m)\, w_i(m)$$
where said wi(m) is a respective weighting value, said yi(m) is a channel signal energy value of said channel m at said frame i, and said M is a total number of said channels of said filter bank.
40. The method of claim 39 wherein a recognizer analyzes said endpoint signals and feature vectors from a feature extractor to generate a speech detection result for said speech detector.
41. A computer-readable medium comprising program instructions for suppressing background noise by:
performing a manipulation process on said audio data using a detector that includes a filter bank that generates filtered channel energy by separating said audio data into discrete frequency channels, said detector including a weighting module that weights selected components of said audio data to suppress said background noise, said weighting module generating noise-suppressed channel energy by applying separate weighting values directly to each of said discrete frequency channels of said filtered channel energy, said separate weighting values being related to background noise values of said discrete frequency channels; and
controlling said detector with a processor to thereby suppress said background noise.
42. A system for suppressing background noise in audio data, comprising:
means for performing a manipulation process on said audio data, said means for performing including a filter bank that generates filtered channel energy by separating said audio data into discrete frequency channels, said means for performing also including a weighting module that weights selected components of said audio data to suppress said background noise, said weighting module generating noise-suppressed channel energy by applying separate weighting values directly to each of said discrete frequency channels of said filtered channel energy, said separate weighting values being related to background noise values of said discrete frequency channels;
means for controlling said means for performing to thereby suppress said background noise.
Specification