System for suppressing acoustic echoes and interferences in multi-channel audio systems
Abstract
A method for obtaining a clean speech signal in a communication system having a transducer for receiving a clean speech signal from a user and having a pair of loudspeakers for providing an output signal to the user. The output signal contains loudspeaker signals which interfere with the clean speech signal, the loudspeaker signals traveling through acoustic paths to reach the transducer. The transducer receives an input signal containing the loudspeaker signals and the clean speech signal. The method includes a number of steps, namely, performing a short time Fourier transform (STFT) on the input signal to obtain at least one frequency component, performing a short time Fourier transform (STFT) on the loudspeaker signals to obtain frequency components, summing the frequency components to obtain an interference sum, and subtracting the interference sum from the at least one frequency component to obtain the clean speech signal for translation into a time domain.
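The steps in the abstract can be sketched for a single frame as follows. This is a minimal illustration, not the patented implementation: it uses a naive DFT on one frame instead of a windowed, overlapping STFT, and it assumes the per-bin path responses `h1` and `h2` are already known. The names `dft`, `idft`, `suppress_frame`, `h1`, and `h2` are hypothetical.

```python
import cmath
import math


def dft(frame):
    """Naive discrete Fourier transform of one real or complex frame."""
    n = len(frame)
    return [
        sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


def idft(spectrum):
    """Naive inverse DFT; returns the real part of each time sample."""
    n = len(spectrum)
    return [
        (sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)) / n).real
        for t in range(n)
    ]


def suppress_frame(mic, spk1, spk2, h1, h2):
    """Subtract the summed loudspeaker interference from one microphone frame.

    h1 and h2 are per-bin complex responses of the two acoustic paths.
    They are assumed to be known here; the abstract does not say how
    they are obtained.
    """
    mic_bins = dft(mic)
    spk1_bins = dft(spk1)
    spk2_bins = dft(spk2)
    # Interference sum: both loudspeaker components, each shaped by its path.
    interference = [
        h1[k] * spk1_bins[k] + h2[k] * spk2_bins[k] for k in range(len(mic_bins))
    ]
    # Subtract in the frequency domain, then translate back to the time domain.
    clean_bins = [mic_bins[k] - interference[k] for k in range(len(mic_bins))]
    return idft(clean_bins)
```

With frequency-flat paths (the same gain in every bin) the subtraction is exact within a frame. Real acoustic paths have impulse responses that span frame boundaries, so a practical system would need windowing and overlap-add at minimum.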
82 Citations
23 Claims
1. A method for suppressing an interference signal from a microphone output signal to produce a clean speech signal, the interference signal being first and second loudspeaker signals modified by first and second acoustic paths through which the loudspeaker signals reach a microphone, the interference signal combining with the clean speech signal to form the microphone output signal, the method comprising:
determining an acoustic response for each of the first and second acoustic paths in a frequency domain;
determining an estimate of the interference signal in a frequency domain using the acoustic response for each of the first and second acoustic paths;
suppressing the estimate of interference signal from the microphone output signal to obtain the clean speech signal in the frequency domain; and
translating the clean speech signal into time domain.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 22, 23
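Claim 1 does not state how the acoustic response of each path is determined. One possible approach, shown here purely as an assumption, is a per-bin least-squares fit over several frames in which the local user is silent, so that the microphone bin is modelled as `h1 * x1 + h2 * x2`. The function name `estimate_responses` and its arguments are hypothetical.

```python
def estimate_responses(x1_frames, x2_frames, y_frames, eps=1e-12):
    """Least-squares estimate of two path responses for one frequency bin.

    Each argument is a list of complex values for the same bin, one value
    per frame. Assumes (not stated in the claim) that y contains only the
    two loudspeaker contributions during these frames.
    """
    # Normal equations A @ [h1, h2] = b for the 2x2 complex system.
    a11 = sum(abs(x) ** 2 for x in x1_frames)
    a22 = sum(abs(x) ** 2 for x in x2_frames)
    a12 = sum(x1.conjugate() * x2 for x1, x2 in zip(x1_frames, x2_frames))
    a21 = a12.conjugate()
    b1 = sum(x1.conjugate() * y for x1, y in zip(x1_frames, y_frames))
    b2 = sum(x2.conjugate() * y for x2, y in zip(x2_frames, y_frames))

    det = a11 * a22 - a12 * a21
    if abs(det) < eps:
        # The two loudspeaker signals are (nearly) proportional in this bin,
        # so the two paths cannot be separated from these frames.
        raise ValueError("loudspeaker signals too correlated to separate paths")

    # Cramer's rule for the 2x2 system.
    h1 = (b1 * a22 - a12 * b2) / det
    h2 = (a11 * b2 - a21 * b1) / det
    return h1, h2
```

The determinant check matters in two-loudspeaker setups: when both loudspeakers carry strongly correlated signals, the fit has no unique solution and the estimate should not be trusted.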
11. In a communication system having a transducer for receiving a clean speech signal from a user, and having first and second loudspeakers for providing an output signal to the user, the output signal containing first and second loudspeaker signals which interfere with the clean speech signal traveling through first and second acoustic paths to reach the transducer, the transducer receiving an input signal containing the first and second loudspeaker signals and the clean speech signal, a method of obtaining the clean speech signal, the method comprising:
performing a short-time Fourier transform (STFT) on the input signal to obtain at least one frequency component;
performing a short-time Fourier transform (STFT) on the first and second loudspeaker signals to obtain first and second frequency components, respectively;
summing the first and second frequency components to obtain an interference sum; and
subtracting the interference sum from the at least one frequency component to obtain the clean speech signal for translation into a time domain.
Dependent claims: 12, 13, 15, 16, 17, 19, 20, 21
14. In a communication system having a local microphone for transmitting signals to a remote user through a communication channel, and first and second local loudspeakers for receiving signals from the remote user via the communication channel, the microphone receiving a microphone output signal comprising a clean speech signal from a local user and an interference signal from the first and second loudspeakers, a system for suppressing the interference signal, the system comprising:
a first transform module performing a short-time Fourier transform (STFT) on the first loudspeaker signal to obtain a first frequency sub-band signal;
a second transform module performing a short-time Fourier transform (STFT) on the second loudspeaker signal to obtain a second frequency sub-band signal;
a third transform module performing a short-time Fourier transform (STFT) on the microphone output signal to obtain a third frequency sub-band signal;
a subtractor module subtracting the first and second frequency sub-band signals from the third frequency sub-band signal to obtain a clean speech signal; and
an inverse short-time Fourier transform (ISTFT) module translating the clean speech signal into time domain.
18. An acoustic echo suppression method comprising:
receiving an input signal containing first and second acoustic echo signals and a clean speech signal;
transforming the first and second acoustic echo signals into first and second frequency domain signals;
determining a sum of magnitudes for each of the first and second frequency domain signals;
transforming the input signal into a third frequency domain signal;
determining a sum for the magnitude of the first frequency domain signal and the second frequency domain signal;
determining a magnitude of the third frequency domain signal; and
canceling the first and second echo signals by generating a difference signal between the sum of the magnitudes for each of the first and second frequency domain signals and the magnitude of the third frequency domain signal, the difference signal being transformed into a time domain signal to obtain the clean speech signal.
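The magnitude-domain cancellation in claim 18 can be sketched per frequency bin as below. Several details are assumptions that the claim does not specify: the difference is taken as the input magnitude minus the summed echo magnitudes, negative results are floored at zero, and the phase of the input bin is reused to rebuild a complex value for the inverse transform. The name `magnitude_suppress` is hypothetical, and the final transform to the time domain is omitted.

```python
import cmath


def magnitude_suppress(input_bins, echo1_bins, echo2_bins):
    """Subtract summed echo magnitudes from the input magnitude, bin by bin.

    All three arguments are equal-length lists of complex frequency-domain
    values for one frame.
    """
    clean_bins = []
    for y, e1, e2 in zip(input_bins, echo1_bins, echo2_bins):
        echo_magnitude = abs(e1) + abs(e2)
        # Floor at zero so an over-estimated echo cannot produce a
        # negative magnitude (assumption, not from the claim).
        clean_magnitude = max(abs(y) - echo_magnitude, 0.0)
        # Reuse the input phase to form a complex bin (assumption).
        clean_bins.append(cmath.rect(clean_magnitude, cmath.phase(y)))
    return clean_bins
```

For example, an input bin of `3+4j` (magnitude 5) with echo bins of magnitude 1 each leaves magnitude 3 at the original phase, that is `1.8+2.4j`.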
Specification