Robust separation of speech signals in a noisy environment
First Claim
1. A method for improving a speech signal using a voice activity detector, comprising:
- receiving a first signal;
receiving a second signal;
comparing an energy level in the first signal to an energy level in the second signal;
determining that voice activity is present when the energy level of the first signal is higher than the energy level of the second signal;
generating a control signal responsive to determining that voice activity is present; and
controlling a speech enhancement process using the control signal, wherein the speech enhancement process comprises a signal separation process, and a learning process for the signal separation process is activated responsive to the control signal.
4 Assignments
0 Petitions
Accused Products
Abstract
A method for improving the quality of a speech signal extracted from a noisy acoustic environment is provided. In one approach, a signal separation process is associated with a voice activity detector. The voice activity detector is a two-channel detector, which enables a particularly robust and accurate detection of voice activity. When speech is detected, the voice activity detector generates a control signal. The control signal is used to activate, adjust, or control signal separation processes or post-processing operations to improve the quality of the resulting speech signal. In another approach, a signal separation process is provided as a learning stage and an output stage. The learning stage aggressively adjusts to current acoustic conditions, and passes coefficients to the output stage. The output stage adapts more slowly, and generates a speech-content signal and a noise dominant signal. When the learning stage becomes unstable, only the learning stage is reset, allowing the output stage to continue outputting a high quality speech signal.
-
Citations
44 Claims
-
1. A method for improving a speech signal using a voice activity detector, comprising:
-
receiving a first signal; receiving a second signal; comparing an energy level in the first signal to an energy level in the second signal; determining that voice activity is present when the energy level of the first signal is higher than the energy level of the second signal; generating a control signal responsive to determining that voice activity is present; and controlling a speech enhancement process using the control signal, wherein the speech enhancement process comprises a signal separation process, and a learning process for the signal separation process is activated responsive to the control signal. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A signal separation process, comprising:
-
receiving a first signal; receiving a second signal; comparing the first signal and the second signal to determine that voice activity is present; generating a control signal responsive to determining that voice activity is present; activating a blind signal separation process responsive to the control signal; receiving the first and second signals into the blind signal separation process; and generating a signal having speech content. - View Dependent Claims (16, 17)
-
-
18. A signal separation system, comprising:
-
a first microphone generating a first signal; a second microphone generating a second signal; a first learning stage receiving the first signal and the second signal, and generating a set of teaching coefficients; the learning stage being configured to rapidly adapt its coefficients to current acoustic conditions; an output stage coupled to the learning stage and receiving the teaching coefficients; the output stage receiving the first signal and the second signal, and generating a speech-content signal and a noise-dominant signal; and the output stage being configured to more slowly adapt its coefficients. - View Dependent Claims (19, 20, 21, 22, 23, 24)
-
-
25. A signal separation system, comprising:
-
means for generating a first signal; means for generating a second signal; means for comparing the first signal and the second signal to determine that voice activity is present; means for generating a control signal responsive to determining that voice activity is present; means for activating a blind signal separation process responsive to the control signal; means for receiving the first and second signals into the blind signal separation process; and means for generating a signal having speech content. - View Dependent Claims (26, 27)
-
-
28. A computer readable storage medium storing computer executable instructions which when executed on a computer perform a method for improving a speech signal using a voice activity detector, the method comprising:
-
receiving a first signal; receiving a second signal; comparing an energy level in the first signal to an energy level in the second signal; determining that voice activity is present when the energy level of the first signal is higher than the energy level of the second signal; generating a control signal responsive to determining that voice activity is present; and controlling a speech enhancement process using the control signal, wherein the speech enhancement process comprises a signal separation process, and a learning process for the signal separation process is activated responsive to the control signal. - View Dependent Claims (29, 30, 31, 32)
-
-
33. A computer readable storage medium storing computer executable instructions which when executed on a computer perform a method for separating signals, the method comprising:
-
receiving a first signal; receiving a second signal; comparing the first signal and the second signal to determine that voice activity is present; generating a control signal responsive to determining that voice activity is present; activating a blind signal separation process responsive to the control signal; receiving the first and second signals into the blind signal separation process; and generating a signal having speech content. - View Dependent Claims (34, 35)
-
-
36. A speech signal improvement system comprising:
-
means for receiving a first signal; means for receiving a second signal; means for comparing an energy level in the first signal to an energy level in the second signal; means for determining that voice activity is present when the energy level of the first signal is higher than the energy level of the second signal; means for generating a control signal responsive to determining that voice activity is present; and means for controlling a speech enhancement process using the control signal, wherein the speech enhancement process comprises a signal separation process, and a learning process for the signal separation process is activated responsive to the control signal. - View Dependent Claims (37, 38)
-
-
39. A speech signal system, comprising:
-
a first microphone generating a first signal; a second microphone generating a second signal; a voice activity detection module configured to; compare an energy level in the first signal to an energy level in the second signal; determine that voice activity is present when the energy level of the first signal is higher than the energy level of the second signal; and generate a control signal for controlling a speech enhancement process and activating a learning process, the control signal being responsive to determining that voice activity is present; wherein the speech enhancement process comprises a signal separation process. - View Dependent Claims (40, 41)
-
-
42. A signal separation system comprising:
-
a first microphone generating a first signal; a second microphone generating a second signal; a voice activity detection module configured to; compare the first signal and the second signal to determine that voice activity is present; and generate a control signal for activating a blind signal separation process, the control signal being responsive to determining that voice activity is present; and a processor device configured to; receive the first and second signals; separate the received signals using the blind signal separation process; and generate a signal having speech content. - View Dependent Claims (43, 44)
-
Specification