Robust separation of speech signals in a noisy environment
Abstract
A method for improving the quality of a speech signal extracted from a noisy acoustic environment is provided. In one approach, a signal separation process is associated with a voice activity detector. The voice activity detector is a two-channel detector, which enables particularly robust and accurate detection of voice activity. When speech is detected, the voice activity detector generates a control signal. The control signal is used to activate, adjust, or control signal separation processes or post-processing operations to improve the quality of the resulting speech signal. In another approach, a signal separation process is provided as a learning stage and an output stage. The learning stage adjusts aggressively to current acoustic conditions and passes coefficients to the output stage. The output stage adapts more slowly and generates a speech-content signal and a noise-dominant signal. When the learning stage becomes unstable, only the learning stage is reset, allowing the output stage to continue outputting a high-quality speech signal.
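Claim 1 describes the two-channel detector as an energy comparison between a first and a second signal. The following is a minimal sketch of that decision and of a control signal gating an enhancement step. The frame-based processing, the `margin` parameter, the bypass behavior when no voice is detected, and all function names are assumptions made for illustration; they are not taken from the patent.

```python
from typing import Callable, List, Sequence


def frame_energy(frame: Sequence[float]) -> float:
    """Mean squared amplitude of one frame (0.0 for an empty frame)."""
    return sum(x * x for x in frame) / len(frame) if frame else 0.0


def voice_active(first: Sequence[float], second: Sequence[float],
                 margin: float = 1.0) -> bool:
    """Control signal: True when the first signal's energy is strictly
    higher than `margin` times the second signal's energy.
    `margin` is a hypothetical tuning knob; 1.0 is a plain comparison."""
    return frame_energy(first) > margin * frame_energy(second)


def enhance_if_active(first: Sequence[float], second: Sequence[float],
                      enhance: Callable[[Sequence[float], Sequence[float]], List[float]],
                      margin: float = 1.0) -> List[float]:
    """Run the (hypothetical) enhancement callable only while the control
    signal is asserted; otherwise pass the first signal through unchanged."""
    if voice_active(first, second, margin):
        return enhance(first, second)
    return list(first)


if __name__ == "__main__":
    near = [0.5, -0.5, 0.5, -0.5]   # channel closer to the speaker
    far = [0.1, -0.1, 0.1, -0.1]    # channel farther from the speaker
    print(voice_active(near, far))  # first signal has the higher energy
    print(voice_active(far, near))  # second signal has the higher energy
```

A real detector would typically smooth the energies over time and add hysteresis so the control signal does not toggle on every frame; those refinements are left out here.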
23 Claims
1. A method for improving a speech signal using a voice activity detector, comprising:
receiving a first signal;
receiving a second signal;
comparing the energy level in the first signal to the energy level in the second signal;
determining that voice activity is present when the energy level of the first signal is higher than the energy level of the second signal;
generating a control signal responsive to determining that voice activity is present; and
controlling a speech enhancement process using the control signal.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15
16. A signal separation process, comprising:
receiving a first signal;
receiving a second signal;
comparing the first signal and the second signal to determine that voice activity is present;
generating a control signal responsive to determining that voice activity is present;
activating a blind signal separation process responsive to the control signal;
receiving the first and second signals into the blind signal separation process; and
generating a signal having speech content.
Dependent claims: 17, 18
19. A signal separation system, comprising:
a first microphone generating a first signal;
a second microphone generating a second signal;
a first learning stage receiving the first signal and the second signal, and generating a set of teaching coefficients;
the learning stage being configured to rapidly adapt its coefficients to current acoustic conditions;
an output stage coupled to the learning stage and receiving the teaching coefficients;
the output stage receiving the first signal and the second signal, and generating a speech-content signal and a noise-dominant signal; and
the output stage being configured to more slowly adapt its coefficients.
Dependent claims: 20, 21, 22, 23
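The learning-stage and output-stage arrangement of claim 19, together with the reset behavior described in the abstract, can be illustrated with a toy model. This sketch swaps the patent's separation process for a single scalar coefficient updated by normalized LMS, which is far simpler than blind signal separation. The class name, step sizes, stability bound, and the choice to emit the second signal as the noise-dominant output are all assumptions.

```python
import math
from dataclasses import dataclass
from typing import Tuple


@dataclass
class TwoStageSeparator:
    fast_step: float = 0.5   # learning-stage step size (assumed value)
    slow_rate: float = 0.05  # output-stage tracking rate (assumed value)
    limit: float = 10.0      # stability bound on the learning coefficient (assumed)
    w_learn: float = 0.0     # fast-adapting "teaching" coefficient
    w_out: float = 0.0       # slowly adapting output coefficient

    def process(self, first: float, second: float) -> Tuple[float, float]:
        """Consume one sample per channel; return (speech, noise)."""
        # Learning stage: normalized LMS update with a large step size.
        err = first - self.w_learn * second
        self.w_learn += self.fast_step * err * second / (second * second + 1e-9)

        if not math.isfinite(self.w_learn) or abs(self.w_learn) > self.limit:
            # Learning stage went unstable: reset only this stage.
            # The output stage keeps its last good coefficient.
            self.w_learn = 0.0
        else:
            # Output stage moves slowly toward the teaching coefficient.
            self.w_out += self.slow_rate * (self.w_learn - self.w_out)

        speech = first - self.w_out * second  # speech-content signal
        noise = second                        # noise-dominant signal (assumed)
        return speech, noise


if __name__ == "__main__":
    sep = TwoStageSeparator()
    noise_samples = [1.0, -0.5, 0.25, -1.0]
    for i in range(500):
        n = noise_samples[i % len(noise_samples)]
        sep.process(0.8 * n, n)  # first channel carries leaked noise only
    print(round(sep.w_out, 3))   # output coefficient after adaptation
```

Because the output stage only follows the teaching coefficient while the learning stage is within bounds, a divergence in the fast stage does not disturb the signal being output.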
Specification