METHOD, APPARATUS, AND MANUFACTURE FOR TWO-MICROPHONE ARRAY SPEECH ENHANCEMENT FOR AN AUTOMOTIVE ENVIRONMENT
First Claim
1. A method for speech enhancement in an automotive environment, comprising:
- enabling a user to select between three modes of operation, including;
a mode for enhancing driver speech only, a mode for enhancing front passenger speech only, and a mode for enhancing both driver speech and front passenger speech;
receiving;
a first microphone signal from a first microphone of a two-microphone array, and a second microphone signal from a second microphone of the two-microphone array;
decomposing the first microphone signal and the second microphone signal into a plurality of subbands;
performing at least one signal processing method on the each subband of the decomposed first and second microphone signals to provide a first signal processing output signal and a second signal processing output signal;
performing an acoustic events detection to make a determination as to whether;
the driver is speaking, the front passenger is speaking, or neither front driver nor the front passenger is speaking;
providing an acoustics events detection output signal, wherein providing the acoustics events detection output signal includes;
during the mode for enhancing driver speech only, if the acoustic events detection determination is a determination that the driver is speaking, providing the first signal processing output signal as the acoustic event detection output signal;
during the mode for enhancing driver speech only, if the acoustic events detection determination is a determination that the front passenger is speaking;
attenuating the first signal processing output signal, and providing the attenuated first signal processing output signal as the acoustic event detection output signal;
during the mode for enhancing front passenger speech only, if the acoustic events detection determination is a determination that the front passenger is speaking, providing the second signal processing output signal as the acoustic event detection output signal;
during the mode for enhancing front passenger speech only, if the acoustic events detection determination is a determination that the driver is speaking;
attenuating the second signal processing output signal, and providing the attenuated second signal processing output signal as the acoustic event detection output signal; and
during the mode for enhancing both driver speech and front passenger speech, if the acoustics event determination is a determination that the driver is speaking or a determination that the front passenger is speaking, providing the first and second signal processing output signals as the acoustic event detection output signal; and
combining each subband of the acoustic event detection output signal.
1 Assignment
0 Petitions
Accused Products
Abstract
A method, apparatus, and manufacture for speech enhancement in an automotive environment is provided. Signals from first and second microphones of a two-microphone array are decomposed into subbands. At least one signal processing method is performed on the each subband of the decomposed signals to provide a first signal processing output signal and a second signal processing output signal. Subsequently, an acoustic events detection determination is made as to whether the driver, the front passenger, or neither is speaking. An acoustic events detection output signal is provided by selecting the first or second signal processing output signal and by either attenuating the selected signal or not, based on a currently selected operating mode and based on the result of the acoustic events detection determination. Each subband of the acoustics events detection output signal is then combined.
12 Citations
22 Claims
-
1. A method for speech enhancement in an automotive environment, comprising:
-
enabling a user to select between three modes of operation, including;
a mode for enhancing driver speech only, a mode for enhancing front passenger speech only, and a mode for enhancing both driver speech and front passenger speech;receiving;
a first microphone signal from a first microphone of a two-microphone array, and a second microphone signal from a second microphone of the two-microphone array;decomposing the first microphone signal and the second microphone signal into a plurality of subbands; performing at least one signal processing method on the each subband of the decomposed first and second microphone signals to provide a first signal processing output signal and a second signal processing output signal; performing an acoustic events detection to make a determination as to whether;
the driver is speaking, the front passenger is speaking, or neither front driver nor the front passenger is speaking;providing an acoustics events detection output signal, wherein providing the acoustics events detection output signal includes; during the mode for enhancing driver speech only, if the acoustic events detection determination is a determination that the driver is speaking, providing the first signal processing output signal as the acoustic event detection output signal; during the mode for enhancing driver speech only, if the acoustic events detection determination is a determination that the front passenger is speaking;
attenuating the first signal processing output signal, and providing the attenuated first signal processing output signal as the acoustic event detection output signal;during the mode for enhancing front passenger speech only, if the acoustic events detection determination is a determination that the front passenger is speaking, providing the second signal processing output signal as the acoustic event detection output signal; during the mode for enhancing front passenger speech only, if the acoustic events detection determination is a determination that the driver is speaking;
attenuating the second signal processing output signal, and providing the attenuated second signal processing output signal as the acoustic event detection output signal; andduring the mode for enhancing both driver speech and front passenger speech, if the acoustics event determination is a determination that the driver is speaking or a determination that the front passenger is speaking, providing the first and second signal processing output signals as the acoustic event detection output signal; and combining each subband of the acoustic event detection output signal. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. An apparatus for speech enhancement in an automotive environment, comprising:
-
a memory that is configured to store a plurality of sets of pre-determined beamforming weights, wherein each of the sets of pre-determined beamforming weights has a corresponding integral index number; and a processor that is configured to execute code that enables actions, including; enabling a user to select between three modes of operation, including;
a mode for enhancing driver speech only, a mode for enhancing front passenger speech only, and a mode for enhancing both driver speech and front passenger speech;receiving;
a first microphone signal from a first microphone of a two-microphone array, and a second microphone signal from a second microphone of the two-microphone array;decomposing the first microphone signal and the second microphone signal into a plurality of subbands; performing at least one signal processing method on the each subband of the decomposed first and second microphone signals to provide a first signal processing output signal and a second signal processing output signal; performing an acoustic events detection to make a determination as to whether;
the driver is speaking, the front passenger is speaking, or neither front driver nor the front passenger is speaking;providing an acoustics events detection output signal, wherein providing the acoustics events detection output signal includes; during the mode for enhancing driver speech only, if the acoustic events detection determination is a determination that the driver is speaking, providing the first signal processing output signal as the acoustic event detection output signal; during the mode for enhancing driver speech only, if the acoustic events detection determination is a determination that the front passenger is speaking;
attenuating the first signal processing output signal, and providing the attenuated first signal processing output signal as the acoustic event detection output signal;during the mode for enhancing front passenger speech only, if the acoustic events detection determination is a determination that the front passenger is speaking, providing the second signal processing output signal as the acoustic event detection output signal; during the mode for enhancing front passenger speech only, if the acoustic events detection determination is a determination that the driver is speaking;
attenuating the second signal processing output signal, and providing the attenuated second signal processing output signal as the acoustic event detection output signal; andduring the mode for enhancing both driver speech and front passenger speech, if the acoustics event determination is a determination that the driver is speaking or a determination that the front passenger is speaking, providing the first and second signal processing output signals as the acoustic event detection output signal; and combining each subband of the acoustic event detection output signal. - View Dependent Claims (9, 10, 11, 12, 13, 14, 15, 16)
-
-
17. A tangible processor-readable storage medium that arranged to encode processor-readable code, which, when executed by one or more processors, enables actions for speech enhancement in an automotive environment, comprising:
-
enabling a user to select between three modes of operation, including;
a mode for enhancing driver speech only, a mode for enhancing front passenger speech only, and a mode for enhancing both driver speech and front passenger speech;receiving;
a first microphone signal from a first microphone of a two-microphone array, and a second microphone signal from a second microphone of the two-microphone array;decomposing the first microphone signal and the second microphone signal into a plurality of subbands; performing at least one signal processing method on the each subband of the decomposed first and second microphone signals to provide a first signal processing output signal and a second signal processing output signal; performing an acoustic events detection to make a determination as to whether;
the driver is speaking, the front passenger is speaking, or neither front driver nor the front passenger is speaking;providing an acoustics events detection output signal, wherein providing the acoustics events detection output signal includes; during the mode for enhancing driver speech only, if the acoustic events detection determination is a determination that the driver is speaking, providing the first signal processing output signal as the acoustic event detection output signal; during the mode for enhancing driver speech only, if the acoustic events detection determination is a determination that the front passenger is speaking;
attenuating the first signal processing output signal, and providing the attenuated first signal processing output signal as the acoustic event detection output signal;during the mode for enhancing front passenger speech only, if the acoustic events detection determination is a determination that the front passenger is speaking, providing the second signal processing output signal as the acoustic event detection output signal; during the mode for enhancing front passenger speech only, if the acoustic events detection determination is a determination that the driver is speaking;
attenuating the second signal processing output signal, and providing the attenuated second signal processing output signal as the acoustic event detection output signal; andduring the mode for enhancing both driver speech and front passenger speech, if the acoustics event determination is a determination that the driver is speaking or a determination that the front passenger is speaking, providing the first and second signal processing output signals as the acoustic event detection output signal; and combining each subband of the acoustic event detection output signal. - View Dependent Claims (18)
-
-
19. A method for speech enhancement in an automotive environment, comprising:
-
receiving;
a first microphone signal from a first microphone of a two-microphone array, and a second microphone signal from a second microphone of the two-microphone array;decomposing the first microphone signal and the second microphone signal into a plurality of subbands; calibrating the first and second microphone signals; performing at least one signal processing method on the each subband of the decomposed first and second microphone signals to provide a first signal processing output signal and a second signal processing output signal, wherein the signal processing method includes at least one of adaptive beamforming and adaptive de-correlation filtering; performing an acoustic events detection to make a determination as to whether;
the driver is speaking, the front passenger is speaking, or neither front driver nor the front passenger is speaking;providing an acoustics events detection output signal from first and second signal processing output signals based, at least in part, on a current system mode and the acoustics event detection determination; and combining each subband of the acoustic event detection output signal. - View Dependent Claims (20, 21, 22)
-
Specification