METHOD, APPARATUS, AND MANUFACTURE FOR TWO-MICROPHONE ARRAY SPEECH ENHANCEMENT FOR AN AUTOMOTIVE ENVIRONMENT

US 20140270241A1
Filed: 03/15/2013
Published: 09/18/2014
Est. Priority Date: 03/15/2013
Status: Abandoned Application

First Claim

Patent Images

1. A method for speech enhancement in an automotive environment, comprising:

enabling a user to select between three modes of operation, including;

a mode for enhancing driver speech only, a mode for enhancing front passenger speech only, and a mode for enhancing both driver speech and front passenger speech;

receiving;

a first microphone signal from a first microphone of a two-microphone array, and a second microphone signal from a second microphone of the two-microphone array;

decomposing the first microphone signal and the second microphone signal into a plurality of subbands;

performing at least one signal processing method on the each subband of the decomposed first and second microphone signals to provide a first signal processing output signal and a second signal processing output signal;

performing an acoustic events detection to make a determination as to whether;

the driver is speaking, the front passenger is speaking, or neither front driver nor the front passenger is speaking;

providing an acoustics events detection output signal, wherein providing the acoustics events detection output signal includes;

during the mode for enhancing driver speech only, if the acoustic events detection determination is a determination that the driver is speaking, providing the first signal processing output signal as the acoustic event detection output signal;

during the mode for enhancing driver speech only, if the acoustic events detection determination is a determination that the front passenger is speaking;

attenuating the first signal processing output signal, and providing the attenuated first signal processing output signal as the acoustic event detection output signal;

during the mode for enhancing front passenger speech only, if the acoustic events detection determination is a determination that the front passenger is speaking, providing the second signal processing output signal as the acoustic event detection output signal;

during the mode for enhancing front passenger speech only, if the acoustic events detection determination is a determination that the driver is speaking;

attenuating the second signal processing output signal, and providing the attenuated second signal processing output signal as the acoustic event detection output signal; and

during the mode for enhancing both driver speech and front passenger speech, if the acoustics event determination is a determination that the driver is speaking or a determination that the front passenger is speaking, providing the first and second signal processing output signals as the acoustic event detection output signal; and

combining each subband of the acoustic event detection output signal.

View all claims

1 Assignment

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A method, apparatus, and manufacture for speech enhancement in an automotive environment is provided. Signals from first and second microphones of a two-microphone array are decomposed into subbands. At least one signal processing method is performed on the each subband of the decomposed signals to provide a first signal processing output signal and a second signal processing output signal. Subsequently, an acoustic events detection determination is made as to whether the driver, the front passenger, or neither is speaking. An acoustic events detection output signal is provided by selecting the first or second signal processing output signal and by either attenuating the selected signal or not, based on a currently selected operating mode and based on the result of the acoustic events detection determination. Each subband of the acoustics events detection output signal is then combined.

12 Citations

View as Search Results

22 Claims

1. A method for speech enhancement in an automotive environment, comprising:
- enabling a user to select between three modes of operation, including;
  
  a mode for enhancing driver speech only, a mode for enhancing front passenger speech only, and a mode for enhancing both driver speech and front passenger speech;
  
  receiving;
  
  a first microphone signal from a first microphone of a two-microphone array, and a second microphone signal from a second microphone of the two-microphone array;
  
  decomposing the first microphone signal and the second microphone signal into a plurality of subbands;
  
  performing at least one signal processing method on the each subband of the decomposed first and second microphone signals to provide a first signal processing output signal and a second signal processing output signal;
  
  performing an acoustic events detection to make a determination as to whether;
  
  the driver is speaking, the front passenger is speaking, or neither front driver nor the front passenger is speaking;
  
  providing an acoustics events detection output signal, wherein providing the acoustics events detection output signal includes;
  
  during the mode for enhancing driver speech only, if the acoustic events detection determination is a determination that the driver is speaking, providing the first signal processing output signal as the acoustic event detection output signal;
  
  during the mode for enhancing driver speech only, if the acoustic events detection determination is a determination that the front passenger is speaking;
  
  attenuating the first signal processing output signal, and providing the attenuated first signal processing output signal as the acoustic event detection output signal;
  
  during the mode for enhancing front passenger speech only, if the acoustic events detection determination is a determination that the front passenger is speaking, providing the second signal processing output signal as the acoustic event detection output signal;
  
  during the mode for enhancing front passenger speech only, if the acoustic events detection determination is a determination that the driver is speaking;
  
  attenuating the second signal processing output signal, and providing the attenuated second signal processing output signal as the acoustic event detection output signal; and
  
  during the mode for enhancing both driver speech and front passenger speech, if the acoustics event determination is a determination that the driver is speaking or a determination that the front passenger is speaking, providing the first and second signal processing output signals as the acoustic event detection output signal; and
  
  combining each subband of the acoustic event detection output signal.
- View Dependent Claims (2, 3, 4, 5, 6, 7)
- - 2. The method of claim 1, wherein decomposing the first microphone signal and the second microphone signal is accomplished with an analysis filter bank, and wherein combining each subband of the acoustic event detection output signal is accomplished with a synthesis filter bank.
  - 3. The method of claim 1, further comprising calibrating the first and second microphone signals.
  - 4. The method of claim 1, whereinthe acoustics event determination is made by comparing a testing statistic to a first threshold and a second threshold, wherein the acoustic event detection determination is a determination that the driver is speaking if the testing statistic exceeds both the first threshold and the second threshold, the determination is that the front passenger is speaking if the testing statistics fails to exceed both the first threshold and the second threshold, and the determination is that neither the driver nor the front passenger is speaking if the testing statistic is between the first threshold and the second threshold, wherein the testing statistic is based, at least in part, on a comparison of a first ratio and a second ratio, wherein the first ratio is the ratio of a power associated with the first processing output signal and a power associated with the first microphone signal, and the second ratio is a ratio of a power associated with the second processing output signal and a power associated with the second microphone signal.
  - 5. The method of claim 1, wherein providing the acoustic event detection output signal further includes:
    - if the acoustics events determination is a determination that neither the driver nor the front passenger is speaking;
      
      attenuating the first signal processing output signal, and providing the attenuated first signal processing output signal as the acoustic event detection output signal.
  - 6. The method of claim 1, wherein the at least one signal processing method includes at least one of adaptive beamforming and adaptive de-correlation filtering.
  - 7. The method of claim 6, wherein the at least one signal processing method further includes noise reduction applied to each channel after performing the at least one of the adaptive beamforming and the adaptive de-correlation filtering.

8. An apparatus for speech enhancement in an automotive environment, comprising:
- a memory that is configured to store a plurality of sets of pre-determined beamforming weights, wherein each of the sets of pre-determined beamforming weights has a corresponding integral index number; and
  
  a processor that is configured to execute code that enables actions, including;
  
  enabling a user to select between three modes of operation, including;
  
  a mode for enhancing driver speech only, a mode for enhancing front passenger speech only, and a mode for enhancing both driver speech and front passenger speech;
  
  receiving;
  
  a first microphone signal from a first microphone of a two-microphone array, and a second microphone signal from a second microphone of the two-microphone array;
  
  decomposing the first microphone signal and the second microphone signal into a plurality of subbands;
  
  performing at least one signal processing method on the each subband of the decomposed first and second microphone signals to provide a first signal processing output signal and a second signal processing output signal;
  
  performing an acoustic events detection to make a determination as to whether;
  
  the driver is speaking, the front passenger is speaking, or neither front driver nor the front passenger is speaking;
  
  providing an acoustics events detection output signal, wherein providing the acoustics events detection output signal includes;
  
  during the mode for enhancing driver speech only, if the acoustic events detection determination is a determination that the driver is speaking, providing the first signal processing output signal as the acoustic event detection output signal;
  
  during the mode for enhancing driver speech only, if the acoustic events detection determination is a determination that the front passenger is speaking;
  
  attenuating the first signal processing output signal, and providing the attenuated first signal processing output signal as the acoustic event detection output signal;
  
  during the mode for enhancing front passenger speech only, if the acoustic events detection determination is a determination that the front passenger is speaking, providing the second signal processing output signal as the acoustic event detection output signal;
  
  during the mode for enhancing front passenger speech only, if the acoustic events detection determination is a determination that the driver is speaking;
  
  attenuating the second signal processing output signal, and providing the attenuated second signal processing output signal as the acoustic event detection output signal; and
  
  during the mode for enhancing both driver speech and front passenger speech, if the acoustics event determination is a determination that the driver is speaking or a determination that the front passenger is speaking, providing the first and second signal processing output signals as the acoustic event detection output signal; and
  
  combining each subband of the acoustic event detection output signal.
- View Dependent Claims (9, 10, 11, 12, 13, 14, 15, 16)
- - 9. The apparatus of claim 8, wherein processor is further configured such that the at least one signal processing method includes at least one of adaptive beamforming and adaptive de-correlation filtering.
  - 10. The apparatus of claim 8, further comprising:
    - the two-microphone array.
  - 11. The apparatus of claim 10, wherein the first microphone of the two-microphone array is an omni-directional microphone, and wherein the second microphone of the two-microphone array is another omni-directional microphone.
  - 12. The apparatus of claim 10, wherein the first microphone of the two-microphone array is an uni-directional microphone, the second microphone of the two-microphone array is another uni-directional microphone, and wherein the first and second microphone are arranged in a side-to-side configuration.
  - 13. The apparatus of claim 10, wherein the first microphone of the two-microphone array is an uni-directional microphone, the second microphone of the two-microphone array is another uni-directional microphone, and wherein the first and second microphone are arranged in a back-to-back configuration.
  - 14. The apparatus of claim 10, wherein a distance from the first microphone to the second microphone is from 1 centimeter to 30 centimeters.
  - 15. The apparatus of claim 10, wherein the two-microphone array is installed on a ceiling roof of an automobile in between positions for a driver and a front passenger.
  - 16. The apparatus of claim 10, wherein the two-microphone array is installed on at least one of a front head lamp panel of an automobile or on a back of the head lamp of the automobile.

17. A tangible processor-readable storage medium that arranged to encode processor-readable code, which, when executed by one or more processors, enables actions for speech enhancement in an automotive environment, comprising:
- enabling a user to select between three modes of operation, including;
  
  a mode for enhancing driver speech only, a mode for enhancing front passenger speech only, and a mode for enhancing both driver speech and front passenger speech;
  
  receiving;
  
  a first microphone signal from a first microphone of a two-microphone array, and a second microphone signal from a second microphone of the two-microphone array;
  
  decomposing the first microphone signal and the second microphone signal into a plurality of subbands;
  
  performing at least one signal processing method on the each subband of the decomposed first and second microphone signals to provide a first signal processing output signal and a second signal processing output signal;
  
  performing an acoustic events detection to make a determination as to whether;
  
  the driver is speaking, the front passenger is speaking, or neither front driver nor the front passenger is speaking;
  
  providing an acoustics events detection output signal, wherein providing the acoustics events detection output signal includes;
  
  during the mode for enhancing driver speech only, if the acoustic events detection determination is a determination that the driver is speaking, providing the first signal processing output signal as the acoustic event detection output signal;
  
  during the mode for enhancing driver speech only, if the acoustic events detection determination is a determination that the front passenger is speaking;
  
  attenuating the first signal processing output signal, and providing the attenuated first signal processing output signal as the acoustic event detection output signal;
  
  during the mode for enhancing front passenger speech only, if the acoustic events detection determination is a determination that the front passenger is speaking, providing the second signal processing output signal as the acoustic event detection output signal;
  
  during the mode for enhancing front passenger speech only, if the acoustic events detection determination is a determination that the driver is speaking;
  
  attenuating the second signal processing output signal, and providing the attenuated second signal processing output signal as the acoustic event detection output signal; and
  
  during the mode for enhancing both driver speech and front passenger speech, if the acoustics event determination is a determination that the driver is speaking or a determination that the front passenger is speaking, providing the first and second signal processing output signals as the acoustic event detection output signal; and
  
  combining each subband of the acoustic event detection output signal.
- View Dependent Claims (18)
- - 18. The tangible processor-readable medium of claim 17, wherein the at least one signal processing method includes at least one of adaptive beamforming and adaptive de-correlation filtering.

19. A method for speech enhancement in an automotive environment, comprising:
- receiving;
  
  a first microphone signal from a first microphone of a two-microphone array, and a second microphone signal from a second microphone of the two-microphone array;
  
  decomposing the first microphone signal and the second microphone signal into a plurality of subbands;
  
  calibrating the first and second microphone signals;
  
  performing at least one signal processing method on the each subband of the decomposed first and second microphone signals to provide a first signal processing output signal and a second signal processing output signal, wherein the signal processing method includes at least one of adaptive beamforming and adaptive de-correlation filtering;
  
  performing an acoustic events detection to make a determination as to whether;
  
  the driver is speaking, the front passenger is speaking, or neither front driver nor the front passenger is speaking;
  
  providing an acoustics events detection output signal from first and second signal processing output signals based, at least in part, on a current system mode and the acoustics event detection determination; and
  
  combining each subband of the acoustic event detection output signal.
- View Dependent Claims (20, 21, 22)
- - 20. The method of claim 19, wherein the at least one signal processing method further includes noise reduction applied to each channel after performing the at least one of the adaptive beamforming and the adaptive de-correlation filtering.
  - 21. The method of claim 19, wherein the at least one signal processing method includes adaptive beamforming followed by adaptive de-correlation filtering.
  - 22. The method of claim 21, wherein the at least one signal processing method further includes noise reduction applied to each channel after performing the adaptive de-correlation filtering.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
CSR Technology Incorporated (Qualcomm, Inc.)
Original Assignee
CSR Technology Incorporated (Qualcomm, Inc.)
Inventors
Yu, Tao, Alves, Rogerio G.

Application Number

US13/843,254
Publication Number

US 20140270241A1
Time in Patent Office

Days
Field of Search
US Class Current

381/86
CPC Class Codes

G10L 2021/02165   Two microphones, one receiv...

G10L 21/0208   Noise filtering

H04R 2430/03   Synergistic effects of band...

H04R 2430/20   Processing of the output si...

H04R 2499/13   Acoustic transducers and so...

H04R 29/006   Microphone matching

H04R 3/005   for combining the signals o...

METHOD, APPARATUS, AND MANUFACTURE FOR TWO-MICROPHONE ARRAY SPEECH ENHANCEMENT FOR AN AUTOMOTIVE ENVIRONMENT

First Claim

1 Assignment

0 Petitions

Accused Products

Abstract

12 Citations

22 Claims

Specification

Solutions

Use Cases

Quick Links

METHOD, APPARATUS, AND MANUFACTURE FOR TWO-MICROPHONE ARRAY SPEECH ENHANCEMENT FOR AN AUTOMOTIVE ENVIRONMENT

First Claim

1 Assignment

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

12 Citations

22 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links