Controlling speech enhancement algorithms using near-field spatial statistics
First Claim
1. A process for detecting near-field sounds with at least first and second microphones that receive first and second audio signals, respectively, wherein the first of the microphones is a near-field microphone, said process comprising the steps of:
- providing a first statistic representing a direction of arrival estimate;
providing a second statistic representing far field directional gain, wherein the second statistic is provided by the steps of;
subtracting the second audio signal from the first audio signal to produce a first difference signal;
subtracting the first difference signal from the second audio signal to produce a second difference signal;
deriving the far field directional gain from the second difference signal;
providing a third statistic representing diffuse field gain;
comparing each statistic with a threshold value for each statistic; and
providing an indication of near-field sounds in accordance with the comparisons.
2 Assignments
0 Petitions
Accused Products
Abstract
A telephone includes at least two microphones and a circuit for processing audio signals coupled to the microphones. The circuit processes the signals, in part, by providing at least one statistic representing maximum normalized cross-correlation of the signals from the microphones, doaEst, dirGain, or diffGain and comparing the at least one statistic with a threshold for that statistic. At least one of noise reduction and speech enhancement is controlled by an indication of near-field sounds in accordance with the comparison. Indication of near-field speech can be further enhanced by combining statistics, including a statistic representing inter-microphone level difference, each of which have their own threshold. dirGain and diffGain are derived from signals incident upon the microphones such that the desired near-field signal is not suppressed.
40 Citations
10 Claims
-
1. A process for detecting near-field sounds with at least first and second microphones that receive first and second audio signals, respectively, wherein the first of the microphones is a near-field microphone, said process comprising the steps of:
-
providing a first statistic representing a direction of arrival estimate; providing a second statistic representing far field directional gain, wherein the second statistic is provided by the steps of; subtracting the second audio signal from the first audio signal to produce a first difference signal; subtracting the first difference signal from the second audio signal to produce a second difference signal; deriving the far field directional gain from the second difference signal; providing a third statistic representing diffuse field gain; comparing each statistic with a threshold value for each statistic; and providing an indication of near-field sounds in accordance with the comparisons. - View Dependent Claims (2, 3, 4)
-
-
5. A process for detecting near-field sounds with at least first and second microphones that receive first and second audio signals, respectively, wherein the first of the microphones is a near-field microphone, said process comprising the steps of:
-
providing a first statistic representing a direction of arrival estimate; providing a second statistic representing far field directional gain; providing a third statistic representing diffuse field gain, wherein the third statistic is provided by the steps of; adding the first audio signal to the second audio signal to produce a summed signal; subtracting the summed signal from the second audio signal to produce a difference signal; and deriving the diffuse field gain from the difference signal; comparing each statistic with a threshold value for each statistic; and providing an indication of near-field sounds in accordance with the comparisons. - View Dependent Claims (6, 7, 8)
-
-
9. A telephone comprising in combination:
-
a) a first microphone for receiving a first audio signal, the first microphone being a near-field microphone, b) a second microphone for receiving a second audio signal, c) an audio signal processor circuit for processing the first and second audio signals, the audio signal processor circuit being coupled to said first and second microphones, said audio signal processor circuit processing said first and second audio signals, in part, by; i) providing a maximum normalized cross-correlation of the first and second audio signals, ii) comparing the maximum normalized cross-correlation with a maximum normalized cross-correlation threshold; and iii) providing an indication of the presence of near-field sounds in accordance with the said comparison, d) the audio signal processor circuit also provides a far field directional gain signal by; subtracting the first audio signal from the second audio signal to create a first difference signal; subtracting the first difference signal from the second audio signal to produce a second difference signal; and providing the second difference signal as the far field directional gain signal e) the audio signal processor circuit compares the far field directional gain signal with a far field directional gain threshold; f) the audio signal processor circuit being responsive to the indication of the presence of near-field sounds for controlling operation of at least one of noise reduction and speech enhancement; and g) the audio signal processor circuit providing at least one of noise reduction and speech enhancement.
-
-
10. A telephone comprising in combination:
-
a) a first microphone for receiving a first audio signal, the first microphone being a near-field microphone, b) a second microphone for receiving a second audio signal, c) an audio signal processor circuit for processing the first and second audio signals, the audio signal processor circuit being coupled to said first and second microphones, said audio signal processor circuit processing said first and second audio signals, in part, by; i) providing a maximum normalized cross-correlation of the first and second audio signals, ii) comparing the maximum normalized cross-correlation with a maximum normalized cross-correlation threshold; and iii) providing an indication of the presence of near-field sounds in accordance with said comparison, d) the audio signal processor circuit also providing at least one of noise reduction and speech enhancement, and e) the audio signal processor circuit being responsive to the indication of the presence of near-field sounds for controlling operation of at least one of noise reduction and speech enhancement; f) the audio signal processor circuit also provides a diffuse field gain signal by; adding the first audio signal to the second audio signal to create a summed signal; subtracting the summed signal from the second audio signal to create a difference signal; and providing the difference signal as the diffuse field gain signal; and g) the audio signal processor circuit compares the diffuse field gain signal with a diffuse field gain threshold.
-
Specification