Estimation of time delay of arrival for microphone arrays
First Claim
1. One or more non-transitory computer-readable media storing computer-executable instructions executable by one or more processors to perform operations comprising:
- receiving acoustic signals from an array of at least first, second, and third microphones, the acoustic signals being associated with an acoustic source in an environment;
generating at least first, second, and third sets of time-difference-of-arrival (TDOA) data, wherein the first set of TDOA data is derived from time differences between the acoustic signals of the first microphone and the second microphone relative to the acoustic signal of the third microphone, wherein the second set of TDOA data is derived from time differences between the acoustic signals of the first microphone and the third microphone relative to the acoustic signal of the second microphone, wherein the third set of TDOA data is derived from time differences between the acoustic signals of the second microphone and the third microphone relative to the acoustic signal of the first microphone;
for the first set of TDOA data, computing a correlation function between the acoustic signal from the first microphone and the acoustic signal from the second microphone, while excluding the acoustic signal from the third microphone, to produce a first correlation value;
for the second set of TDOA data, computing a correlation function between the acoustic signal from the first microphone and the acoustic signal from the third microphone, while excluding the acoustic signal from the second microphone, to produce a second correlation value;
for the third set of TDOA data, computing a correlation function between the acoustic signal from the second microphone and the acoustic signal from the third microphone, while excluding the acoustic signal from the first microphone, to produce a third correlation value;
wherein a comparatively higher correlation value implies that two acoustic signals share similar structure when offset by a time lag, and a comparatively lower correlation value implies that two acoustic signals do not share similar structure when offset by the time lag;
determining that the first correlation value is lowest;
selecting, as a reference microphone, the third microphone; and
localizing the acoustic source in the environment by computing, in part, a direction to the acoustic source based on one of the first, second, and third sets of TDOA data associated with the reference microphone.
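The selection logic recited in this claim can be sketched as follows. This is a minimal illustration, not the patented implementation. It assumes the "correlation value" is the peak of a normalized cross-correlation over a bounded lag range, and it generalizes the claim's single stated case (first value lowest, third microphone chosen) to "the microphone excluded from the lowest-scoring pair becomes the reference". The names `peak_correlation`, `select_reference`, and `max_lag` are hypothetical.

```python
import math


def peak_correlation(x, y, max_lag):
    """Peak of the normalized cross-correlation of x and y over lags
    in [-max_lag, max_lag]. (Assumed definition of 'correlation value'.)"""
    norm = math.sqrt(sum(a * a for a in x) * sum(b * b for b in y))
    if norm == 0.0:
        return 0.0
    best = -1.0
    for lag in range(-max_lag, max_lag + 1):
        acc = 0.0
        for i in range(len(x)):
            j = i + lag
            if 0 <= j < len(y):
                acc += x[i] * y[j]
        best = max(best, acc / norm)
    return best


def select_reference(signals, max_lag):
    """signals: dict mapping microphone number (1, 2, 3) to its samples.

    For each microphone, correlate the other two while excluding it.
    Returns (reference_mic, values), where values[m] is the correlation
    value of the pair that excludes microphone m."""
    mics = sorted(signals)
    values = {}
    for excluded in mics:
        a, b = [m for m in mics if m != excluded]
        values[excluded] = peak_correlation(signals[a], signals[b], max_lag)
    # Assumed generalization: the lowest pair value picks the excluded mic.
    reference = min(values, key=values.get)
    return reference, values
```

With three microphones, `values[3]` corresponds to the claim's first correlation value (microphones 1 and 2, excluding 3), `values[2]` to the second, and `values[1]` to the third.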
Abstract
The accuracy and computational efficiency of time-difference (or delay) of arrival (TDOA) estimation are improved for localization of a sound. In one aspect, for each acoustic source event, multiple sets of TDOA data are generated, where each set uses a different sensor or microphone as the reference. One of the microphones is ultimately selected to be the reference microphone based, in part, on correlation functions of the various sets of TDOA data. The selected reference microphone is then used in sound source localization or other signal processing applications. The direction of the sound source is found using a VMRL finding algorithm as a function of a channel vector containing information of the selected channels, the reference channel and a TDOA vector.
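As a rough illustration of "multiple sets of TDOA data, where each set uses a different microphone as the reference", the sketch below builds one set per candidate reference. It assumes each delay is estimated as the lag that maximizes a plain time-domain cross-correlation, which is a common textbook choice and not necessarily the estimator used in the patent. The function names, `sample_rate`, and `max_lag` are hypothetical.

```python
def estimate_delay(ref, other, max_lag):
    """Lag in samples at which `other` best matches `ref`.
    A positive lag means the sound reached `other` later than `ref`."""
    best_lag, best_score = 0, float("-inf")
    for lag in range(-max_lag, max_lag + 1):
        score = 0.0
        for i, r in enumerate(ref):
            j = i + lag
            if 0 <= j < len(other):
                score += r * other[j]
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


def tdoa_sets(signals, sample_rate, max_lag):
    """Build one TDOA set per candidate reference microphone.

    Returns {ref: {mic: delay_in_seconds}} where each inner dict holds the
    delays of the remaining microphones relative to `ref`."""
    return {
        ref: {
            mic: estimate_delay(signals[ref], signals[mic], max_lag) / sample_rate
            for mic in signals
            if mic != ref
        }
        for ref in signals
    }
```

For three microphones this yields three sets; the set keyed by microphone 3, for example, holds the delays of microphones 1 and 2 relative to microphone 3.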
20 Claims
5. A computer-implemented method comprising:
- receiving acoustic signals from an array of at least first, second, and third microphones, the acoustic signals being associated with an acoustic source in an environment;
generating at least first, second, and third sets of time-difference-of-arrival (TDOA) data, wherein the first set of TDOA data is derived from time differences between the acoustic signals of the first microphone and the second microphone relative to the acoustic signal of the third microphone, wherein the second set of TDOA data is derived from time differences between the acoustic signals of the first microphone and the third microphone relative to the acoustic signal of the second microphone, wherein the third set of TDOA data is derived from time differences between the acoustic signals of the second microphone and the third microphone relative to the acoustic signal of the first microphone;
selecting one of the first, second, and third microphones from the array to be a reference microphone and an associated set of the TDOA data such that if the first microphone is selected, the third set of TDOA data is associated with the first microphone, if the second microphone is selected, the second set of TDOA data is associated with the second microphone, and if the third microphone is selected, the first set of TDOA data is associated with the third microphone; and
outputting an identity of the selected reference microphone and the associated set of the TDOA data.
Dependent claims: 6, 7, 8, 9
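The association in claim 5 between the selected reference microphone and its TDOA set runs in reverse order (microphone 1 pairs with the third set, microphone 3 with the first), which is easy to misread. A minimal sketch of that lookup and of the "outputting" step is given below. The container type, the names `ReferenceSelection`, `SET_FOR_REFERENCE`, and `output_selection`, and the shape of the TDOA data are illustrative assumptions, not part of the claim.

```python
from typing import Dict, NamedTuple


class ReferenceSelection(NamedTuple):
    """Hypothetical output record: the chosen microphone and its TDOA set."""
    reference_mic: int
    tdoa_set: Dict[str, float]


# Association recited in claim 5: reference microphone -> TDOA set number.
SET_FOR_REFERENCE = {1: 3, 2: 2, 3: 1}


def output_selection(reference_mic, tdoa_sets):
    """tdoa_sets maps set number (1, 2, 3) to that set's TDOA data."""
    set_number = SET_FOR_REFERENCE[reference_mic]
    return ReferenceSelection(reference_mic, tdoa_sets[set_number])
```

How the reference microphone is chosen is left open here; the sketch only covers the pairing and the output.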
10. A system comprising:
- a plurality of sensors to detect a sound emanating from an acoustic source in an environment, the plurality of sensors including at least a first sensor, a second sensor and a third sensor;
a time-difference-of-arrival estimation module coupled to receive, from the plurality of sensors, signals indicative of a detected sound, wherein the time-difference-of-arrival estimation module is configured to:
generate multiple sets of time-difference-of-arrival (TDOA) data;
associate the first sensor as a first reference sensor with a first set of the multiple sets of TDOA data;
associate the second sensor as a second reference sensor with a second set of the multiple sets of TDOA data, wherein the first reference sensor is different from the second reference sensor;
associate the third sensor as a third reference sensor with a third set of the multiple sets of TDOA data; and
select, based on the multiple sets of TDOA data, one of the first, second or third sensors to be a reference sensor for the detected sound.
Dependent claims: 11, 12, 13, 18, 19, 20
14. A system comprising:
- a plurality of sensors to detect a sound emanating from an acoustic source in an environment; and
a time-difference-of-arrival estimation module coupled to receive, from the plurality of sensors, signals indicative of the detected sound and configured to generate multiple sets of time-difference-of-arrival (TDOA) data, wherein each of the sets of TDOA data chooses a different sensor from the plurality of sensors to be a reference sensor, and to evaluate the multiple sets of TDOA data to select one of the sensors to be the reference sensor; and
a TDOA localization module configured to localize the acoustic source in the environment using, at least in part, the reference sensor and an associated set of the TDOA data, the TDOA localization module finding a direction to the acoustic source by computing a matrix M as follows;
Dependent claims: 15, 16, 17
Specification