Time difference of arrival determination with direct sound
First Claim
Patent Images
1. One or more non-transitory computer-readable media storing computer-executable instructions that, when executed, cause one or more processors to perform acts comprising:
- acquiring, from a plurality of microphones, a plurality of audio signals associated with an acoustic source;
filtering the plurality of audio signals with a band-pass filter;
estimating a noise floor of the plurality of audio signals;
for individual audio signals of the plurality of audio signals;
identifying an event of interest;
determining that the event of interest rises above the noise floor;
adjusting the noise floor at a rate that is based at least partly on an extent to which the event of interest rises above the noise floor;
detecting a peak after the event of interest rose above the noise floor; and
determining a time at which the peak occurs; and
determining time-difference-of-arrival (TDOA) values for the event of interest based at least in part on a difference between the time at which the peaks occur in the individual audio signals.
2 Assignments
0 Petitions
Accused Products
Abstract
Acoustic signals may be localized such that their position in space is determined. Time-difference-of-arrival data from multiple microphones may be used for this localization. Signal data from the microphones may be degraded by reverberation and other environmental distortions, resulting in erroneous localization. By detecting a portion of the signal resulting from sound directly reaching a microphone rather than from a reverberation, accuracy of the localization is improved.
22 Citations
47 Claims
-
1. One or more non-transitory computer-readable media storing computer-executable instructions that, when executed, cause one or more processors to perform acts comprising:
-
acquiring, from a plurality of microphones, a plurality of audio signals associated with an acoustic source; filtering the plurality of audio signals with a band-pass filter; estimating a noise floor of the plurality of audio signals; for individual audio signals of the plurality of audio signals; identifying an event of interest; determining that the event of interest rises above the noise floor; adjusting the noise floor at a rate that is based at least partly on an extent to which the event of interest rises above the noise floor; detecting a peak after the event of interest rose above the noise floor; and determining a time at which the peak occurs; and determining time-difference-of-arrival (TDOA) values for the event of interest based at least in part on a difference between the time at which the peaks occur in the individual audio signals. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. One or more non-transitory computer-readable media storing computer-executable instructions that, when executed, cause one or more processors to perform acts comprising:
-
acquiring a plurality of audio signals associated with an acoustic source; for individual audio signals of the plurality of audio signals; identifying an event of interest by identifying a portion of the audio signal that corresponds to direct sound from the acoustic source and disregarding portions of the audio signal that correspond to indirect sound from the acoustic source; determining that the event of interest rises above a noise floor; detecting a peak after the event of interest rose above the noise floor; and determining a time at which the peak occurs; and determining time-difference-of-arrival (TDOA) values for the event of interest based at least in part on a difference between the time at which the peaks occur in the plurality of filtered audio signals. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A system comprising:
-
a plurality of sensors; a time-difference-of-arrival module coupled to the sensors and configured to; acquire, via the sensors, a plurality of signals associated with an acoustic source; for individual audio signals of the plurality of signals; identify an event of interest; determine that the event of interest rises above a noise floor; based at least partly on the event of interest rising above the noise floor, increase the noise floor at a first rate; determine that the event of interest falls below the noise floor; and based at least partly on the event of interest falling below the noise floor, decrease the noise floor at a second rate different from the first rate. - View Dependent Claims (20, 21, 22)
-
-
23. One or more non-transitory computer-readable media storing computer-executable instructions that, when executed, cause one or more processors to perform acts comprising:
-
acquiring, from a plurality of microphones, a plurality of audio signals associated with an acoustic source generated by a human speaker; estimating a noise floor of the plurality of audio signals; and for individual audio signals of the plurality of audio signals; identifying an event of interest that comprises human speech; determining a time that the event of interest rises above the noise floor; designating a window of samples starting within a threshold amount of time from the time that the event of interest rises above the noise floor; and adjusting the noise floor at a rate that is based at least partly on an extent to which the event of interest rises above the noise floor. - View Dependent Claims (24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35)
-
-
36. One or more non-transitory computer-readable media storing computer-executable instructions that, when executed, cause one or more processors to perform acts comprising:
-
acquiring, from a plurality of microphones, a plurality of audio signals associated with an acoustic source, wherein a portion of an audio signal of the plurality of audio signals is indicative of direct sound traveling directly from the acoustic source and portions of the audio signal are indicative of indirect sound traveling indirectly from the acoustic source; calculating a time-difference-of-arrival value for the audio signal at least partly by disregarding the portions of the audio signal that are indicative of the indirect sound; and determining a location of the acoustic source based at least in part upon the time-difference-of arrival value; and for an audio signal of the plurality of audio signals; estimating a noise floor of the audio signal; identifying an event of interest that comprises human speech; and adjusting the noise floor at a rate that is based at least partly on an extent to which the event of interest rises above the noise floor. - View Dependent Claims (37, 38, 39, 40, 41, 42)
-
-
43. A system comprising:
-
a plurality of sensors; a time-difference-of-arrival module coupled to the sensors and configured to; acquire, from a plurality of sensors, a plurality of audio signals associated with an acoustic source generated by a human speaker; for individual ones of the plurality of the audio signals; identify an event of interest that comprises human speech; determine a time that the event of interest rises above a noise floor; and adjusting the noise floor at a rate that is based at least partly on an extent to which the event of interest rises above the noise floor. - View Dependent Claims (44, 45, 46, 47)
-
Specification