Transient sound event detection
First Claim
Patent Images
1. A method comprising:
- receiving an audio signal from a microphone;
determining a portion of the audio signal corresponding to speech;
determining a prediction error signal that excludes the portion corresponding to the speech;
analyzing the prediction error signal to detect a first audible tap that occurred from a first location and a second audible tap that occurred from a second location;
identifying a portion of the prediction error signal that corresponds in time to the first audible tap;
identifying a portion of the prediction error signal that corresponds in time to the second audible tap;
calculating a first time-difference-of-arrival (TDOA) value based at least in part on the portion of the prediction error signal corresponding in time to the first audible tap;
calculating a second TDOA value based at least in part on the portion of the prediction error signal corresponding in time to the second audible tap;
determining that the first location and the second location are a common location based at least in part on the first TDOA value and the second TDOA value;
determining that the first audible tap and the second audible tap occurred within a predefined time period;
determining an audio double-tap has occurred based at least in part on (a) the first audible tap and the second audible tap originating from the common location and (b) the first audible tap and the second audible tap occurring within the predefined time period; and
causing a computing device to execute a command based at least in part on determining that the audio double-tap has occurred.
2 Assignments
0 Petitions
Accused Products
Abstract
Techniques are described for detecting and/or validating audible taps, double-taps, or other tapped audio sequences generated by a user. A predicted voice signal is subtracted from an audio signal to produce a prediction error signal. The prediction error signal is analyzed in terms of dynamics and periodicity to determine whether it contains an audio pulse. In certain cases, prediction error signals corresponding to multiple microphones are analyzed to determine position information regarding the individual taps of a double-tap. A double-tap is validated and acted upon when the individual taps originate from the same location and the taps occur within a given time period.
6 Citations
21 Claims
-
1. A method comprising:
-
receiving an audio signal from a microphone; determining a portion of the audio signal corresponding to speech; determining a prediction error signal that excludes the portion corresponding to the speech; analyzing the prediction error signal to detect a first audible tap that occurred from a first location and a second audible tap that occurred from a second location; identifying a portion of the prediction error signal that corresponds in time to the first audible tap; identifying a portion of the prediction error signal that corresponds in time to the second audible tap; calculating a first time-difference-of-arrival (TDOA) value based at least in part on the portion of the prediction error signal corresponding in time to the first audible tap; calculating a second TDOA value based at least in part on the portion of the prediction error signal corresponding in time to the second audible tap; determining that the first location and the second location are a common location based at least in part on the first TDOA value and the second TDOA value; determining that the first audible tap and the second audible tap occurred within a predefined time period; determining an audio double-tap has occurred based at least in part on (a) the first audible tap and the second audible tap originating from the common location and (b) the first audible tap and the second audible tap occurring within the predefined time period; and causing a computing device to execute a command based at least in part on determining that the audio double-tap has occurred. - View Dependent Claims (2, 3, 4)
-
-
5. A method comprising:
-
receiving an audio signal from a microphone; determining a portion of the audio signal corresponding to speech; determining a prediction error signal that excludes the portion corresponding to the speech; determining a dynamic range of the prediction error signal; determining a periodicity of the prediction error signal; determining, by a processor, an occurrence of an audible tap based at least in part on the dynamic range being greater than a first threshold and the periodicity being less than a second threshold; and causing a computing device to execute a command based at least in part on determining the occurrence of the audible tap. - View Dependent Claims (6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A method comprising:
-
receiving an audio signal; determining a portion of the audio signal corresponding to speech; determining, by one or more processors, a prediction error signal via linear predictive coding, the prediction error signal excluding the portion corresponding to the speech; determining, by the one or more processors, an occurrence of an audible tap based at least in part on analyzing the prediction error signal; and causing a computing device to execute a command based at least in part on determining the occurrence of the audible tap. - View Dependent Claims (15, 16, 17, 18, 19, 20, 21)
-
Specification