Augmenting Speech Segmentation and Recognition Using Head-Mounted Vibration and/or Motion Sensors
First Claim
1. A method, comprising:
- receiving audio data representative of audio detected by a microphone, wherein the microphone is positioned on a head-mountable device (HMD), and wherein the received audio data comprises audio speech data in an audio-channel speech band;
receiving vibration data representative of vibrations detected by a sensor other than the microphone, wherein the sensor is positioned on the HMD, and wherein the received vibration data comprises vibration speech data in a vibration-channel speech band;
determining that the audio speech data-is causally related to the vibration speech data; and
in response to determining that the audio speech data is causally related to the vibration speech data, generating an indication that the audio data contains HMD-wearer speech.
2 Assignments
0 Petitions
Accused Products
Abstract
Example methods and systems use multiple sensors to determine whether a speaker is speaking. Audio data in an audio-channel speech band detected by a microphone can be received. Vibration data in a vibration-channel speech band representative of vibrations detected by a sensor other than the microphone can be received. The microphone and the sensor can be associated with a head-mountable device (HMD). It is determined whether the audio data is causally related to the vibration data. If the audio data and the vibration data are causally related, an indication can be generated that the audio data contains HMD-wearer speech. Causally related audio and vibration data can be used to increase accuracy of text transcription of the HMD-wearer speech. If the audio data and the vibration data are not causally related, an indication can be generated that the audio data does not contain HMD-wearer speech.
68 Citations
23 Claims
-
1. A method, comprising:
-
receiving audio data representative of audio detected by a microphone, wherein the microphone is positioned on a head-mountable device (HMD), and wherein the received audio data comprises audio speech data in an audio-channel speech band; receiving vibration data representative of vibrations detected by a sensor other than the microphone, wherein the sensor is positioned on the HMD, and wherein the received vibration data comprises vibration speech data in a vibration-channel speech band; determining that the audio speech data-is causally related to the vibration speech data; and in response to determining that the audio speech data is causally related to the vibration speech data, generating an indication that the audio data contains HMD-wearer speech. - View Dependent Claims (2, 3, 4, 5, 6, 7, 21)
-
-
8. A head-mountable device (HMD), comprising:
-
a processor; a microphone; a sensor; a non-transitory computer-readable medium; and program instructions stored on the non-transitory computer-readable medium, wherein the program instructions are executable by the processor to cause the HMD to perform functions comprising; receiving audio data representative of audio detected by the microphone, wherein the received audio data comprises audio speech data in an audio-channel speech band; receiving vibration data representative of vibrations detected by the sensor, wherein the received vibration data comprises vibration speech data in a vibration-channel speech band; determining that the audio speech data-is causally related to the vibration speech data; and in response to determining that the audio speech data-is causally related to the vibration speech data, generating an indication that the audio data contains HMD-wearer speech. - View Dependent Claims (9, 10, 11, 12, 13, 14, 22)
-
-
15. An article of manufacture including a non-transitory computer-readable medium having instructions stored thereon that, when executed by a computing device, cause the computing device to perform functions comprising:
-
receiving audio data representative of audio detected by a microphone positioned on a head-mountable display (HMD), wherein the received audio data comprises audio speech data in an audio-channel speech band; receiving vibration data representative of vibrations detected by a sensor positioned on the HMD, wherein the received vibration data comprises vibration speech data in a vibration-channel speech band; determining that the audio speech data is causally related to the vibration speech data; and in response to determining that the audio speech data is causally related to the vibration speech data, generating an indication that the audio data contains HMD-wearer speech. - View Dependent Claims (16, 17, 18, 19, 20, 23)
-
Specification