System and method of detecting a user's voice activity using an accelerometer
First Claim
1. A method of detecting a user'"'"'s voice activity in a mobile device comprising:
- generating by a voice activity detector (VAD) a VAD output based on (i) acoustic signals received from microphones included in the mobile device and (ii) data output by an inertial sensor that is included in an earphone portion of the mobile device, the inertial sensor to detect vibration of the user'"'"'s vocal chords modulated by the user'"'"'s vocal tract based on vibrations in bones and tissue of the user'"'"'s head, wherein generating the VAD output comprises;
detecting voiced speech included in the acoustic signals,detecting the vibration of the user'"'"'s vocal chords from the data output by the inertial sensor,computing the coincidence of the detected speech in acoustic signals and the vibration of the user'"'"'s vocal chords, andsetting the VAD output to indicate that the user'"'"'s voiced speech is detected if the coincidence is detected and setting the VAD output to indicate that the user'"'"'s voiced speech is not detected if the coincidence is not detected.
1 Assignment
0 Petitions
Accused Products
Abstract
A method of detecting a user'"'"'s voice activity in a mobile device is described herein. The method starts with a voice activity detector (VAD) generating a VAD output based on (i) acoustic signals received from microphones included in the mobile device and (ii) data output by an inertial sensor that is included in an earphone portion of the mobile device. The inertial sensor may detect vibration of the user'"'"'s vocal chords modulated by the user'"'"'s vocal tract based on vibrations in bones and tissue of the user'"'"'s head. A noise suppressor may then receive the acoustic signals from the microphones and the VAD output and suppress the noise included in the acoustic signals received from the microphones based on the VAD output. The method may also include steering one or more beamformers based on the VAD output. Other embodiments are also described.
32 Citations
35 Claims
-
1. A method of detecting a user'"'"'s voice activity in a mobile device comprising:
-
generating by a voice activity detector (VAD) a VAD output based on (i) acoustic signals received from microphones included in the mobile device and (ii) data output by an inertial sensor that is included in an earphone portion of the mobile device, the inertial sensor to detect vibration of the user'"'"'s vocal chords modulated by the user'"'"'s vocal tract based on vibrations in bones and tissue of the user'"'"'s head, wherein generating the VAD output comprises; detecting voiced speech included in the acoustic signals, detecting the vibration of the user'"'"'s vocal chords from the data output by the inertial sensor, computing the coincidence of the detected speech in acoustic signals and the vibration of the user'"'"'s vocal chords, and setting the VAD output to indicate that the user'"'"'s voiced speech is detected if the coincidence is detected and setting the VAD output to indicate that the user'"'"'s voiced speech is not detected if the coincidence is not detected. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A mobile device detecting a user'"'"'s voice activity comprising:
-
an accelerometer to detect vibration of the user'"'"'s vocal chords modulated by the user'"'"'s vocal tract based on vibrations in bones and tissue of the user'"'"'s head, wherein the accelerometer is included in an earphone portion of the mobile device; a voice activity detector (VAD) coupled to the accelerometer, the VAD to generate a VAD output based on (i) acoustic signals received from microphones included in the mobile device and (ii) data output by the accelerometer, wherein the VAD generates the VAD output by; detecting speech included in the acoustic signals, detecting the vibrations of the user'"'"'s vocal chords from the data output by the accelerometer, computing the coincidence of the detected speech in acoustic signals and the vibrations of the user'"'"'s vocal chords, and setting the VAD output to indicate that the user'"'"'s voiced speech is detected if the coincidence is detected and setting the VAD output to indicate that the user'"'"'s voiced speech is not detected if the coincidence is not detected; and a noise suppressor coupled to the microphones and the VAD, the noise suppressor to suppress noise from the acoustic signals from the microphones based on the VAD output. - View Dependent Claims (20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35)
-
Specification