Multisensory speech detection
First Claim
Patent Images
1. A computer-implemented method, comprising:
- initiating, by a mobile computing device, an audio recording process using a microphone of the mobile computing device;
identifying, by the mobile computing device, a first orientation of the mobile computing device at a beginning of the audio recording process;
determining, during the audio recording process, that the mobile computing device has transitioned from the first orientation to a second orientation;
in response to determining that the mobile computing device has transitioned from the first orientation to a second orientation, determining a speech energy threshold for comparing to speech received during the audio recording process and while the mobile computing device is in the second orientation;
comparing an energy of the speech received during the audio recording process and while the mobile computing device is in the second orientation to the speech energy threshold;
determining an end of speech condition based on (i) the energy of the speech received during the audio recording process and while the mobile computing device is in the second orientation and (ii) the speech energy threshold; and
stopping, based on determining the end of speech condition, the audio recording process.
2 Assignments
0 Petitions
Accused Products
Abstract
A computer-implemented method of multisensory speech detection is disclosed. The method comprises determining an orientation of a mobile device and determining an operating mode of the mobile device based on the orientation of the mobile device. The method further includes identifying speech detection parameters that specify when speech detection begins or ends based on the determined operating mode and detecting speech from a user of the mobile device based on the speech detection parameters.
93 Citations
20 Claims
-
1. A computer-implemented method, comprising:
-
initiating, by a mobile computing device, an audio recording process using a microphone of the mobile computing device; identifying, by the mobile computing device, a first orientation of the mobile computing device at a beginning of the audio recording process; determining, during the audio recording process, that the mobile computing device has transitioned from the first orientation to a second orientation; in response to determining that the mobile computing device has transitioned from the first orientation to a second orientation, determining a speech energy threshold for comparing to speech received during the audio recording process and while the mobile computing device is in the second orientation; comparing an energy of the speech received during the audio recording process and while the mobile computing device is in the second orientation to the speech energy threshold; determining an end of speech condition based on (i) the energy of the speech received during the audio recording process and while the mobile computing device is in the second orientation and (ii) the speech energy threshold; and stopping, based on determining the end of speech condition, the audio recording process. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A non-transitory computer storage medium encoded with a computer program, the program comprising instructions that when executed by data processing apparatus cause the data processing apparatus to perform operations comprising:
-
initiating, by a mobile computing device, an audio recording process using a microphone of the mobile computing device; identifying, by the mobile computing device, a first orientation of the mobile computing device at a beginning of the audio recording process; determining, during the audio recording process, that the mobile computing device has transitioned from the first orientation to a second orientation; in response to determining that the mobile computing device has transitioned from the first orientation to a second orientation, determining a speech energy threshold for comparing to speech received during the audio recording process and while the mobile computing device is in the second orientation; comparing an energy of the speech received during the audio recording process and while the mobile computing device is in the second orientation to the speech energy threshold; determining an end of speech condition based on (i) the energy of the speech received during the audio recording process and while the mobile computing device is in the second orientation and (ii) the speech energy threshold; and stopping, based on determining the end of speech condition, the audio recording process. - View Dependent Claims (10, 11, 12, 13, 14, 15)
-
-
16. A system comprising:
one or more computers and one or more storage devices storing instructions that are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising; initiating, by a mobile computing device, an audio recording process using a microphone of the mobile computing device; identifying, by the mobile computing device, a first orientation of the mobile computing device at a beginning of the audio recording process; determining, during the audio recording process, that the mobile computing device has transitioned from the first orientation to a second orientation; in response to determining that the mobile computing device has transitioned from the first orientation to a second orientation, determining a speech energy threshold for comparing to speech received during the audio recording process and while the mobile computing device is in the second orientation; comparing an energy of the speech received during the audio recording process and while the mobile computing device is in the second orientation to the speech energy threshold; determining an end of speech condition based on (i) the energy of the speech received during the audio recording process and while the mobile computing device is in the second orientation and (ii) the speech energy threshold; and stopping, based on determining the end of speech condition, the audio recording process. - View Dependent Claims (17, 18, 19, 20)
Specification