Multisensory speech detection
First Claim
Patent Images
1. A method comprising:
- detecting, by data processing hardware of a mobile device, movement of the mobile device from a first pose to a second pose, the second pose corresponding to the mobile device in a talking pose proximate to a part of a user of the mobile device;
in response to detecting the movement of the mobile device from the first pose to the second pose;
initiating, by the data processing hardware, execution of an audio recording process using a microphone of the mobile device; and
notifying, by the data processing hardware, the user of the mobile device when execution of the audio recording process starts by;
generating a visual notification that indicates to the user when execution of the audio recording process starts; and
displaying the visual notification on a user interface of the mobile device, wherein the visual notification comprises a microphone graphic;
receiving, at the data processing hardware, a speech utterance of the user captured by the microphone during execution of the audio recording process; and
generating, by the data processing hardware, a transcription of the speech utterance captured by the microphone during the audio recording process.
0 Assignments
0 Petitions
Accused Products
Abstract
A computer-implemented method of multisensory speech detection is disclosed. The method comprises determining an orientation of a mobile device and determining an operating mode of the mobile device based on the orientation of the mobile device. The method further includes identifying speech detection parameters that specify when speech detection begins or ends based on the determined operating mode and detecting speech from a user of the mobile device based on the speech detection parameters.
92 Citations
28 Claims
-
1. A method comprising:
-
detecting, by data processing hardware of a mobile device, movement of the mobile device from a first pose to a second pose, the second pose corresponding to the mobile device in a talking pose proximate to a part of a user of the mobile device; in response to detecting the movement of the mobile device from the first pose to the second pose; initiating, by the data processing hardware, execution of an audio recording process using a microphone of the mobile device; and notifying, by the data processing hardware, the user of the mobile device when execution of the audio recording process starts by; generating a visual notification that indicates to the user when execution of the audio recording process starts; and displaying the visual notification on a user interface of the mobile device, wherein the visual notification comprises a microphone graphic; receiving, at the data processing hardware, a speech utterance of the user captured by the microphone during execution of the audio recording process; and generating, by the data processing hardware, a transcription of the speech utterance captured by the microphone during the audio recording process. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A mobile device comprising:
-
data processing hardware; and memory hardware in communication with the data processing hardware and storing instructions that when executed, cause the data processing hardware to perform operations comprising; detecting movement of the mobile device from a first pose to a second pose, the second pose corresponding to the mobile device in a talking pose proximate to a part of a user of the mobile device; in response to detecting the movement of the mobile device from the first pose to the second pose; initiating execution of an audio recording process using a microphone of the mobile device; notifying the user of the mobile device when execution of the audio recording process starts by; generating a visual notification that indicates to the user when execution of the audio recording process starts; and displaying the visual notification on a user interface of the mobile device, wherein the visual notification comprises a microphone graphic; receiving a speech utterance of the user captured by the microphone during execution of the audio recording process; and generating a transcription of the speech utterance captured by the microphone during the audio recording process. - View Dependent Claims (16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28)
-
Specification