Visual indication of a recognized voice-initiated action
First Claim
Patent Images
1. A method comprising:
- while receiving initial audio data indicating an initial portion of voice input, outputting, by a computing device and for display, an initial speech recognition graphical user interface (GUI) including at least one element, wherein the at least one element is output for display in a first visual format to indicate the computing device is receiving the initial audio data;
generating, by the computing device, based at least in part on the initial audio data, a transcription the initial audio data; and
while receiving additional audio data indicating a second portion of the voice input;
determining, by the computing device and based at least in part on a comparison of at least one a word from the transcription to a preconfigured set of actions, a voice-initiated action associated with the initial portion of the voice input; and
responsive to determining the voice-initiated action and prior to executing the voice-initiated action;
outputting, by the computing device and for display, a first updated speech recognition GUI including an animation of a change in a position of the at least one element from the initial speech recognition GUI, wherein the animation of the change in the position indicates that the voice-initiated action has been determined based on the initial audio data;
after outputting the first update speech recognition GUI, outputting, by the computing device and for display, a second updated speech recognition GUI including the at least one element from the initial speech recognition GUI, the at least one element from the initial speech recognition GUI being displayed in a second visual format, different from the first visual format, to further indicate that the voice-initiated action has been determined from the initial audio data; and
executing the voice-initiated action based on the initial audio data and the additional audio data.
2 Assignments
0 Petitions
Accused Products
Abstract
A computing device is described that outputs, for display, an initial speech recognition graphical user interface (GUI) having at least one element. The computing device receives audio data and determines, based on the audio data, a voice-initiated action. Responsive to determining the voice-initiated action, the computing device outputs, for display, an updated speech recognition GUI having an animation of a change in a position of the at least one element to indicate that the voice-initiated action has been determined.
37 Citations
14 Claims
-
1. A method comprising:
-
while receiving initial audio data indicating an initial portion of voice input, outputting, by a computing device and for display, an initial speech recognition graphical user interface (GUI) including at least one element, wherein the at least one element is output for display in a first visual format to indicate the computing device is receiving the initial audio data; generating, by the computing device, based at least in part on the initial audio data, a transcription the initial audio data; and while receiving additional audio data indicating a second portion of the voice input; determining, by the computing device and based at least in part on a comparison of at least one a word from the transcription to a preconfigured set of actions, a voice-initiated action associated with the initial portion of the voice input; and responsive to determining the voice-initiated action and prior to executing the voice-initiated action; outputting, by the computing device and for display, a first updated speech recognition GUI including an animation of a change in a position of the at least one element from the initial speech recognition GUI, wherein the animation of the change in the position indicates that the voice-initiated action has been determined based on the initial audio data; after outputting the first update speech recognition GUI, outputting, by the computing device and for display, a second updated speech recognition GUI including the at least one element from the initial speech recognition GUI, the at least one element from the initial speech recognition GUI being displayed in a second visual format, different from the first visual format, to further indicate that the voice-initiated action has been determined from the initial audio data; and executing the voice-initiated action based on the initial audio data and the additional audio data. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A computing device comprising:
-
at least one processor; and at least one module operable by the at least one processor to; while receiving initial audio data indicating an initial portion of voice input, output, for display, an initial speech recognition graphical user interface (GUI) including at least one element, wherein the at least one element is output for display in a first visual format to indicate the computing device is receiving the initial audio data; generate, based at least in part on the initial audio data, a transcription of the initial audio data; and while receiving additional audio data indicating a second portion of the voice input; determine, based at least in part on a comparison of at least one a word from the transcription to a preconfigured set of actions, a voice-initiated action associated with the initial portion of the voice input; and responsive to determining the voice-initiated action and prior to executing the voice-initiated action; output, for display, a first updated speech recognition GUI including an animation of a change in a position of the at least one element, from the initial speech recognition GUI, wherein the animation of the change in the position indicates that the voice-initiated action has been determined based on the initial audio data; after outputting the first update speech recognition GUI, output, for display, a second updated speech recognition GUI including the at least one element from the initial speech recognition GUI, the at least one element from the initial speech recognition GUI being displayed in a second visual format, different from the first visual format, to further indicate that the voice-initiated action has been determined from the initial audio data; and executing the voice-initiated action based on the initial audio data and the additional audio data. - View Dependent Claims (9, 10)
-
-
11. A non-transitory computer-readable storage medium comprising instructions that, when executed, configure at least one processor to:
-
while receiving initial audio data indicating an initial portion of voice input, output, for display, an initial speech recognition graphical user interface (GUI) including at least one element, wherein the at least one element is output for display in a first visual format to indicate the computing device is receiving the initial audio data; generate, based at least in part on the initial audio data, a transcription of the initial audio data; and while receiving additional audio data indicating a second portion of the voice input; determine, based at least in part on a comparison of at least one a word from the transcription to a preconfigured set of actions, a voice-initiated action associated with the initial portion of the voice input; and responsive to determining the voice-initiated action and prior to executing the voice-initiated action; output, for display, a first updated speech recognition GUI including an animation of a change in a position of the at least one element, from the initial speech recognition GUI, wherein the animation of the change in the position indicates that the voice-initiated action has been determined based on the initial audio data; after outputting the first update speech recognition GUI, outputting, by the computing device and for display, a second updated speech recognition GUI including the at least one element from the initial speech recognition GUI, the at least one element from the initial speech recognition GUI being displayed in a second visual format, different from the first visual format, to further indicate that the voice-initiated action has been determined from the initial audio data; and executing the voice-initiated action based on initial audio data and the additional audio data. - View Dependent Claims (12, 13, 14)
-
Specification