Visual confirmation for a recognized voice-initiated action
First Claim
1. A method comprising:
outputting, by a first application executing at a computing device and for display, a speech recognition graphical user interface (GUI) having at least one non-textual element in a first visual format;
receiving, by the first application executing at the computing device, first audio data of a voice command that indicates one or more words of the voice command;
determining, by the first application executing at the computing device, based on the one or more words of the voice command, a voice-initiated action indicated by the first audio data of the voice command, wherein the voice-initiated action is a particular voice-initiated action from a plurality of voice-initiated actions and the voice-initiated action is associated with a second application that is different than the first application;
responsive to determining the voice-initiated action indicated by the first audio data of the voice command, and while receiving second audio data of the voice command that indicates one or more additional words of the voice command, and prior to executing the second application to perform the voice command, outputting, by the first application executing at the computing device, for display, an updated speech recognition GUI in which the at least one non-textual element, from the speech recognition GUI, transitions from being displayed in the first visual format to being displayed in a second visual format, different from the first visual format, indicating that the voice-initiated action is the particular voice-initiated action from the plurality of voice-initiated actions that has been determined from the first audio data of the voice command, wherein:
the first visual format of the at least one non-textual element is a first image representative of a speech recognition mode of the first application,
the second visual format of the at least one non-textual element is a second image that replaces the first image and corresponds to the voice-initiated action from the plurality of voice-initiated actions, and
the second image is different from other images corresponding to one or more other voice-initiated actions from the plurality of voice-initiated actions; and
after outputting the updated speech recognition GUI and after receiving the second audio data of the voice command, executing, by the computing device, based on the first audio data and the second audio data, the second application that performs the voice-initiated action indicated by the voice command.
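The sequence recited in claim 1 can be illustrated with a minimal Python sketch. It is only one possible reading of the claim, not the patentee's implementation. The trigger words, icon names, the `SpeechRecognitionGUI` class, and the `launch` callback are all hypothetical, and already-recognized words stand in for incoming audio data.

```python
from dataclasses import dataclass, field

# Hypothetical trigger words -> (action name, icon shown in the second visual format).
# Each action maps to its own distinct icon, mirroring the "different from other
# images" limitation.
ACTIONS = {
    "navigate": ("navigate", "icon_map"),
    "call": ("call", "icon_phone"),
    "play": ("play_media", "icon_music"),
}
LISTENING_ICON = "icon_microphone"  # first visual format: speech recognition mode


@dataclass
class SpeechRecognitionGUI:
    icon: str = LISTENING_ICON
    history: list = field(default_factory=lambda: [LISTENING_ICON])

    def show(self, icon: str) -> None:
        """Replace the displayed image and record the transition."""
        if icon != self.icon:
            self.icon = icon
            self.history.append(icon)


def handle_voice_command(words, gui, launch):
    """Consume words as they arrive; swap the icon before anything is launched."""
    action = None
    heard = []
    for word in words:  # later words keep arriving after the icon has changed
        heard.append(word)
        if action is None and word.lower() in ACTIONS:
            action, icon = ACTIONS[word.lower()]
            gui.show(icon)  # visual confirmation, prior to executing the action
    if action is not None:
        # The second application runs only once the whole command has been received.
        launch(action, " ".join(heard))
    return action


launched = []
gui = SpeechRecognitionGUI()
result = handle_voice_command(
    ["navigate", "to", "the", "airport"],
    gui,
    lambda action, text: launched.append((action, text, gui.icon)),
)
```

In this sketch the icon changes as soon as the word "navigate" is seen, while the remaining words are still being consumed, and the stand-in for the second application is invoked only at the end with the full command text.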
Abstract
Techniques described herein provide a computing device configured to provide an indication that the computing device has recognized a voice-initiated action. In one example, a method is provided for outputting, by a computing device and for display, a speech recognition graphical user interface (GUI) having at least one element in a first visual format. The method further includes receiving, by the computing device, audio data and determining, by the computing device, a voice-initiated action based on the audio data. The method also includes outputting, while receiving additional audio data and prior to executing a voice-initiated action based on the audio data, and for display, an updated speech recognition GUI in which the at least one element is displayed in a second visual format, different from the first visual format, to indicate that the voice-initiated action has been identified.
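The timing constraint in the abstract (the updated GUI appears after the action is identified, while further audio is still arriving, and before the action is executed) can be expressed as a check over an event trace. The following is a rough sketch under stated assumptions: the event names and the `follows_described_order` helper are hypothetical and do not come from the patent, and "while receiving additional audio data" is approximated as "at least one audio event follows the GUI update".

```python
def follows_described_order(events):
    """Return True if the trace matches the ordering described in the abstract.

    The GUI update must come after the action is identified, before the last
    audio chunk arrives, and before the action is executed.
    """
    try:
        identified = events.index("action_identified")
        updated = events.index("gui_updated")
        executed = events.index("action_executed")
        last_audio = max(i for i, e in enumerate(events) if e == "audio_received")
    except ValueError:
        # A required event is missing from the trace.
        return False
    return identified < updated < last_audio < executed


good_trace = [
    "gui_shown",
    "audio_received",
    "action_identified",
    "gui_updated",
    "audio_received",
    "action_executed",
]
late_update = [
    "gui_shown",
    "audio_received",
    "action_identified",
    "audio_received",
    "action_executed",
    "gui_updated",
]
good_ok = follows_described_order(good_trace)
late_ok = follows_described_order(late_update)
```

A trace in which the GUI changes only after execution, as in `late_update`, fails the check, which is the case the described technique is meant to avoid.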
49 Citations
20 Claims
1. A method comprising:
outputting, by a first application executing at a computing device and for display, a speech recognition graphical user interface (GUI) having at least one non-textual element in a first visual format;
receiving, by the first application executing at the computing device, first audio data of a voice command that indicates one or more words of the voice command;
determining, by the first application executing at the computing device, based on the one or more words of the voice command, a voice-initiated action indicated by the first audio data of the voice command, wherein the voice-initiated action is a particular voice-initiated action from a plurality of voice-initiated actions and the voice-initiated action is associated with a second application that is different than the first application;
responsive to determining the voice-initiated action indicated by the first audio data of the voice command, and while receiving second audio data of the voice command that indicates one or more additional words of the voice command, and prior to executing the second application to perform the voice command, outputting, by the first application executing at the computing device, for display, an updated speech recognition GUI in which the at least one non-textual element, from the speech recognition GUI, transitions from being displayed in the first visual format to being displayed in a second visual format, different from the first visual format, indicating that the voice-initiated action is the particular voice-initiated action from the plurality of voice-initiated actions that has been determined from the first audio data of the voice command, wherein:
the first visual format of the at least one non-textual element is a first image representative of a speech recognition mode of the first application,
the second visual format of the at least one non-textual element is a second image that replaces the first image and corresponds to the voice-initiated action from the plurality of voice-initiated actions, and
the second image is different from other images corresponding to one or more other voice-initiated actions from the plurality of voice-initiated actions; and
after outputting the updated speech recognition GUI and after receiving the second audio data of the voice command, executing, by the computing device, based on the first audio data and the second audio data, the second application that performs the voice-initiated action indicated by the voice command.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16
17. A computing device, comprising:
a display device;
one or more processors; and
a memory that stores instructions associated with a first application that when executed cause the one or more processors to:
output, for display at the display device, a speech recognition graphical user interface (GUI) having at least one non-textual element in a first visual format;
receive first audio data of a voice command that indicates one or more words of the voice command;
determine, based on the one or more words of the voice command, a voice-initiated action indicated by the first audio data of the voice command, wherein the voice-initiated action is a particular voice-initiated action from a plurality of voice-initiated actions and the voice-initiated action is associated with a second application that is different than the first application;
responsive to determining the voice-initiated action indicated by the first audio data of the voice command, and while receiving second audio data of the voice command that indicates one or more additional words of the voice command, and prior to executing the second application to perform the voice command, output, for display at the display device, an updated speech recognition GUI in which the at least one non-textual element, from the speech recognition GUI, transitions from being displayed in the first visual format to being displayed in a second visual format, different from the first visual format, indicating that the voice-initiated action is the particular voice-initiated action from the plurality of voice-initiated action that has been determined from the first audio data of the voice command, wherein:
the first visual format of the at least one non-textual element is a first image representative of a speech recognition mode of the first application,
the second visual format of the at least one non-textual element is a second image that replaces the first image and corresponds to the voice-initiated action from the plurality of voice-initiated actions, and
the second image is different from other images corresponding to one or more other voice-initiated actions from the plurality of voice-initiated actions; and
after outputting the updated speech recognition GUI and after receiving the second audio data of the voice command, execute, based on the first audio data and the second audio data, the second application that performs the voice-initiated action indicated by the voice command.
Dependent claims: 18
19. A non-transitory computer-readable storage medium encoded with instructions associated with a first application that, when executed, cause one or more processors of a computing device to:
output, for display at the display device, a speech recognition graphical user interface (GUI) having at least one non-textual element in a first visual format;
receive first audio data of a voice command that indicates one or more words of the voice command;
determine, based on the one or more words of the voice command, a voice-initiated action indicated by the first audio data of the voice command, wherein the voice-initiated action is a particular voice-initiated action from a plurality of voice-initiated actions and the voice-initiated action is associated with a second application that is different than the first application;
responsive to determining the voice-initiated action indicated by the first audio data of the voice command, and while receiving second audio data of the voice command that indicates one or more additional words of the voice command, and prior to executing the second application to perform the voice command, output, for display at the display device, an updated speech recognition GUI in which the at least one non-textual element, from the speech recognition GUI, transitions from being displayed in the first visual format to being displayed in a second visual format, different from the first visual format, indicating that the voice-initiated action is the particular voice-initiated action from the plurality of voice-initiated action that has been determined from the first audio data of the voice command, wherein:
the first visual format of the at least one non-textual element is a first image representative of a speech recognition mode of the first application,
the second visual format of the at least one non-textual element is a second image that replaces the first image and corresponds to the voice-initiated action from the plurality of voice-initiated actions, and
the second image is different from other images corresponding to one or more other voice-initiated actions from the plurality of voice-initiated actions; and
after outputting the updated speech recognition GUI and after receiving the second audio data of the voice command, execute, based on the first audio data and the second audio data, a second application that performs the voice-initiated action indicated by the voice command.
Dependent claims: 20
Specification