System and method for speech-based navigation and interaction with a device's visible screen elements using a corresponding view hierarchy
First Claim
1. A computer-implemented method comprising:
- providing, for display, a first user interface that includes a first user interface element that corresponds to a text string and a second, different user interface element that corresponds to the text string;
receiving audio data of an utterance that includes the text string;
providing, for display, a second user interface that indicates that both the first user interface element and the second, different user interface element correspond to the utterance;
receiving data indicating a selection of a particular one of the first user interface element or the second, different user interface element; and
in response to receiving the data indicating the selection of the particular one of the first user interface element or the second, different user interface element;
performing an action corresponding to the particular one of the first user interface element or the second, different user interface element; and
storing data indicating to perform the action corresponding to the particular one of the first user interface element or the second, different user interface element (i) in response to receiving additional audio data of an additional utterance that includes the text string, (ii) without providing, for display, the second user interface, and (iii) without receiving data indicating an additional selection of the particular one of the first user interface element or the second, different user interface element.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for enabling screen-specific user interfacing with elements of viewable screens presented by an electronic device are disclosed. In one aspect, a method includes the actions of identifying a character sequence representing a first input that is received while displaying a viewable screen having at least one selectable viewable element. The actions further include accessing an electronic file that provides a text representation of one or more of the at least one selectable viewable element. The actions further include comparing the character sequence to the text representation. The actions further include selecting, within the viewable screen, a selectable viewable element whose text representation matches the character sequence. The actions further include triggering any action linked to the selecting the selectable viewable element.
27 Citations
20 Claims
-
1. A computer-implemented method comprising:
-
providing, for display, a first user interface that includes a first user interface element that corresponds to a text string and a second, different user interface element that corresponds to the text string; receiving audio data of an utterance that includes the text string; providing, for display, a second user interface that indicates that both the first user interface element and the second, different user interface element correspond to the utterance; receiving data indicating a selection of a particular one of the first user interface element or the second, different user interface element; and in response to receiving the data indicating the selection of the particular one of the first user interface element or the second, different user interface element; performing an action corresponding to the particular one of the first user interface element or the second, different user interface element; and storing data indicating to perform the action corresponding to the particular one of the first user interface element or the second, different user interface element (i) in response to receiving additional audio data of an additional utterance that includes the text string, (ii) without providing, for display, the second user interface, and (iii) without receiving data indicating an additional selection of the particular one of the first user interface element or the second, different user interface element. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A system comprising:
one or more computers and one or more storage devices storing instructions that are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising; providing, for display, a first user interface that includes a first user interface element that corresponds to a text string and a second, different user interface element that corresponds to the text string; receiving audio data of an utterance that includes the text string; providing, for display, a second user interface that indicates that both the first user interface element and the second, different user interface element correspond to the utterance; receiving data indicating a selection of a particular one of the first user interface element or the second, different user interface element; and in response to receiving the data indicating the selection of the particular one of the first user interface element or the second, different user interface element; performing an action corresponding to the particular one of the first user interface element or the second, different user interface element; and storing data indicating to perform the action corresponding to the particular one of the first user interface element or the second, different user interface element (i) in response to receiving additional audio data of an additional utterance that includes the text string, (ii) without providing, for display, the second user interface, and (iii) without receiving data indicating an additional selection of the particular one of the first user interface element or the second, different user interface element. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
19. A non-transitory computer-readable medium storing software comprising instructions executable by one or more computers which, upon such execution, cause the one or more computers to perform operations comprising:
-
providing, for display, a first user interface that includes a first user interface element that corresponds to a text string and a second, different user interface element that corresponds to the text string; receiving audio data of an utterance that includes the text string; providing, for display, a second user interface that indicates that both the first user interface element and the second, different user interface element correspond to the utterance; receiving data indicating a selection of a particular one of the first user interface element or the second, different user interface element; and in response to receiving the data indicating the selection of the particular one of the first user interface element or the second, different user interface element; performing an action corresponding to the particular one of the first user interface element or the second, different user interface element; and storing data indicating to perform the action corresponding to the particular one of the first user interface element or the second, different user interface element (i) in response to receiving additional audio data of an additional utterance that includes the text string, (ii) without providing, for display, the second user interface, and (iii) without receiving data indicating an additional selection of the particular one of the first user interface element or the second, different user interface element. - View Dependent Claims (20)
-
Specification