Evaluating pronouns in context
First Claim
1. A computer-implemented method comprising:
- receiving, by a speech recognition engine, data indicating a selection of a control for initiating speech recognition that is presented by a referring application;
receiving, by a speech recognition engine, audio data that encodes an utterance;
determining, by the speech recognition engine, that a transcription of the utterance includes one or more keywords associated with a command, and a pronoun;
transmitting, from the speech recognition engine to the referring application, a disambiguation request, wherein the disambiguation request identifies the pronoun;
receiving, by the speech recognition engine, a response to the disambiguation request from the referring application, wherein the response references an item of content identified by the application; and
generating, by the speech recognition engine, the command using the keywords and the response.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods, computer program products, and systems are described for receiving, by a speech recognition engine, audio data that encodes an utterance and determining, by the speech recognition engine, that a transcription of the utterance includes one or more keywords associated with a command, and a pronoun. In addition, the methods, computer program products, and systems described herein pertain to transmitting a disambiguation request to an application, wherein the disambiguation request identifies the pronoun, receiving, by the speech recognition engine, a response to the disambiguation request, wherein the response references an item of content identified by the application, and generating, by the speech recognition engine, the command using the keywords and the response.
-
Citations
23 Claims
-
1. A computer-implemented method comprising:
-
receiving, by a speech recognition engine, data indicating a selection of a control for initiating speech recognition that is presented by a referring application; receiving, by a speech recognition engine, audio data that encodes an utterance; determining, by the speech recognition engine, that a transcription of the utterance includes one or more keywords associated with a command, and a pronoun; transmitting, from the speech recognition engine to the referring application, a disambiguation request, wherein the disambiguation request identifies the pronoun; receiving, by the speech recognition engine, a response to the disambiguation request from the referring application, wherein the response references an item of content identified by the application; and generating, by the speech recognition engine, the command using the keywords and the response. - View Dependent Claims (2, 3, 4, 5, 6, 7, 9, 10, 11)
-
-
8. A computer-implemented method comprising:
-
determining, by a referring application, that a control for initiating speech recognition that is presented by the referring application has been selected; receiving, by the referring application, a disambiguation request from a speech recognition engine, wherein the disambiguation request identifies a pronoun; identifying, by the referring application, an item of content based on the pronoun; and transmitting, by the referring application, a response to the disambiguation request, wherein the response references the item of content.
-
-
12. A non-transitory computer-readable medium having instructions encoded thereon, which, when executed by a processor, cause the processor to perform operations comprising:
-
receiving, by a speech recognition engine, data indicating a selection of a control for initiating speech recognition that is presented by a referring application; receiving, by a speech recognition engine, audio data that encodes an utterance; determining, by the speech recognition engine, that a transcription of the utterance includes one or more keywords associated with a command, and a pronoun; transmitting, from the speech recognition engine to the referring application, a disambiguation request, wherein the disambiguation request identifies the pronoun; receiving, by the speech recognition engine, a response to the disambiguation request from the referring application, wherein the response references an item of content identified by the application; and generating, by the speech recognition engine, the command using the keywords and the response. - View Dependent Claims (13, 14, 15, 16, 17)
-
-
18. A system comprising:
one or more computers and one or more storage devices storing instructions that are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising; receiving, by a speech recognition engine, data indicating a selection of a control for initiating speech recognition that is presented by a referring application; receiving, by a speech recognition engine, audio data that encodes an utterance; determining, by the speech recognition engine, that a transcription of the utterance includes one or more keywords associated with a command, and a pronoun; transmitting, from the speech recognition engine to the referring application, a disambiguation request, wherein the disambiguation request identifies the pronoun; receiving, by the speech recognition engine, a response to the disambiguation request from the referring application, wherein the response references an item of content identified by the application; and generating, by the speech recognition engine, the command using the keywords and the response. - View Dependent Claims (19, 20, 21, 22, 23)
Specification