Evaluating pronouns in context
First Claim
1. A computer-implemented method comprising:
- obtaining, by a speech recognition engine implemented on a mobile computing device, a transcription of an utterance encoded in an audio signal;
determining, by the speech recognition engine, that the transcription includes a pronoun and one or more keywords associated with a command;
disambiguating, by the speech recognition engine, the pronoun based on an item of content that is identified by a referring application, wherein the referring application is an application executing on the mobile computing device through which recording of the audio signal was initiated;
generating, by the speech recognition engine, the command using the keywords and the disambiguated pronoun; and
submitting the generated command for execution.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods, computer program products, and systems are described for receiving, by a speech recognition engine, audio data that encodes an utterance and determining, by the speech recognition engine, that a transcription of the utterance includes one or more keywords associated with a command, and a pronoun. In addition, the methods, computer program products, and systems described herein pertain to transmitting a disambiguation request to an application, wherein the disambiguation request identifies the pronoun, receiving, by the speech recognition engine, a response to the disambiguation request, wherein the response references an item of content identified by the application, and generating, by the speech recognition engine, the command using the keywords and the response.
-
Citations
20 Claims
-
1. A computer-implemented method comprising:
-
obtaining, by a speech recognition engine implemented on a mobile computing device, a transcription of an utterance encoded in an audio signal; determining, by the speech recognition engine, that the transcription includes a pronoun and one or more keywords associated with a command; disambiguating, by the speech recognition engine, the pronoun based on an item of content that is identified by a referring application, wherein the referring application is an application executing on the mobile computing device through which recording of the audio signal was initiated; generating, by the speech recognition engine, the command using the keywords and the disambiguated pronoun; and submitting the generated command for execution. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A system comprising:
-
one or more computers and one or more storage devices storing instructions that are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising; obtaining, by a speech recognition engine implemented on a mobile computing device, a transcription of an utterance encoded in an audio signal; determining, by the speech recognition engine, that the transcription includes a pronoun and one or more keywords associated with a command; disambiguating, by the speech recognition engine, the pronoun based on an item of content that is identified by a referring application, wherein the referring application is an application executing on the mobile computing device through which recording of the audio signal was initiated; generating, by the speech recognition engine, the command using the keywords and the disambiguated pronoun; and submitting the generated command for execution. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A non-transitory computer-readable medium storing software comprising instructions executable by one or more computers which, upon such execution, cause the one or more computers to perform operations comprising:
-
obtaining, by a speech recognition engine implemented on a mobile computing device, a transcription of an utterance encoded in an audio signal; determining, by the speech recognition engine, that the transcription includes a pronoun and one or more keywords associated with a command; disambiguating, by the speech recognition engine, the pronoun based on an item of content that is identified by a referring application, wherein the referring application is an application executing on the mobile computing device through which recording of the audio signal was initiated; generating, by the speech recognition engine, the command using the keywords and the disambiguated pronoun; and submitting the generated command for execution. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification