Allowing spelling of arbitrary words
First Claim
Patent Images
1. A computer-implemented method comprising:
- receiving, at a user device, a first voice input from a user of the user device, the first voice input comprising a search query;
transmitting the first voice input over a network from the user device to a search system, the first voice input when received by the search system causing the search system to;
generate a transcription of the search query based on the first voice input, the transcription of the search query including one or more terms; and
transmit the transcription of the search query over the network to the user device;
displaying, by the user device, the transcription of the search query in a user interface of the user device;
receiving, at the user device, a selection indication indicating a user selection in the user interface of one of the one or more terms of the transcription of the search query displayed in the user interface;
receiving, at the user device, a second voice input from the user, the second voice input including a series of letters that spells out a correction of the selected term of the transcription of the search query; and
transmitting the second voice input over the network from the user device to the search system, the second voice input when received by the search system causing the search system to;
generate an updated transcription of the search query based on the first voice input and the second voice input;
obtain one or more search results responsive to the updated transcription of the search query; and
transmit the updated transcription of the search query and the one or more search results over the network to the user device.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for natural language processing. One of the methods includes receiving a first voice input from a user device; generating a first recognition output; receiving a user selection of one or more terms in the first recognition output; receiving a second voice input spelling a correction of the user selection; determining a corrected recognition output for the selected portion; and providing a second recognition output that merges the first recognition output and the corrected recognition output.
-
Citations
20 Claims
-
1. A computer-implemented method comprising:
-
receiving, at a user device, a first voice input from a user of the user device, the first voice input comprising a search query; transmitting the first voice input over a network from the user device to a search system, the first voice input when received by the search system causing the search system to; generate a transcription of the search query based on the first voice input, the transcription of the search query including one or more terms; and transmit the transcription of the search query over the network to the user device; displaying, by the user device, the transcription of the search query in a user interface of the user device; receiving, at the user device, a selection indication indicating a user selection in the user interface of one of the one or more terms of the transcription of the search query displayed in the user interface; receiving, at the user device, a second voice input from the user, the second voice input including a series of letters that spells out a correction of the selected term of the transcription of the search query; and transmitting the second voice input over the network from the user device to the search system, the second voice input when received by the search system causing the search system to; generate an updated transcription of the search query based on the first voice input and the second voice input; obtain one or more search results responsive to the updated transcription of the search query; and transmit the updated transcription of the search query and the one or more search results over the network to the user device. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A system comprising:
one or more computers of a user device and one or more storage devices of the user device storing instructions that are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising; receiving a first voice input from a user of the user device, the first voice input comprising a search query; transmitting the first voice input over a network from the user device to a search system, the first voice input when received by the search system causing the search system to; generate a transcription of the search query based on the first voice input, the transcription of the search query including one or more terms; and transmit the transcription of the search query over the network to the user device; displaying the transcription of the search query in a user interface of the user device; receiving a selection indication indicating a user selection in the user interface of one of the one or more terms of the transcription of the search query displayed in the user interface; receiving a second voice input from the user, the second voice input including a series of letters that spells out a correction of the selected term of the transcription of the search query; and transmitting the second voice input over the network from the user device to the search system, the second voice input when received by the search system causing the search system to; generate an updated transcription of the search query based on the first voice input and the second voice input; obtain one or more search results responsive to the updated transcription of the search query; and transmit the updated transcription of the search query and the one or more search results over the network to the user device. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
15. One or more non-transitory computer-readable storage media of a user device encoded with instructions that, when executed by one or more computers of the user device, cause the one or more computers to perform operations comprising:
-
receiving a first voice input from a user of the user device, the first voice input comprising a search query; transmitting the first voice input over a network from the user device to a search system, the first voice input when received by the search system causing the search system to; generate a transcription of the search query based on the first voice input, the transcription of the search query including one or more terms; and transmit the transcription of the search query over the network to the user device; displaying the transcription of the search query in a user interface of the user device; receiving a selection indication indicating a user selection in the user interface of one of the one or more terms of the transcription of the search query displayed in the user interface; receiving a second voice input from the user, the second voice input including a series of letters that spells out a correction of the selected term of the transcription of the search query; and transmitting the second voice input over the network from the user device to the search system, the second voice input when received by the search system causing the search system to; generate an updated transcription of the search query based on the first voice input and the second voice input; obtain one or more search results responsive to the updated transcription of the search query; and transmit the updated transcription of the search query and the one or more search results over the network to the user device. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification