Correcting voice recognition using selective re-speak
First Claim
1. A computer-implemented method executed by a server system comprising one or more computers, the method comprising:
- providing, by the server system, first text for display on a computing device of a user, the first text being received from a first speech recognition engine, the first speech recognition engine having converted first speech received from the computing device into the first text by processing the first speech to generate multiple potential texts and associating each of the multiple potential texts with a respective plurality of entities, and the first text being displayed as a search query prior to executing the search query to obtain search results;
receiving, by the server system, a speech correction indication from the computing device, the speech correction indication (i) initiating a correction of the first text, (ii) providing context to select a portion of the first text that is to be corrected without explicitly indicating the portion of the first text to be corrected and without repeating the first text, and (iii) providing context for selecting second text to correct the portion of the first text without explicitly reciting the second text prior to executing the search query to obtain search results;
processing, by the server system, the speech correction indication to determine both (i) the portion of the first text that is to be corrected and (ii) the second text to correct the portion of the first text prior to executing the search query to obtain search results, the second text determined based on associating second speech with a second respective plurality of entities and selecting as the second text one of the multiple potential texts generated from the first speech and associated with the respective plurality of entities that best matches the second respective plurality of entities associated with the second speech;
replacing, by the server system, the portion of the first text with the second text to provide a combined text prior to executing the search query to obtain search results; and
providing, by the server system, the combined text for display on the computing device as a revised search query.
2 Assignments
0 Petitions
Accused Products
Abstract
Implementations of the present disclosure include actions of providing first text for display on a computing device of a user, the first text being provided from a first speech recognition engine based on first speech received from the computing device, and being displayed as a search query, receiving a speech correction indication from the computing device, the speech correction indication indicating a portion of the first text that is to be corrected, receiving second speech from the computing device, receiving second text from a second speech recognition engine based on the second speech, the second speech recognition engine being different from the first speech recognition engine, replacing the portion of the first text with the second text to provide a combined text, and providing the combined text for display on the computing device as a revised search query.
183 Citations
15 Claims
-
1. A computer-implemented method executed by a server system comprising one or more computers, the method comprising:
-
providing, by the server system, first text for display on a computing device of a user, the first text being received from a first speech recognition engine, the first speech recognition engine having converted first speech received from the computing device into the first text by processing the first speech to generate multiple potential texts and associating each of the multiple potential texts with a respective plurality of entities, and the first text being displayed as a search query prior to executing the search query to obtain search results; receiving, by the server system, a speech correction indication from the computing device, the speech correction indication (i) initiating a correction of the first text, (ii) providing context to select a portion of the first text that is to be corrected without explicitly indicating the portion of the first text to be corrected and without repeating the first text, and (iii) providing context for selecting second text to correct the portion of the first text without explicitly reciting the second text prior to executing the search query to obtain search results; processing, by the server system, the speech correction indication to determine both (i) the portion of the first text that is to be corrected and (ii) the second text to correct the portion of the first text prior to executing the search query to obtain search results, the second text determined based on associating second speech with a second respective plurality of entities and selecting as the second text one of the multiple potential texts generated from the first speech and associated with the respective plurality of entities that best matches the second respective plurality of entities associated with the second speech; replacing, by the server system, the portion of the first text with the second text to provide a combined text prior to executing the search query to obtain search results; and providing, by the server system, the combined text for display on the computing device as a revised search query. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A system comprising:
-
one or more computers and one or more storage devices on which are stored instructions that are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising; providing first text for display on a computing device of a user, the first text being received from a first speech recognition engine, the first speech recognition engine having converted first speech received from the computing device into the first text by processing the first speech to generate multiple potential texts and associating each of the multiple potential texts with a respective plurality of entities, and the first text being displayed as a search query prior to executing the search query to obtain search results; receiving a speech correction indication from the computing device, the speech correction indication (i) initiating a correction of the first text, (ii) providing context to select a portion of the first text that is to be corrected without explicitly indicating the portion of the first text to be corrected and without repeating the first text, and (iii) providing context for selecting second text to correct the portion of the first text without explicitly reciting the second text prior to executing the search query to obtain search results; processing the speech correction indication to determine both (i) the portion of the first text that is to be corrected and (ii) the second text to correct the portion of the first text prior to executing the search query to obtain search results, the second text determined based on associating second speech with a second respective plurality of entities and selecting as the second text one of the multiple potential texts generated from the first speech and associated with the respective plurality of entities that best matches the second respective plurality of entities associated with the second speech; replacing the portion of the first text with the second text to provide a combined text prior to executing the search query to obtain search results; and providing the combined text for display on the computing device as a revised search query. - View Dependent Claims (7, 8, 9, 10, 11)
-
-
12. A non-transitory computer readable medium storing instructions that, when executed by one or more computers, cause the one or more computers to perform operations comprising:
-
providing first text for display on a computing device of a user, the first text being received from a first speech recognition engine, the first speech recognition engine having converted first speech received from the computing device into the first text by processing the first speech to generate multiple potential texts and associating each of the multiple potential texts with a respective plurality of entities, and the first text being displayed as a search query prior to executing the search query to obtain search results; receiving a speech correction indication from the computing device, the speech correction indication (i) initiating a correction of the first text, (ii) providing context to select a portion of the first text that is to be corrected without explicitly indicating the portion of the first text to be corrected and without repeating the first text, and (iii) providing context for selecting second text to correct the portion of the first text without explicitly reciting the second text prior to executing the search query to obtain search results; processing the speech indication to determine both (i) the portion of the first text that is to be corrected and (ii) the second text to correct the portion of the first text prior to executing the search query to obtain search results, the second text determined based on associating second speech with a second respective plurality of entities and selecting as the second text one of the multiple potential texts generated from the first speech and associated with the respective plurality of entities that best matches the second respective plurality of entities associated with the second speech; replacing the portion of the first text with the second text to provide a combined text prior to executing the search query to obtain search results; and providing the combined text for display on the computing device as a revised search query. - View Dependent Claims (13, 14, 15)
-
Specification