Adapting enhanced acoustic models
Abstract
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for enhancing speech recognition accuracy. In one aspect, a method includes receiving voice queries, obtaining, for one or more of the voice queries, feedback information that references an action taken by a user that submitted the voice query after reviewing a result of the voice query, generating, for the one or more voice queries, a posterior recognition confidence measure that reflects a probability that the voice query was correctly recognized, wherein the posterior recognition confidence measure is generated based at least on the feedback information for the voice query, selecting a subset of the one or more voice queries based on the posterior recognition confidence measures, and adapting an acoustic model using the subset of the voice queries.
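The scoring and selection steps summarized in the abstract can be illustrated with a short sketch. This is not the patented implementation: the `VoiceQuery` type, the action names, the likelihood-ratio values, and the 0.9 threshold are all hypothetical, and the odds-form Bayes update is just one plausible way to fold feedback into a posterior recognition confidence measure. The acoustic model adaptation itself is not shown; the sketch only picks the queries that would be passed to it.

```python
from dataclasses import dataclass
from typing import List, Optional

# Hypothetical likelihood ratios: how much more likely each implicit action is
# after a correct transcription than after an incorrect one.
# Illustrative values only; they do not come from the patent.
FEEDBACK_LIKELIHOOD_RATIO = {
    "selected_search_result": 4.0,  # user clicked a result for the transcription
    "retyped_query": 0.2,           # user typed a replacement query
    "repeated_voice_query": 0.3,    # user spoke the query again
}


@dataclass
class VoiceQuery:
    user_id: str
    transcription: str
    recognizer_confidence: float           # prior probability of a correct transcription
    feedback_action: Optional[str] = None  # implicit action taken after reviewing it


def posterior_confidence(query: VoiceQuery) -> float:
    """Combine the recognizer's prior with the feedback using odds-form Bayes."""
    # Clamp the prior so the odds stay finite.
    prior = min(max(query.recognizer_confidence, 1e-6), 1 - 1e-6)
    # Unknown or missing feedback leaves the prior unchanged (ratio of 1).
    ratio = FEEDBACK_LIKELIHOOD_RATIO.get(query.feedback_action, 1.0)
    odds = prior / (1 - prior) * ratio
    return odds / (1 + odds)


def select_for_adaptation(queries: List[VoiceQuery], threshold: float = 0.9) -> List[VoiceQuery]:
    """Keep only the queries whose posterior score satisfies the threshold."""
    return [q for q in queries if posterior_confidence(q) >= threshold]


queries = [
    VoiceQuery("u1", "weather in boston", 0.8, "selected_search_result"),
    VoiceQuery("u2", "whether in boss ton", 0.8, "retyped_query"),
    VoiceQuery("u3", "pizza near me", 0.95),
]
selected = select_for_adaptation(queries)
```

With these assumed values, the first user's click on a search result raises a 0.8 prior above the threshold, while the second user's retyped query pushes the same prior well below it. The selected queries would then serve as the training subset for adapting the acoustic model.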
20 Claims
1. A computer-implemented method comprising:
- receiving voice queries submitted by at least a first user and a second user that is different from the first user;
- for each of the voice queries submitted by at least the first user and the second user, generating a score indicative of a probability that a transcription of the voice query is correct, wherein the score is generated based at least on feedback information that indicates an action, other than an explicit selection of the transcription of the voice query, taken by a respective user that submitted the voice query after reviewing the transcription of the voice query, wherein, for at least a particular voice query of the voice queries, the score is generated based on feedback indicating that the user that submitted the particular voice query has selected a search result provided by a search engine, wherein the search engine provided the search result in response to receiving the transcription of the particular voice query as input to the search engine;
- selecting a subset of the voice queries whose scores satisfy a threshold; and
- adapting an acoustic model using the subset of the voice queries.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14
15. A non-transitory computer-readable medium storing software comprising instructions executable by one or more computers which, upon such execution, cause the one or more computers to perform operations comprising:
- receiving voice queries submitted by at least a first user and a second user that is different from the first user;
- for each of the voice queries submitted by at least the first user and the second user, generating a score indicative of a probability that a transcription of the voice query is correct, wherein the score is generated based at least on feedback information that indicates an action, other than an explicit selection of the transcription of the voice query, taken by a respective user that submitted the voice query after reviewing the transcription of the voice query, wherein, for at least a particular voice query of the voice queries, the score is generated based on feedback indicating that the user that submitted the particular voice query has selected a search result provided by a search engine, wherein the search engine provided the search result in response to receiving the transcription of the particular voice query as input to the search engine;
- selecting a subset of the voice queries whose scores satisfy a threshold; and
- adapting an acoustic model using the subset of the voice queries.
Dependent claims: 16, 17
18. A system comprising:
- one or more computers and one or more storage devices storing instructions that are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising:
  - receiving voice queries submitted by at least a first user and a second user that is different from the first user;
  - for each of the voice queries submitted by at least the first user and the second user, generating a score indicative of a probability that a transcription of the voice query is correct, wherein the score is generated based at least on feedback information that indicates an action, other than an explicit selection of the transcription of the voice query, taken by a respective user that submitted the voice query after reviewing the transcription of the voice query, wherein, for at least a particular voice query of the voice queries, the score is generated based on feedback indicating that the user that submitted the particular voice query has selected a search result provided by a search engine, wherein the search engine provided the search result in response to receiving the transcription of the particular voice query as input to the search engine;
  - selecting a subset of the voice queries whose scores satisfy a threshold; and
  - adapting an acoustic model using the subset of the voice queries.
Dependent claims: 19, 20
Specification