Adapting enhanced acoustic models
First Claim
1. A system comprising:
- one or more computers; and
a computer-readable medium coupled to the one or more computers having instructions stored thereon which, when executed by the one or more computers, cause the one or more computers to perform operations comprising;
receiving voice queries submitted by different users,obtaining, for each of a plurality of the voice queries, feedback information that references an action taken by a user that submitted the voice query after reviewing a result of the voice query,determining, for each of the plurality of the voice queries, a speech recognizer confidence measure for the voice query,generating, for each of the plurality of the voice queries, a posterior recognition confidence measure that reflects a probability that the voice query was correctly recognized, wherein the posterior recognition confidence measure is generated based at least on the feedback information for the voice query and a speech recognizer confidence measure for the voice query, wherein generating the posterior recognition confidence measures comprises;
generating the posterior recognition confidence measure for at least one of the plurality of the voice queries based on information that identifies that a user revealed an alternates list on a user interface of the mobile device, in which case the posterior recognition confidence measure is adjusted to indicate an increased probability that the voice query was correctly recognized; and
generating the posterior recognition confidence measure for at least one of the plurality of the voice queries based on information that identifies that a user did not reveal an alternates list, in which case the posterior recognition confidence measure is adjusted to indicate a decreased probability that the voice query was correctly recognized;
selecting a subset of the plurality of the voice queries for adapting an acoustic model based on the generated posterior recognition confidence measures, wherein the subset includes voice queries submitted by different users, andadapting the acoustic model using the subset of the voice queries.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for enhancing speech recognition accuracy. In one aspect, a method includes receiving voice queries, obtaining, for one or more of the voice queries, feedback information that references an action taken by a user that submitted the voice query after reviewing a result of the voice query, generating, for the one or more voice queries, a posterior recognition confidence measure that reflects a probability that the voice query was correctly recognized, wherein the posterior recognition confidence measure is generated based at least on the feedback information for the voice query, selecting a subset of the one or more voice queries based on the posterior recognition confidence measures, and adapting an acoustic model using the subset of the voice queries.
-
Citations
22 Claims
-
1. A system comprising:
-
one or more computers; and a computer-readable medium coupled to the one or more computers having instructions stored thereon which, when executed by the one or more computers, cause the one or more computers to perform operations comprising; receiving voice queries submitted by different users, obtaining, for each of a plurality of the voice queries, feedback information that references an action taken by a user that submitted the voice query after reviewing a result of the voice query, determining, for each of the plurality of the voice queries, a speech recognizer confidence measure for the voice query, generating, for each of the plurality of the voice queries, a posterior recognition confidence measure that reflects a probability that the voice query was correctly recognized, wherein the posterior recognition confidence measure is generated based at least on the feedback information for the voice query and a speech recognizer confidence measure for the voice query, wherein generating the posterior recognition confidence measures comprises; generating the posterior recognition confidence measure for at least one of the plurality of the voice queries based on information that identifies that a user revealed an alternates list on a user interface of the mobile device, in which case the posterior recognition confidence measure is adjusted to indicate an increased probability that the voice query was correctly recognized; and generating the posterior recognition confidence measure for at least one of the plurality of the voice queries based on information that identifies that a user did not reveal an alternates list, in which case the posterior recognition confidence measure is adjusted to indicate a decreased probability that the voice query was correctly recognized; selecting a subset of the plurality of the voice queries for adapting an acoustic model based on the generated posterior recognition confidence measures, wherein the subset includes voice queries submitted by different users, and adapting the acoustic model using the subset of the voice queries. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16)
-
-
17. A non-transitory computer readable storage medium encoded with a computer program, the program comprising instructions that when executed by one or more computers cause the one or more computers to perform operations comprising:
-
receiving voice queries submitted by different users, obtaining, for each of a plurality of the voice queries, feedback information that references an action taken by a user that submitted the voice query after reviewing a result of the voice query, determining, for each of the plurality of the voice queries, a speech recognizer confidence measure for the voice query, generating, for each of the plurality of the voice queries, a posterior recognition confidence measure that reflects a probability that the voice query was correctly recognized, wherein the posterior recognition confidence measure is generated based at least on the feedback information for the voice query and a speech recognizer confidence measure for the voice query, wherein generating the posterior recognition confidence measures comprises; generating the posterior recognition confidence measure for at least one of the plurality of the voice queries based on information that identifies that a user revealed an alternates list on a user interface of the mobile device, in which case the posterior recognition confidence measure is adjusted to indicate an increased probability that the voice query was correctly recognized; and generating the posterior recognition confidence measure for at least one of the plurality of the voice queries based on information that identifies that a user did not reveal an alternates list, in which case the posterior recognition confidence measure is adjusted to indicate a decreased probability that the voice query was correctly recognized; selecting a subset of the plurality of the voice queries for adapting an acoustic model based on the generated posterior recognition confidence measures, wherein the subset includes voice queries submitted by different users, and adapting the acoustic model using the subset of the voice queries.
-
-
18. A computer-implemented method comprising:
-
receiving voice queries submitted by different users, obtaining, for each of a plurality of the voice queries, feedback information that references an action taken by a user that submitted the voice query after reviewing a result of the voice query, determining, for each of the plurality of the voice queries, a speech recognizer confidence measure for the voice query, generating, for each of the plurality of the voice queries, a posterior recognition confidence measure that reflects a probability that the voice query was correctly recognized, wherein the posterior recognition confidence measure is generated based at least on the feedback information for the voice query and a speech recognizer confidence measure for the voice query, wherein generating the posterior recognition confidence measures comprises; generating the posterior recognition confidence measure for at least one of the plurality of the voice queries based on information that identifies that a user revealed an alternates list on a user interface of the mobile device, in which case the posterior recognition confidence measure is adjusted to indicate an increased probability that the voice query was correctly recognized; and generating the posterior recognition confidence measure for at least one of the plurality of the voice queries based on information that identifies that a user did not reveal an alternates list, in which case the posterior recognition confidence measure is adjusted to indicate a decreased probability that the voice query was correctly recognized; selecting a subset of the plurality of the voice queries for adapting an acoustic model based on the generated posterior recognition confidence measures, wherein the subset includes voice queries submitted by different users, and adapting the acoustic model using the subset of the voice queries. - View Dependent Claims (19, 20)
-
-
21. A system comprising:
-
one or more computers; and a computer-readable medium coupled to the one or more computers having instructions stored thereon which, when executed by the one or more computers, cause the one or more computers to perform operations comprising; receiving voice queries submitted by different users, determining, for each of a plurality of the voice queries, a speech recognizer confidence measure for the voice query, obtaining, for each of the plurality of the voice queries, feedback information that references an action taken by a user that submitted the voice query after reviewing a result of the voice query, determining, for each of the plurality of the voice queries and based on the feedback information, whether the user took a particular action after receiving the result of the voice query, generating, for each of the plurality of the voice queries, a posterior recognition confidence measure that reflects a probability that the voice query was correctly recognized, wherein the posterior recognition confidence measure is based at least on a speech recognizer confidence measure for the voice query, wherein generating the posterior recognition confidence measures comprises; generating the posterior recognition confidence measure for at least one of the plurality of the voice queries based on information that identifies that a user revealed an alternates list on a user interface of the mobile device, in which case the posterior recognition confidence measure is adjusted to indicate an increased probability that the voice query was correctly recognized; and generating the posterior recognition confidence measure for at least one of the plurality of the voice queries based on information that identifies that a user did not reveal an alternates list, in which case the posterior recognition confidence measure is adjusted to indicate a decreased probability that the voice query was correctly recognized; selecting an acoustic model adaption subset of the plurality of voice queries for which the user is determined to have taken the particular action and for which the generated posterior recognition confidence measure satisfies a predetermined threshold, the acoustic model adaption subset comprising voice queries from different users, and adapting an acoustic model using the acoustic model adaptation subset of the voice queries. - View Dependent Claims (22)
-
Specification