Training speaker-dependent, phrase-based speech grammars using an unsupervised automated technique
Abstract
The present invention can include a method for tuning grammar option weights of a phrase-based, automatic speech recognition (ASR) grammar, where the grammar option weights affect which entries within the grammar are matched to spoken utterances. The tuning can occur in an unsupervised fashion, meaning no special training session or manual transcription of data from an ASR session is needed. The method can include the step of selecting a phrase-based grammar to use in a communication session with a user, wherein different phrase-based grammars can be selected for different users. Feedback of ASR phrase processing operations can be recorded during the communication session. Each ASR phrase processing operation can match a spoken utterance against at least one entry within the selected phrase-based grammar. At least one of the grammar option weights can be automatically adjusted based upon the feedback to improve accuracy of the phrase-based grammar.
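As an illustration of how grammar option weights can affect which entry is matched, the following is a minimal sketch. It assumes, purely for illustration, that the recognizer produces a raw score per grammar entry and that each option weight scales that score; the abstract does not specify a scoring rule. The names `GRAMMARS`, `select_grammar`, and `best_match`, and the sample phrases and numbers, are hypothetical.

```python
from typing import Dict

# Hypothetical per-user grammars: each maps a phrase entry to its option weight.
GRAMMARS: Dict[str, Dict[str, float]] = {
    "alice": {"call home": 1.0, "call Rome": 0.4},
    "bob": {"call home": 0.5, "call Rome": 1.2},
}


def select_grammar(user_id: str) -> Dict[str, float]:
    """Pick the phrase-based grammar associated with this user."""
    return GRAMMARS[user_id]


def best_match(raw_scores: Dict[str, float], grammar: Dict[str, float]) -> str:
    """Return the grammar entry with the highest weighted score.

    Assumption: weighted score = raw recognizer score * option weight.
    """
    return max(grammar, key=lambda entry: raw_scores.get(entry, 0.0) * grammar[entry])


# The same acoustically ambiguous utterance can resolve differently per user.
raw = {"call home": 0.55, "call Rome": 0.60}
print(best_match(raw, select_grammar("alice")))
print(best_match(raw, select_grammar("bob")))
```

Under these assumed weights, the higher weight on a user's habitual phrase outweighs a slightly lower raw score, which is the sense in which the weights "affect which entries are matched."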
20 Claims
1. A method for tuning grammar option weights of a phrase-based, automatic speech recognition (ASR) grammar in an unsupervised fashion comprising the steps of:
recording feedback of ASR phrase processing operations during a communication session, wherein each ASR phrase processing operation matches a spoken utterance against at least one entry within a speaker dependent, phrase-based grammar, said grammar having a plurality of grammar option weights, wherein the grammar option weights affect which entries are matched to the spoken utterances;
for each of the ASR phrase processing operations, determining whether the phrase processing operation was successfully performed based upon the feedback; and
for each of the ASR phrase processing operations, automatically adjusting at least one of the grammar option weights based upon results of the determining step to improve accuracy of the grammar.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 12, 13
10. A machine-readable storage having stored thereon, an automatic speech recognition (ASR) computer program having a plurality of code sections, said code sections executable by a machine for causing the machine to perform the steps of:
identifying a phrase-based grammar to use in a communication session with a user, wherein different phrase-based grammars are used for different users;
recording feedback of ASR phrase processing operations during the communication session, wherein each ASR phrase processing operation matches a spoken utterance against at least one entry within the identified phrase-based grammar, said phrase-based grammar having a plurality of grammar option weights, wherein the grammar option weights affect which entries are matched to the spoken utterances; and
automatically adjusting at least one of the grammar option weights based upon the feedback to improve accuracy of the identified phrase-based grammar.
Dependent claims: 11, 14, 15
16. An automatic speech recognition (ASR) system comprising:
an identification unit configured to match a speaker to a speaker-dependent ASR grammar;
an information collection unit configured to gather feedback in real-time concerning whether a plurality of utterances have been correctly processed by the ASR grammar during an ASR session involving the speaker; and
a logic unit configured to utilize said feedback to tune the ASR grammar, wherein when an utterance has been correctly processed, at least one parameter in the ASR grammar is adjusted to increase a likelihood that the ASR system processes phrases in a similar fashion in future ASR operations involving the ASR grammar, and when an utterance has been incorrectly processed, at least one parameter in the ASR grammar is adjusted to decrease a likelihood that the ASR system processes phrases in a similar fashion in future ASR operations involving the ASR grammar.
Dependent claims: 17, 18, 19, 20
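The increase/decrease behavior recited for the logic unit can be sketched as follows. This is an illustration under stated assumptions, not the claimed implementation: the multiplicative step, the `rate` parameter, the `floor` and `ceiling` bounds, the function name `tune_weight`, and the sample feedback are all hypothetical choices that the claims do not specify.

```python
from typing import Dict


def tune_weight(
    grammar: Dict[str, float],
    matched_entry: str,
    correct: bool,
    rate: float = 0.1,
    floor: float = 0.05,
    ceiling: float = 5.0,
) -> None:
    """Adjust one grammar parameter in place from feedback on one utterance.

    Correct processing raises the matched entry's weight, making a similar
    match more likely in future; incorrect processing lowers it.
    Assumption: a fixed multiplicative step, clamped to [floor, ceiling].
    """
    factor = 1.0 + rate if correct else 1.0 - rate
    updated = grammar[matched_entry] * factor
    grammar[matched_entry] = min(ceiling, max(floor, updated))


grammar = {"call home": 1.0, "call Rome": 1.0}
# Hypothetical session feedback: (matched entry, was it processed correctly?)
feedback = [("call home", True), ("call Rome", False), ("call home", True)]
for entry, ok in feedback:
    tune_weight(grammar, entry, ok)
print(grammar)
```

Clamping the weight is a design choice in this sketch: it keeps a run of failures from driving an entry's weight to zero, so the entry remains matchable if the speaker's usage changes.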
Specification