Automatic grammar tuning using statistical language model generation
First Claim
1. A speech processing method comprising acts of:
- utilizing an original speech recognition grammar in a speech recognition system to perform first speech recognition operations for a plurality of recognition instances,the original speech recognition grammar being a grammar-based language model grammar, the first speech recognition operations comprising using the original speech recognition grammar to process first audio data that represents speech utterances;
storing instance data generated based on the first speech recognition operations performed using the original speech recognition grammar;
automatically generating a replacement grammar from the stored instance data, comprising determining, based on at least in part the stored instance data, a number of times at which at least one word or phrase was recognized in the first speech recognition operations, wherein the replacement grammar is a statistical language model grammar;
selectively replacing the original speech recognition grammar in the speech recognition system with the replacement grammar; and
utilizing the replacement grammar to perform second speech recognition operations comprising processing second audio data;
generating additional instance data based on the second speech recognition operations;
tuning the replacement grammar based on the additional instance data.
3 Assignments
0 Petitions
Accused Products
Abstract
The present invention discloses a speech processing solution that utilizes an original speech recognition grammar in a speech recognition system to perform speech recognition operations for multiple recognition instances. Instance data associated with the recognition operations can be stored. A replacement grammar can be automatically generated from the stored instance data, where the replacement grammar is a statistical language model grammar. The original speech recognition grammar, which can be a grammar-based language model grammar or a statistical language model grammar, can be selectively replaced with the replacement grammar. For example when tested performance for the replacement grammar is better than that for the original grammar, the replacement grammar can replace the original grammar.
-
Citations
24 Claims
-
1. A speech processing method comprising acts of:
-
utilizing an original speech recognition grammar in a speech recognition system to perform first speech recognition operations for a plurality of recognition instances, the original speech recognition grammar being a grammar-based language model grammar, the first speech recognition operations comprising using the original speech recognition grammar to process first audio data that represents speech utterances; storing instance data generated based on the first speech recognition operations performed using the original speech recognition grammar; automatically generating a replacement grammar from the stored instance data, comprising determining, based on at least in part the stored instance data, a number of times at which at least one word or phrase was recognized in the first speech recognition operations, wherein the replacement grammar is a statistical language model grammar; selectively replacing the original speech recognition grammar in the speech recognition system with the replacement grammar; and
utilizing the replacement grammar to perform second speech recognition operations comprising processing second audio data;generating additional instance data based on the second speech recognition operations; tuning the replacement grammar based on the additional instance data. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A method comprising acts of:
-
performing a first plurality of speech-to-text operations using an original speech recognition grammar in a speech recognition system, wherein the original speech recognition grammar is a grammar-based language model grammar, and wherein the first plurality of speech-to-text operations comprise using the original speech recognition grammar to process first audio data that represents speech utterances; recording recognition instance data generated based on the first plurality of speech-to-text operations performed using the original speech recognition grammar; automatically creating a set of words and phrases from the recorded recognition instance data; automatically generating a replacement grammar from the set of words and phrases, comprising determining, based at least in part on the recorded recognition instance data, a number of times at which at least one word or phrase of the set of words and phrases was recognized in the first plurality of speech-to-text operations, wherein the replacement grammar is a statistical language model grammar; generating additional instance data based on the second speech recognition operations; tuning the replacement grammar based on the additional instance data. - View Dependent Claims (12, 13)
-
-
14. A speech recognition system comprising:
-
a language model processor configured to utilize an original speech recognition grammar in performing first speech recognition operations comprising using the original speech recognition grammar to process first audio data that represents speech utterances, the original speech recognition grammar being a grammar-based language model grammar; a log data store configured to store speech instance data generated based on the first speech recognition operations performed using the original speech recognition grammar; a statistical language model generator configured to automatically generate a replacement grammar from the speech instance data at least in part by determining, based at least in part on the speech instance data, a number of times at which at least one word or phrase was recognized in the first speech recognition operations; and a grammar swapper configured to selectively replace the original speech recognition grammar with the speech replacement grammar, wherein the language model processor is further configured to utilize the replacement grammar to perform second speech recognition operations comprising processing second audio data; generating additional instance data based on the second speech recognition operations; tuning the replacement grammar based on the additional instance data. - View Dependent Claims (15, 16, 17)
-
-
18. At least one computer readable recording non-transitory medium having encoded thereon instructions that, when executed by at least one processor, perform a speech processing method comprising acts of:
-
utilizing an original speech recognition grammar in a speech recognition system to perform first speech recognition operations for a plurality of recognition instances, the original speech recognition grammar being a grammar-based language model grammar, the first speech recognition operations comprising using the original speech recognition grammar to process first audio data that represents speech utterances; storing instance data generated based on first speech recognition operations performed using the original speech recognition grammar; automatically generating a replacement grammar from the stored instance data, comprising determining, based at least in part on the stored instance data, a number of times at which at least one word or phrase was recognized in the first speech recognition operations, wherein the replacement grammar is a statistical language model grammar; selectively replacing the original speech recognition grammar in the speech recognition system with the replacement grammar; and utilizing the replacement grammar to perform second speech recognition operations comprising processing second audio data; generating additional instance data based on the second speech recognition operations; tuning the replacement grammar based on the additional instance data. - View Dependent Claims (19, 20, 21)
-
-
22. A speech processing method comprising acts:
-
utilizing an original speech recognition grammar in a speech recognition system to perform speech recognition operations for a plurality of recognition instances, the original speech recognition grammar being a grammar-based language model grammar, the speech recognition operations comprising using the original speech recognition grammar to process audio data that represents speech utterances and was not used in generating the original speech recognition grammar; storing instance data generated based on the speech recognition operations performed using the original speech recognition grammar; automatically generating a replacement grammar from the stored instance data, comprising determining, based at least in part on the stored instance data, a number of times at which at least one word or phrase was recognized in the speech recognition operations, wherein the replacement grammar is a statistical language model grammar; and selectively replacing the original speech recognition grammar in the speech recognition system with the replacement grammar; generating additional instance data based on the second speech recognition operations; tuning the replacement grammar based on the additional instance data.
-
-
23. A speech recognition system comprising:
-
a language model processor configured to utilize an original speech recognition grammar in performing speech recognition operations comprising using the original speech recognition grammar to process audio data that represents speech utterances and was not used in generating the original speech recognition grammar, the original speech recognition grammar being a grammar-based language model grammar; a log data store configured to store speech instance data generated based on the speech recognition operations performed using the original speech recognition grammar; a statistical language model generator configured to automatically generate a replacement grammar from the speech instance data at least in part by determining, based at least in part on the speech instance data, a number of times at which at least one word or phrase was recognized in the speech recognition operations; and a grammar swapper configured to selectively replace the original speech recognition grammar with the speech replacement grammar; generating additional instance data based on the second speech recognition operations; tuning the replacement grammar based on the additional instance data.
-
-
24. At least one computer readable recording non-transitory medium having encoded thereon instructions that, when executed by at least one processor, perform speech processing method comprising acts of:
-
utilizing an original speech recognition grammar in a speech recognition system to perform speech recognition operations for a plurality of recognition instances, the original speech recognition grammar being a grammar-based language model grammar, the speech recognition operations comprising using the original speech recognition grammar to process audio data that represents speech utterances and was not used in generating the original speech recognition grammar; storing instance data generated based on the speech recognition operations performed using the original speech recognition grammar; automatically generating a replacement grammar from the stored instance data, comprising determining, based at least in part on the stored instance data, a number of times at which at least one word or phrase was recognized in the speech recognition operations, wherein the replacement grammar is a statistical language model grammar; and selectively replacing the original speech recognition grammar in the speech recognition system with the replacement grammar; generating additional instance data based on the second speech recognition operations; tuning the replacement grammar based on the additional instance data.
-
Specification