System and method for effectively implementing an optimized language model for speech recognition
First Claim
1. A system for optimizing speech recognition procedures, comprising:
- initial language models each iteratively created by combining source models according to interpolation coefficients that define proportional relationships for combining said source models;
a speech recognizer that utilizes said initial language models to iteratively process input development data in corresponding ones of said speech recognition procedures for calculating word-error rates that each correspond to a different one of said initial language models; and
an optimized language model selected from said initial language models by identifying an optimal word-error rate from among said word-error rates, said speech recognizer utilizing said optimized language model for performing subsequent ones of said speech recognition procedures.
1 Assignment
0 Petitions
Accused Products
Abstract
A system and method for effectively implementing an optimized language model for speech recognition includes initial language models each created by combining source models according to selectable interpolation coefficients that define proportional relationships for combining the source models. A rescoring module iteratively utilizes the initial language models to process input development data for calculating word-error rates that each correspond to a different one of the initial language models. An optimized language model is then selected from the initial language models by identifying an optimal word-error rate from among the foregoing word-error rates. The speech recognizer may then utilize the optimized language model for effectively performing various speech recognition procedures.
28 Citations
45 Claims
-
1. A system for optimizing speech recognition procedures, comprising:
-
initial language models each iteratively created by combining source models according to interpolation coefficients that define proportional relationships for combining said source models; a speech recognizer that utilizes said initial language models to iteratively process input development data in corresponding ones of said speech recognition procedures for calculating word-error rates that each correspond to a different one of said initial language models; and an optimized language model selected from said initial language models by identifying an optimal word-error rate from among said word-error rates, said speech recognizer utilizing said optimized language model for performing subsequent ones of said speech recognition procedures. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. A method for optimizing speech recognition procedures, comprising:
-
creating initial language models by iteratively combining source models according to interpolation coefficients that define proportional relationships for combining said source models; utilizing said initial language models to iteratively process input development data in corresponding ones of said speech recognition procedures for calculating word-error rates that each correspond to a different one of said initial language models; selecting an optimized language model from said initial language models by identifying an optimal word-error rate from among said word-error rates; and utilizing said optimized language model for performing subsequent ones of said speech recognition procedures with a speech recognizer. - View Dependent Claims (22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 43, 44, 45)
-
-
41. A system for optimizing speech recognition procedures, comprising:
-
means for creating initial language models by iteratively combining source models according to interpolation coefficients that define proportional relationships for combining said source models; means for utilizing said initial language models to iteratively process input development data in corresponding ones of said speech recognition procedures for calculating word-error rates that each correspond to a different one of said initial language models; means for selecting an optimized language model from said initial language models by identifying an optimal word-error rate from among said word-error rates; and means for utilizing said optimized language model for performing subsequent ones of said speech recognition procedures.
-
-
42. A system for optimizing speech recognition procedures, comprising:
-
initial language models each iteratively created by combining source models according to interpolation coefficients that define proportional relationships for combining said source models; a speech recognizer that utilizes said initial language models to iteratively process input development data in corresponding ones of said speech recognition procedures for calculating word-error rates that each correspond to a different one of said initial language models, said word-error rates being calculated by comparing a correct transcription of said input development data and a top recognition candidate from an N-best list that is rescored by a rescoring module for each of said initial language models; and an optimized language model selected from said initial language models by identifying an optimal word-error rate from among said word-error rates, said speech recognizer utilizing said optimized language model for performing subsequent ones of said speech recognition procedures.
-
Specification