Representation of a deleted interpolation N-gram language model in ARPA standard format
First Claim
Patent Images
1. A method of storing parameters of a deleted interpolation language model, the method comprising:
- obtaining a set of parameters for the deleted interpolation language model, wherein the parameters of the deleted interpolation language model allow an N-gram probability to be determined as a linear interpolation of a relative frequency estimate for the N-gram and a probability for a lower order n-gram; and
storing at least one parameter for the deleted interpolation language model as a parameter for a backoff language model, wherein the backoff language model replaces an N-gram probability with a lower order n-gram and a backoff weight for any N-gram that cannot be located in the backoff language model.
2 Assignments
0 Petitions
Accused Products
Abstract
A method and apparatus are provided for storing parameters of a deleted interpolation language model as parameters of a backoff language model. In particular, the parameters of the deleted interpolation language model are stored in the standard ARPA format. Under one embodiment, the deleted interpolation language model parameters are formed using fractional counts.
-
Citations
20 Claims
-
1. A method of storing parameters of a deleted interpolation language model, the method comprising:
-
obtaining a set of parameters for the deleted interpolation language model, wherein the parameters of the deleted interpolation language model allow an N-gram probability to be determined as a linear interpolation of a relative frequency estimate for the N-gram and a probability for a lower order n-gram; and storing at least one parameter for the deleted interpolation language model as a parameter for a backoff language model, wherein the backoff language model replaces an N-gram probability with a lower order n-gram and a backoff weight for any N-gram that cannot be located in the backoff language model. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A computer-readable storage medium having encoded thereon computer-executable instructions for performing steps comprising:
-
identifying a parameter for a deleted interpolation language model that forms probabilities through interpolations of values; and placing the parameter in a data structure as a backoff parameter for a backoff language model that substitutes a weighted lower order n-gram probability for an N-gram probability when the N-gram cannot be located in the backoff language model. - View Dependent Claims (12, 13, 14, 15, 16, 17)
-
-
18. A method for constructing a language model, the method comprising:
-
using deleted interpolation to train parameters for a language model; storing at least some of the trained parameters in a data structure conforming to the ARPA format for backoff language models. - View Dependent Claims (19, 20)
-
Specification