Method of determining parameters of a statistical language model
First Claim
1. A method of determining parameters of a statistical language model for automatic speech recognition systems using a training corpus, comprising the steps of:
- combining at least a portion of elements of a vocabulary to form context-independent vocabulary element classes, wherein the context-independent vocabulary elements are independent of adjoining elements of a context-independent vocabulary element class;
evaluating frequencies of occurrence of vocabulary element sequences, and any of the frequencies of occurrence of derived sequences formed from the vocabulary element sequences by replacement of at least one vocabulary element by an associated vocabulary element class; and
deriving the parameters of the language model from the evaluated frequencies of occurrence.
2 Assignments
0 Petitions
Accused Products
Abstract
An apparatus and method of determining parameters of a statistical language model for automatic speech recognition systems using a training corpus are disclosed. To improve the perplexity and the error rate in the speech recognition, at least a proportion of the elements of a vocabulary used is combined so as to form context-independent vocabulary element categories. The frequencies of occurrence of vocabulary element sequences, and if applicable, the frequencies of occurrence of derived sequences formed from the vocabulary element sequences through the replacement of at least one vocabulary element by the associated vocabulary element class, are evaluated in the language modeling process. The parameters of the language model are then derived from the evaluated frequencies of occurence.
-
Citations
5 Claims
-
1. A method of determining parameters of a statistical language model for automatic speech recognition systems using a training corpus, comprising the steps of:
-
combining at least a portion of elements of a vocabulary to form context-independent vocabulary element classes, wherein the context-independent vocabulary elements are independent of adjoining elements of a context-independent vocabulary element class;
evaluating frequencies of occurrence of vocabulary element sequences, and any of the frequencies of occurrence of derived sequences formed from the vocabulary element sequences by replacement of at least one vocabulary element by an associated vocabulary element class; and
deriving the parameters of the language model from the evaluated frequencies of occurrence. - View Dependent Claims (2, 3, 4, 5)
-
Specification