System and methods for acoustic and language modeling for automatic speech recognition with large vocabularies
First Claim
1. A method for providing speech recognition, the method comprising the steps of:
partitioning a language vocabulary V of word forms into subsets of word forms based on frequencies of occurrence of the respective word forms;
in at least one of said subsets, splitting word forms having frequencies less than a threshold to thereby generate word form components; and
generating a language component vocabulary VC comprising the word forms and the word form components.
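The three steps of the claim can be illustrated with a minimal Python sketch. It is not the patented implementation: the frequency band edges, the function names, and in particular the fixed-position `split_word_form` rule are hypothetical stand-ins, since the claim does not say how a word form is split or how many subsets are used.

```python
def split_word_form(word, stem_len=4):
    """Hypothetical splitting rule: cut a word form into a stem and a
    '+'-marked ending at a fixed position. The claim does not specify
    how the split is chosen; this is only a placeholder."""
    if len(word) <= stem_len:
        return [word]
    return [word[:stem_len], "+" + word[stem_len:]]


def build_component_vocabulary(counts, band_edges, threshold, split=split_word_form):
    """counts: word form -> frequency of occurrence.
    band_edges: descending frequency cutoffs (assumed) used to partition V.
    threshold: word forms with a frequency below this value are split."""
    # Step 1: partition V into subsets by frequency band (0 = most frequent).
    subsets = [[] for _ in range(len(band_edges) + 1)]
    for word, freq in counts.items():
        band = sum(freq < edge for edge in band_edges)
        subsets[band].append(word)

    # Steps 2 and 3: split the low-frequency word forms and collect the
    # unsplit word forms plus the resulting components into VC.
    vc = set()
    for subset in subsets:
        for word in subset:
            if counts[word] < threshold:
                vc.update(split(word))
            else:
                vc.add(word)
    return subsets, vc


counts = {"the": 1000, "speech": 120, "recognizers": 3, "decoding": 2}
subsets, vc = build_component_vocabulary(counts, band_edges=[100, 10], threshold=10)
```

In this toy run the frequent word forms stay whole, while the two rare ones contribute a stem and an ending each to VC.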
Abstract
A method for generating a language component vocabulary VC for a speech recognition system having a language vocabulary V of a plurality of word forms is disclosed. The method includes: partitioning the language vocabulary V into subsets of word forms based on frequencies of occurrence of the respective word forms; and, in at least one of the subsets, splitting word forms having frequencies less than a threshold to thereby generate word form components. Also disclosed is a method for use in speech recognition including: splitting an acoustic vocabulary comprising baseforms into baseform components and storing the baseform components; and performing sound-to-spelling mapping on the baseform components so as to generate a baseform-components-to-word-parts table for use in subsequent decoding of speech. A method for decoding a speech utterance using language model components and acoustic components includes the steps of: generating from the utterance a stack of baseform component paths; concatenating baseform components in a path to generate concatenated baseforms, when the concatenated baseform components correspond to a baseform found in an acoustic vocabulary; mapping the concatenated baseforms into words; computing language model (LM) scores associated with the words using a language model; and performing further decoding of the utterance based thereupon.
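The concatenation and word-mapping steps of the decoding method can be sketched as follows. This is only an illustration under stated assumptions: baseforms are represented as tuples of phone symbols, only adjacent pairs of components are joined, and the names `concatenate_components`, `acoustic_vocab`, and `baseform_to_word` are hypothetical. Building the stack of paths and computing LM scores are not shown.

```python
def concatenate_components(path, acoustic_vocab, baseform_to_word):
    """Walk one path of baseform components from left to right.
    Two adjacent components are joined only when the joined phone
    sequence is a baseform in the acoustic vocabulary (assumed pairwise
    joining). Each resulting baseform is then mapped to a word; None
    marks a baseform that has no entry in the mapping."""
    words = []
    i = 0
    while i < len(path):
        joined = path[i] + path[i + 1] if i + 1 < len(path) else None
        if joined is not None and joined in acoustic_vocab:
            baseform = joined
            i += 2
        else:
            baseform = path[i]
            i += 1
        words.append(baseform_to_word.get(baseform))
    return words


# Toy data: the phone symbols and entries below are made up for illustration.
acoustic_vocab = {("S", "P", "IY", "CH"), ("DH", "AH")}
baseform_to_word = {("S", "P", "IY", "CH"): "speech", ("DH", "AH"): "the"}
path = [("S", "P", "IY"), ("CH",), ("DH", "AH")]
words = concatenate_components(path, acoustic_vocab, baseform_to_word)
```

The resulting word sequence is what a language model would then score in the later steps of the method.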
250 Citations
37 Claims
1. A method for providing speech recognition, the method comprising the steps of:
partitioning a language vocabulary V of word forms into subsets of word forms based on frequencies of occurrence of the respective word forms;
in at least one of said subsets, splitting word forms having frequencies less than a threshold to thereby generate word form components; and
generating a language component vocabulary VC comprising the word forms and the word form components.
Dependent claims: 6, 7, 9, 11, 12, 13, 19, 20, 21, 22, 23, 24, 26, 27, 28, 29, 35, 36
2-5. (canceled)
8. (canceled)
10. (canceled)
14-18. (canceled)
25. (canceled)
30-34. (canceled)
37-47. (canceled)
Specification