Word-specific acoustic models in a speech recognition system
First Claim
Patent Images
1. An acoustic model in a speech recognition system having a lexicon in which words map to phones modeled in the acoustic model, the acoustic model comprising:
- a plurality of shared phone models modeling a plurality of shared phones used to transcribe words in the lexicon, the shared phone models and shared phones being shared among the words in the lexicon;
a candidate word model modeling a word-specific phone representing a transcription of a portion of a candidate word in the lexicon, the word-specific phone replacing in a transcription of the candidate word one or more of the shared phones, the word-specific phone and the candidate word model being shared by fewer than all words in the lexicon that can be transcribed by the shared phones replaced by the word-specific phone.
2 Assignments
0 Petitions
Accused Products
Abstract
An acoustic model includes word-specific models, that are specific to candidate words. The candidate words would otherwise be mapped to a series of general phones. A sub-series of the general phones representing the candidate word is modeled by a new phone and the new phone is dedicated to the candidate word, or a small group of similar words, but the new phone is not shared among all words that otherwise map to the sub-series of general phones.
-
Citations
22 Claims
-
1. An acoustic model in a speech recognition system having a lexicon in which words map to phones modeled in the acoustic model, the acoustic model comprising:
-
a plurality of shared phone models modeling a plurality of shared phones used to transcribe words in the lexicon, the shared phone models and shared phones being shared among the words in the lexicon; a candidate word model modeling a word-specific phone representing a transcription of a portion of a candidate word in the lexicon, the word-specific phone replacing in a transcription of the candidate word one or more of the shared phones, the word-specific phone and the candidate word model being shared by fewer than all words in the lexicon that can be transcribed by the shared phones replaced by the word-specific phone. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A method of training an acoustic model, comprising:
-
receiving a set of shared phone models and corresponding transcriptions with shared phones; initializing candidate word models each with data corresponding to one or more of the shared phones; and training the candidate word models on fewer than all instances of words that contain the shared phones used to initialize the candidate word models. - View Dependent Claims (15, 16, 17, 18, 19)
-
-
20. A computer readable medium, comprising:
-
an acoustic model in a speech recognition system having a lexicon in which words are transcribed as phones modeled in the acoustic model, the acoustic model comprising; a plurality of shared phone models modeling a plurality of shared phones used to transcribe words in the lexicon, the shared phone models and shared phones being shared among the words in the lexicon; a candidate word model modeling a word-specific phone representing a transcription of a portion of a candidate word in the lexicon, the word-specific phone replacing in a transcription of the candidate word one or more of the shared phones, the word-specific phone and the candidate word model being shared by fewer than all words in the lexicon that would otherwise be transcribed by the shared phones that are replaced by the word-specific phone.
-
-
21. A speech recognition system, comprising:
-
an input receiving a signal indicative of speech; a lexicon including words transcribed by phones; an acoustic model modeling shared phones shared among the words in the lexicon and word-specific phones shared among a selected group of words that would otherwise be lexically transcribed with shared phones; a language model modeling word order; and a decoder coupled to the input, the acoustic model and the language model, recognizing speech represented by the signal. - View Dependent Claims (22)
-
Specification