SYSTEM AND METHOD FOR PRONUNCIATION MODELING
First Claim
1. A computer-implemented method of generating a pronunciation model, the method comprising:
- identifying a generic model of speech composed of phonemes;
identifying a family of interchangeable phonemic alternatives for a phoneme in the generic model of speech;
labeling each interchangeable phonemic alternative in the family as referring to the phoneme; and
generating a pronunciation model which substitutes the family of interchangeable phonemic alternatives for the phoneme.
15 Assignments
0 Petitions
Accused Products
Abstract
Disclosed herein are systems, computer-implemented methods, and tangible computer-readable media for generating a pronunciation model. The method includes identifying a generic model of speech composed of phonemes, identifying a family of interchangeable phonemic alternatives for a phoneme in the generic model of speech, labeling the family of interchangeable phonemic alternatives as referring to the same phoneme, and generating a pronunciation model which substitutes each family for each respective phoneme. In one aspect, the generic model of speech is a vocal tract length normalized acoustic model. Interchangeable phonemic alternatives can represent a same phoneme for different dialectal classes. An interchangeable phonemic alternative can include a string of phonemes.
161 Citations
10 Claims
-
1. A computer-implemented method of generating a pronunciation model, the method comprising:
-
identifying a generic model of speech composed of phonemes; identifying a family of interchangeable phonemic alternatives for a phoneme in the generic model of speech; labeling each interchangeable phonemic alternative in the family as referring to the phoneme; and generating a pronunciation model which substitutes the family of interchangeable phonemic alternatives for the phoneme. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A computer-implemented method of recognizing speech using a pronunciation model, the method comprising:
-
identifying a user dialect in the received user speech; selecting a set of phoneme alternatives representing the user dialect from a pronunciation model generated by the steps of; (i) identifying a generic model of speech composed of phonemes; (ii) identifying a family of interchangeable phonemic alternatives for a phoneme in the generic model of speech; (iii) labeling each interchangeable phonemic alternative in the family as referring to the phoneme; and (iv) generating a pronunciation model which substitutes the family of interchangeable phonemic alternatives for the phoneme; recognizing the user speech using the selected set of phoneme alternatives in the pronunciation model. - View Dependent Claims (7, 8, 9, 10)
-
Specification