SYSTEM AND METHOD FOR TRAINING AN ACOUSTIC MODEL WITH REDUCED FEATURE SPACE VARIATION
First Claim
1. A method of training an acoustic model, the method comprising:
- generating a specific text element set as a subset of a general text element set;
generating a combined phoneme set, the combined phoneme set including renamed specific phonemes corresponding to the specific text element set;
generating a combined dictionary, the combined dictionary including renamed specific text elements from the specific text element set with phonetic spellings including the renamed specific phonemes;
generating a combined transcription set, the combined transcription set including transcriptions with the renamed specific text elements; and
training the acoustic model using the combined phoneme set, the combined dictionary, the combined transcription set and an audio file set.
2 Assignments
0 Petitions
Accused Products
Abstract
Feature space variation associated with specific text elements is reduced by training an acoustic model with a phoneme set, dictionary and transcription set configured to better distinguish the specific text elements and at least some specific phonemes associated therewith. The specific text elements can include the most frequently occurring text elements from a text data set, which can include text data beyond the transcriptions of a training data set. The specific text elements can be identified using a text element distribution table sorted by occurrence within the text data set. Specific phonemes can be limited to consonant phonemes to improve speed and accuracy.
-
Citations
30 Claims
-
1. A method of training an acoustic model, the method comprising:
-
generating a specific text element set as a subset of a general text element set; generating a combined phoneme set, the combined phoneme set including renamed specific phonemes corresponding to the specific text element set; generating a combined dictionary, the combined dictionary including renamed specific text elements from the specific text element set with phonetic spellings including the renamed specific phonemes; generating a combined transcription set, the combined transcription set including transcriptions with the renamed specific text elements; and training the acoustic model using the combined phoneme set, the combined dictionary, the combined transcription set and an audio file set. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. An acoustic model training system comprising:
-
a combined phoneme set including renamed specific phonemes; a combined dictionary including renamed specific text elements with corresponding phonetic spellings using the renamed specific phonemes; an audio file set; a combined transcription set corresponding to the audio file set and including transcriptions with the renamed specific text elements; and a training module configured to train the acoustic model based on the audio file set, the combined transcription set, the combined phoneme set and the combined dictionary. - View Dependent Claims (22, 23, 24, 25)
-
-
26. A method of training an acoustic model, the method comprising:
-
generating a specific word set based on frequency of occurrence within a text data set; generating a phoneme set including renamed specific phonemes used in phonetic spellings of the specific word set; generating a dictionary including renamed specific words of the specific word set with phonetic spellings including the renamed specific phonemes; generating a transcription set including transcriptions having the renamed specific words therein; and training the acoustic model based on the phoneme set, the dictionary, the transcription set and an audio file set. - View Dependent Claims (27, 28, 29, 30)
-
Specification