Unsupervised data-driven pronunciation modeling
First Claim
1. A computerized method comprising:
- generating a set of candidate phoneme strings having pronunciations close to an input word in an orthographic space based on a first closeness measure between phoneme strings for words chosen from a dictionary and contexts within the input word, the choice of words from the dictionary based on a second closeness measure between a representation of the input word in the orthographic space and orthographic anchors corresponding to the words in the dictionary; and
selecting phoneme sub-strings from the set as a pronunciation for the input word.
2 Assignments
0 Petitions
Accused Products
Abstract
Pronunciation for an input word is modeled by generating a set of candidate phoneme strings having pronunciations close to the input word in an orthographic space. Phoneme sub-strings in the set are selected as the pronunciation. In one aspect, a first closeness measure between phoneme strings for words chosen from a dictionary and contexts within the input word is used to determine the candidate phoneme strings. The words are chosen from the dictionary based on a second closeness measure between a representation of the input word in the orthographic space and orthographic anchors corresponding to the words in the dictionary. In another aspect, the phoneme sub-strings are selected by aligning the candidate phoneme strings on common phoneme sub-strings to produce an occurrence count, which is used to choose the phoneme sub-strings for the pronunciation.
-
Citations
96 Claims
-
1. A computerized method comprising:
-
generating a set of candidate phoneme strings having pronunciations close to an input word in an orthographic space based on a first closeness measure between phoneme strings for words chosen from a dictionary and contexts within the input word, the choice of words from the dictionary based on a second closeness measure between a representation of the input word in the orthographic space and orthographic anchors corresponding to the words in the dictionary; and selecting phoneme sub-strings from the set as a pronunciation for the input word. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24)
-
-
25. A machine-readable medium having instructions to cause a machine to perform a method comprising:
-
generating a set of candidate phoneme strings having pronunciations close to an input word in an orthographic space based on a first closeness measure between phoneme strings for words chosen from a dictionary and contexts within the input word, the choice of words from the dictionary based on a second closeness measure between a representation of the input word in the orthographic space and orthographic anchors corresponding to the words in the dictionary; and selecting phoneme sub-strings from the set as a pronunciation for the input word. - View Dependent Claims (26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48)
-
-
49. A system comprising:
-
a processing unit coupled to a memory through a bus; and a pronunciation modeling process executed from the memory by the processing unit to cause the processing unit to generate a set of candidate phoneme strings having pronunciations close to an input word in an orthographic space based on a first closeness measure between phoneme strings for words chosen from a dictionary and contexts within the input word, the choice of words from the dictionary based on a second closeness measure between a representation of the input word in the orthographic space and orthographic anchors corresponding to the words in the dictionary, and select a common phoneme sub-string as a pronunciation for each context based on occurrences of the common sub-strings for the context. - View Dependent Claims (50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72)
-
-
73. An apparatus comprising:
-
means for generating a set of candidate phoneme strings having pronunciations close to an input word in an orthographic space based on a first closeness measure between phoneme strings for words chosen from a dictionary and contexts within the input word, the choice of words from the dictionary based on a second closeness measure between a representation of the input word in the orthographic space and orthographic anchors corresponding to the words in the dictionary; and means for selecting phoneme sub-strings from the set as a pronunciation for the input word. - View Dependent Claims (74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96)
-
Specification