Unsupervised data-driven pronunciation modeling
Abstract
Pronunciation for an input word is modeled by generating a set of candidate phoneme strings having pronunciations close to the input word in an orthographic space. Phoneme sub-strings in the set are selected as the pronunciation. In one aspect, a first closeness measure between phoneme strings for words chosen from a dictionary and contexts within the input word is used to determine the candidate phoneme strings. The words are chosen from the dictionary based on a second closeness measure between a representation of the input word in the orthographic space and orthographic anchors corresponding to the words in the dictionary. In another aspect, the phoneme sub-strings are selected by aligning the candidate phoneme strings on common phoneme sub-strings to produce an occurrence count, which is used to choose the phoneme sub-strings for the pronunciation.
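The second closeness measure described above, between the input word's representation in the orthographic space and the orthographic anchors of dictionary words, can be illustrated with a minimal sketch. The abstract does not say how the orthographic space is built, so this example assumes plain letter-trigram count vectors compared by cosine similarity. The names `trigram_vector`, `cosine`, `neighborhood`, and `TOY_DICTIONARY`, as well as the phoneme symbols, are hypothetical and chosen only for illustration.

```python
from collections import Counter
from math import sqrt

def trigram_vector(word):
    """Count the letter trigrams of `word`, padded with '#' boundary markers."""
    padded = f"##{word.lower()}##"
    return Counter(padded[i:i + 3] for i in range(len(padded) - 2))

def cosine(a, b):
    """Cosine similarity between two sparse trigram count vectors."""
    dot = sum(count * b[gram] for gram, count in a.items())
    norm = sqrt(sum(c * c for c in a.values())) * sqrt(sum(c * c for c in b.values()))
    return dot / norm if norm else 0.0

def neighborhood(input_word, dictionary, size=2):
    """Return the `size` dictionary entries whose spellings are closest to `input_word`."""
    target = trigram_vector(input_word)
    ranked = sorted(
        dictionary,
        key=lambda word: cosine(target, trigram_vector(word)),
        reverse=True,
    )
    return [(word, dictionary[word]) for word in ranked[:size]]

# Hypothetical toy dictionary: spelling -> phoneme string (assumed symbols).
TOY_DICTIONARY = {
    "nation": "n ey sh ax n",
    "station": "s t ey sh ax n",
    "banana": "b ax n ae n ax",
    "cat": "k ae t",
}

# The phoneme strings of the closest words form the pool from which
# candidate phoneme strings for the input word would be drawn.
closest = neighborhood("ration", TOY_DICTIONARY)
```

In this sketch the phoneme strings attached to the closest spellings play the role of the neighborhood; a real orthographic space would likely use a richer representation than raw trigram counts.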
35 Citations
88 Claims
1. A computerized method comprising:
receiving, by a client from a server, an orthographic space;
generating a set of candidate phoneme strings having pronunciations close to an input word in the orthographic space; and
selecting phoneme sub-strings from the set as a pronunciation for the input word.
Dependent claims: 2, 3, 4, 5, 6
7. A computerized method comprising:
receiving, by a server from a client, an input word and a neighborhood that identifies phoneme strings having pronunciations close to the input word in an orthographic space;
generating a set of candidate phoneme strings for the input word from the phoneme strings;
selecting phoneme sub-strings from the set as a pronunciation for the input word; and
returning the pronunciation to the client.
Dependent claims: 8, 9, 10, 11, 12, 13, 14
15. A computerized method comprising:
receiving, by a client from a server, a neighborhood that identifies phoneme strings having pronunciations close to an input word in an orthographic space;
generating a set of candidate phoneme strings for the input word from the phoneme strings; and
selecting phoneme sub-strings from the set as a pronunciation for the input word.
Dependent claims: 16, 17, 18, 19, 20, 21, 22
23. A machine-readable medium having executable instructions to cause a machine to perform a method comprising:
receiving, by a client from a server, an orthographic space;
generating a set of candidate phoneme strings having pronunciations close to an input word in the orthographic space; and
selecting phoneme sub-strings from the set as a pronunciation for the input word.
Dependent claims: 24, 25, 26, 27, 28
29. A machine-readable medium having executable instructions to cause a machine to perform a method comprising:
receiving, by a server from a client, an input word and a neighborhood that identifies phoneme strings having pronunciations close to the input word in an orthographic space;
generating a set of candidate phoneme strings for the input word from the phoneme strings;
selecting phoneme sub-strings from the set as a pronunciation for the input word; and
returning the pronunciation to the client.
Dependent claims: 30, 31, 32, 33, 34, 35, 36
37. A machine-readable medium having executable instructions to cause a machine to perform a method comprising:
receiving, by a client from a server, a neighborhood that identifies phoneme strings having pronunciations close to an input word in an orthographic space;
generating a set of candidate phoneme strings for the input word from the phoneme strings; and
selecting phoneme sub-strings from the set as a pronunciation for the input word.
Dependent claims: 38, 39, 40, 41, 42, 43, 44
45. An apparatus comprising:
means for receiving, by a client from a server, an orthographic space;
means for generating a set of candidate phoneme strings having pronunciations close to an input word in the orthographic space; and
means for selecting phoneme sub-strings from the set as a pronunciation for the input word.
Dependent claims: 46, 47, 48, 49, 50
51. An apparatus comprising:
means for receiving, by a server from a client, an input word and a neighborhood that identifies phoneme strings having pronunciations close to the input word in an orthographic space;
means for generating a set of candidate phoneme strings for the input word from the phoneme strings;
means for selecting phoneme sub-strings from the set as a pronunciation for the input word; and
means for returning the pronunciation to the client.
Dependent claims: 52, 53, 54, 55, 56, 57, 58
59. An apparatus comprising:
means for receiving, by a client from a server, a neighborhood that identifies phoneme strings having pronunciations close to an input word in an orthographic space;
means for generating a set of candidate phoneme strings for the input word from the phoneme strings; and
means for selecting phoneme sub-strings from the set as a pronunciation for the input word.
Dependent claims: 60, 61, 62, 63, 64, 65, 66
67. A system comprising:
a processor coupled to a memory through a bus;
an interface coupled to the processor through the bus to receive an input word;
a network interface coupled to the processor through the bus to communicate with a server; and
a client process executed from the memory by the processor to cause the processor to receive an orthographic space from the server, generate a set of candidate phoneme strings having pronunciations close to the input word in the orthographic space, and select phoneme sub-strings from the set as a pronunciation for the input word.
Dependent claims: 68, 69, 70, 71, 72
73. A system comprising:
a processor coupled to a memory through a bus;
a network interface coupled to the processor through the bus to communicate with a client; and
a server process executed from the memory by the processor to cause the processor to receive, from the client, an input word and a neighborhood that identifies phoneme strings having pronunciations close to the input word in an orthographic space, generate a set of candidate phoneme strings for the input word from the phoneme strings, select phoneme sub-strings from the set as a pronunciation for the input word, and return the pronunciation to the client.
Dependent claims: 74, 75, 76, 77, 78, 79, 80
81. A system comprising:
a processor coupled to a memory through a bus;
an interface coupled to the processor through the bus to receive an input word;
a network interface coupled to the processor through the bus to communicate with a server; and
a client process executed from the memory by the processor to cause the processor to receive, from the server, a neighborhood that identifies phoneme strings having pronunciations close to an input word in an orthographic space, generate a set of candidate phoneme strings for the input word from the phoneme strings, select phoneme sub-strings from the set as a pronunciation for the input word, and return the pronunciation to the client.
Dependent claims: 82, 83, 84, 85, 86, 87, 88
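The selection step recited in the claims, choosing phoneme sub-strings from the candidate set, is tied in the abstract to an occurrence count over common phoneme sub-strings. The rough sketch below is not the claimed alignment procedure: it swaps alignment for a plain tally of how many candidates contain each contiguous phoneme sub-string. The function names, the `min_len` parameter, and the candidate phoneme strings are all hypothetical.

```python
from collections import Counter

def substring_counts(candidates, min_len=2):
    """For each contiguous phoneme sub-string of at least `min_len` phonemes,
    count how many candidate strings contain it (once per candidate)."""
    counts = Counter()
    for candidate in candidates:
        phonemes = tuple(candidate.split())
        seen = set()
        for start in range(len(phonemes)):
            for end in range(start + min_len, len(phonemes) + 1):
                seen.add(phonemes[start:end])
        counts.update(seen)
    return counts

def most_shared(candidates, min_len=2):
    """Pick the sub-string found in the most candidates, preferring longer ones on ties."""
    counts = substring_counts(candidates, min_len)
    return max(counts, key=lambda sub: (counts[sub], len(sub)))

# Hypothetical candidate phoneme strings for one input word (assumed symbols).
CANDIDATES = ["n ey sh ax n", "s t ey sh ax n", "r ae sh ax n"]

counts = substring_counts(CANDIDATES)
best = most_shared(CANDIDATES)
```

Here the sub-string shared by all three candidates wins over longer sub-strings shared by only two. A fuller implementation would also need to stitch the chosen sub-strings into a complete pronunciation, which this sketch leaves out.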
Specification