SYSTEM AND METHOD FOR USER-SPECIFIED PRONUNCIATION OF WORDS FOR SPEECH SYNTHESIS AND RECOGNITION
Abstract
The method is performed at an electronic device with one or more processors and memory storing one or more programs for execution by the one or more processors. A first speech input including at least one word is received. A first phonetic representation of the at least one word is determined, the first phonetic representation comprising a first set of phonemes selected from a speech recognition phonetic alphabet. The first set of phonemes is mapped to a second set of phonemes to generate a second phonetic representation, where the second set of phonemes is selected from a speech synthesis phonetic alphabet. The second phonetic representation is stored in association with a text string corresponding to the at least one word.
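The mapping and storage steps described in the abstract can be illustrated with a minimal Python sketch. It is only an illustration under stated assumptions, not the patented implementation: the phoneme symbols, the `RECOGNITION_TO_SYNTHESIS` table, the `map_phonemes` helper, and the `PronunciationStore` class are all hypothetical names invented here, and the speech recognizer that would produce the first phonetic representation is assumed to exist elsewhere.

```python
from typing import Dict, List, Optional

# Hypothetical table: each symbol of a (made-up) speech recognition phonetic
# alphabet maps to one or more symbols of a (made-up) speech synthesis alphabet.
RECOGNITION_TO_SYNTHESIS: Dict[str, List[str]] = {
    "f": ["F"],
    "ih": ["IH"],
    "l": ["L"],
    "iy": ["IY"],
    "p": ["P"],
}


def map_phonemes(recognized: List[str]) -> List[str]:
    """Map a first set of phonemes (recognition alphabet) to a second set
    of phonemes (synthesis alphabet)."""
    mapped: List[str] = []
    for phoneme in recognized:
        if phoneme not in RECOGNITION_TO_SYNTHESIS:
            raise KeyError(f"no synthesis mapping for phoneme {phoneme!r}")
        mapped.extend(RECOGNITION_TO_SYNTHESIS[phoneme])
    return mapped


class PronunciationStore:
    """Stores synthesis-alphabet pronunciations keyed by a text string."""

    def __init__(self) -> None:
        self._entries: Dict[str, List[str]] = {}

    def learn(self, text: str, recognized: List[str]) -> List[str]:
        # Generate the second phonetic representation and store it in
        # association with the text string (case-insensitive key).
        synthesis = map_phonemes(recognized)
        self._entries[text.lower()] = synthesis
        return synthesis

    def lookup(self, text: str) -> Optional[List[str]]:
        # Return the stored pronunciation, or None if nothing was learned.
        return self._entries.get(text.lower())


store = PronunciationStore()
store.learn("Philippe", ["f", "ih", "l", "iy", "p"])
```

In this sketch a later synthesis step would call `store.lookup("Philippe")` to retrieve the stored phonemes instead of deriving a pronunciation from the spelling. Raising an error on an unmapped phoneme is a design choice made here for clarity; the abstract does not say how such cases are handled.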
38 Claims
-
1-9. (canceled)
-
10. A method for learning word pronunciations, comprising:
at an electronic device with one or more processors and memory storing one or more programs for execution by the one or more processors:
detecting an error in a speech based interaction with a digital assistant;
in response to detecting the error, receiving a speech input from a user, the speech input including a pronunciation of one or more words; and
storing the pronunciation in association with a text string corresponding to the one or more words.
Dependent claims: 11, 12, 13, 14
-
15-28. (canceled)
-
29. A non-transitory computer readable storage medium storing one or more programs, the one or more programs comprising instructions, which when executed by an electronic device, cause the device to:
detect an error in a speech based interaction with a digital assistant;
in response to detecting the error, receive a speech input from a user, the speech input including a pronunciation of one or more words; and
store the pronunciation in association with a text string corresponding to the one or more words.
Dependent claims: 30, 31, 32, 33
-
34. An electronic device, comprising:
one or more processors; and
memory storing one or more programs, the one or more programs including instructions, which when executed by the one or more processors, cause the one or more processors to:
detect an error in a speech based interaction with a digital assistant;
in response to detecting the error, receive a speech input from a user, the speech input including a pronunciation of one or more words; and
store the pronunciation in association with a text string corresponding to the one or more words.
Dependent claims: 35, 36, 37, 38
-
Specification