System and method for preparing a pronunciation dictionary for a text-to-speech voice
First Claim
1. A computer-implemented method of generating a database for a text-to-speech (TTS) voice, the method comprising:
- matching via a processor every spoken word associated with a TTS voice database with a smallest set of possible pronunciations for each word, the smallest set being generated by;
automatically via the processor determining a dialect and linguistic context using linguistic rules;
empirically determining idiosyncratic speaker characteristics; and
determining a subject domain; and
dynamically generating a pronunciation dictionary on a word-by-word basis using the smallest set.
11 Assignments
0 Petitions
Accused Products
Abstract
Disclosed are various elements of a toolkit used for generating a TTS voice for use in a spoken dialog system. The embodiments in each case may be in the form of the system, a computer-readable medium or a method for generating the TTS voice. One embodiment of the invention relates to a method of generating a database for a TTS voice. The method comprises matching every spoken word associated with a TTS voice database with a smallest set of possible pronunciations for each word. The smallest set is generated by automatically determining a dialect and linguistic context using linguistic rules, empirically determining idiosyncratic speaker characteristics and determining a subject domain. The method further comprises dynamically generating a pronunciation dictionary on a word-by-word basis using the smallest set.
46 Citations
20 Claims
-
1. A computer-implemented method of generating a database for a text-to-speech (TTS) voice, the method comprising:
-
matching via a processor every spoken word associated with a TTS voice database with a smallest set of possible pronunciations for each word, the smallest set being generated by; automatically via the processor determining a dialect and linguistic context using linguistic rules; empirically determining idiosyncratic speaker characteristics; and determining a subject domain; and dynamically generating a pronunciation dictionary on a word-by-word basis using the smallest set. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 15)
-
-
10. A computing device for generating a database for a text-to-speech (TTS) voice, the computing device comprising:
-
a module configured to control the processor to match every spoken word associated with a TTS voice database with a smallest set of possible pronunciations for each word, the smallest set being generated by; automatically via the processor determining a dialect and linguistic context using linguistic rules; empirically determining idiosyncratic speaker characteristics; and determining a subject domain; and a module configured to control the processor to dynamically generate a pronunciation dictionary on a word-by-word basis using the smallest set. - View Dependent Claims (11, 12, 13, 14, 16, 17)
-
-
18. A tangible computer-readable storage medium storing instructions for controlling a computing device for generating a database for a text-to-speech (TTS) voice, the instructions comprising:
-
matching via a processor every spoken word associated with a TTS voice database with a smallest set of possible pronunciations for each word, the smallest set being generated by; automatically determining a dialect and linguistic context using linguistic rules; empirically determining idiosyncratic speaker characteristics; and determining a subject domain; and dynamically generating a pronunciation dictionary on a word-by-word basis using the smallest set. - View Dependent Claims (19, 20)
-
Specification