System and method for repairing a TTS voice database
First Claim
1. A computer implemented method of correcting a database associated with the development of a text-to-speech (TTS) voice, the method comprising:
- generating via a processor a pronunciation dictionary for use with a TTS voice;
generating via the processor a TTS voice to a stage wherein it is prepared to be tested before being deployed;
receiving a single user input to identify all mislabeled phonetic units associated with the TTS voice at the stage wherein it is prepared to be tested before being deployed;
for each identified mislabeled phonetic unit, linking without additional user input to an entry within the pronunciation dictionary to correct the entry; and
deleting, without additional user input, from the pronunciation dictionary utterances and all associated data for unacceptable utterances.
11 Assignments
0 Petitions
Accused Products
Abstract
The present invention provides various elements of a toolkit used for generating a TTS voice for use in a spoken dialog system. The embodiments in each case may be in the form of the system, a computer-readable medium or a method for generating the TTS voice. One embodiment of the invention relates to a method of correcting a database associated with the development of a text-to-speech (TTS) voice. The method comprises generating a pronunciation dictionary for use with a TTS voice, generating a TTS voice to a stage wherein it is prepared to be tested before being deployed, identifying mislabeled phonetic units associated with the TTS voice, for each identified mislabeled phonetic unit, linking to an entry within the pronunciation dictionary to correct the entry and deleting utterances and all associated data for unacceptable utterances.
-
Citations
18 Claims
-
1. A computer implemented method of correcting a database associated with the development of a text-to-speech (TTS) voice, the method comprising:
-
generating via a processor a pronunciation dictionary for use with a TTS voice; generating via the processor a TTS voice to a stage wherein it is prepared to be tested before being deployed; receiving a single user input to identify all mislabeled phonetic units associated with the TTS voice at the stage wherein it is prepared to be tested before being deployed; for each identified mislabeled phonetic unit, linking without additional user input to an entry within the pronunciation dictionary to correct the entry; and deleting, without additional user input, from the pronunciation dictionary utterances and all associated data for unacceptable utterances. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A computing device for correcting a database associated with the development of a text-to-speech (TTS) voice, the computing device comprising:
-
a processor; a module configured to control the processor to generate a pronunciation dictionary for use with a TTS voice; a module configured to control the processor to generate a TTS voice to a stage wherein it is prepared to be tested before being deployed; a module configured to control the processor to receive a single user input to identify all mislabeled phonetic units associated with the TTS voice at the stage wherein it is prepared to be tested before being deployed; a module configured to control the processor, for each identified mislabeled phonetic unit, to link to without additional user input an entry within the pronunciation dictionary to correct the entry; and a module configured to control the processor to delete, without additional user input, from the pronunciation dictionary utterances and all associated data for unacceptable utterances. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. A non-transitory computer-readable storage medium storing instructions for controlling a computing device to correct a database associated with the development of a text-to-speech (TTS) voice, the instructions comprising:
-
generating via a processor a pronunciation dictionary for use with a TTS voice; generating via a processor a TTS voice to a stage wherein it is prepared to be tested before being deployed; receiving a single user input to identify mislabeled all phonetic units associated with the TTS voice at the stage wherein it is prepared to be tested before being deployed; for each identified mislabeled phonetic unit, linking without additional user input to an entry within the pronunciation dictionary to correct the entry; and deleting, without additional user input, from the pronunciation dictionary utterances and all associated data for unacceptable utterances. - View Dependent Claims (14, 15, 16, 17, 18)
-
Specification