System and method of developing a TTS voice
First Claim
1. A method of tracking progress in developing a text-to-speech (TTS) voice, the method causing a computing device to perform steps comprising:
- checking a corpus of recorded speech for conformity between the corpus and a text;
- creating, via a processor of the computing device, a tuple of files for each utterance in the corpus, wherein the tuple is used to track work on each utterance for developing the TTS voice; and
- tracking progress of developing the TTS voice with respect to the each utterance using at least the tuple of files created for the each utterance, wherein each tuple comprises automatic speech recognition generated phonemes, pronunciation lists, confidence scores and a progress matrix.
Abstract
Disclosed herein are various aspects of a toolkit used for generating a TTS voice for use in a spoken dialog system. Each embodiment may take the form of a system, a computer-readable medium, or a method for generating the TTS voice. One embodiment of the invention relates to a method of tracking progress in developing a text-to-speech (TTS) voice. The method comprises ensuring that a corpus of recorded speech contains no reading errors and matches an associated written text, creating a tuple for each utterance in the corpus, and tracking progress for each utterance using the tuple. Various parameters may be tracked using the tuple, and the tuple provides a means for multiple workers to efficiently process a database of utterances in preparation for a TTS voice.
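The per-utterance tuple and the progress tracking it supports can be sketched in a few lines of Python. This is only an illustrative sketch, not the patented implementation: the claims describe a tuple of files, while the hypothetical `UtteranceRecord` below keeps the same four components in memory. The stage names in `STAGES`, the word-level comparison in `conforms`, and all function names are assumptions made for the example.

```python
from __future__ import annotations

from dataclasses import dataclass, field

# Hypothetical stage names: the source does not say what the progress matrix holds.
STAGES = ("text_checked", "phonemes_verified", "labels_aligned")


@dataclass
class UtteranceRecord:
    """In-memory stand-in for the per-utterance tuple of files."""

    utterance_id: str
    asr_phonemes: list[str]               # phonemes produced by a speech recognizer
    pronunciations: dict[str, list[str]]  # word -> candidate pronunciations
    confidence_scores: list[float]        # one score per recognized phoneme
    progress: dict[str, bool] = field(
        default_factory=lambda: {stage: False for stage in STAGES}
    )


def normalize(text: str) -> list[str]:
    """Lowercase and replace punctuation with spaces so only words are compared."""
    cleaned = "".join(c if c.isalnum() or c.isspace() else " " for c in text.lower())
    return cleaned.split()


def conforms(recognized: str, script: str) -> bool:
    """Assumed conformity check: the recognized words must equal the script's words."""
    return normalize(recognized) == normalize(script)


def mark_done(record: UtteranceRecord, stage: str) -> None:
    """Record that one stage of work on an utterance is complete."""
    if stage not in record.progress:
        raise KeyError(f"unknown stage: {stage}")
    record.progress[stage] = True


def stage_completion(records: list[UtteranceRecord]) -> dict[str, float]:
    """Fraction of utterances that have completed each stage."""
    total = len(records)
    if total == 0:
        return {stage: 0.0 for stage in STAGES}
    return {
        stage: sum(r.progress[stage] for r in records) / total
        for stage in STAGES
    }


if __name__ == "__main__":
    records = [
        UtteranceRecord("utt001", ["HH", "AH", "L", "OW"], {"hello": ["HH AH L OW"]}, [0.9] * 4),
        UtteranceRecord("utt002", ["B", "AY"], {"bye": ["B AY"]}, [0.8, 0.7]),
    ]
    if conforms("Hello!", "hello"):
        mark_done(records[0], "text_checked")
    print(stage_completion(records))
```

Keeping one record per utterance means several workers can each update their own records independently, and a summary such as `stage_completion` can be recomputed at any time from the records alone.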
12 Claims
1. A method of tracking progress in developing a text-to-speech (TTS) voice, the method causing a computing device to perform steps comprising:
- checking a corpus of recorded speech for conformity between the corpus and a text;
- creating, via a processor of the computing device, a tuple of files for each utterance in the corpus, wherein the tuple is used to track work on each utterance for developing the TTS voice; and
- tracking progress of developing the TTS voice with respect to the each utterance using at least the tuple of files created for the each utterance, wherein each tuple comprises automatic speech recognition generated phonemes, pronunciation lists, confidence scores and a progress matrix.
Dependent claims: 2, 3, 4
5. A non-transitory computer-readable storage medium storing instructions which, when executed by a computing device, cause the computing device to track progress in developing a text-to-speech (TTS) voice, the instructions comprising:
- checking a corpus of recorded speech for conformity between the corpus and a text;
- creating, via a processor, a tuple of files for each utterance in the corpus, wherein the tuple is used to track work on each utterance for developing the TTS voice; and
- tracking progress of developing the TTS voice with respect to the each utterance using at least the tuple of files created for the each utterance, wherein each tuple comprises automatic speech recognition generated phonemes, pronunciation lists, confidence scores and a progress matrix.
Dependent claims: 6, 7, 8
9. A computing device that tracks progress in developing a text-to-speech (TTS) voice, the computing device comprising:
- a processor;
- a module controlling the processor to check a corpus of recorded speech for conformity between the corpus and a text;
- a module controlling the processor to create a tuple of files for each utterance in the corpus, wherein the tuple is used to track work on each utterance for developing the TTS voice; and
- tracking progress of developing TTS voice with respect to the each utterance using at least the tuple of files created for the each utterance, wherein each tuple comprises automatic speech recognition generated phonemes, pronunciation lists, confidence scores and a progress matrix.
Dependent claims: 10, 11, 12
Specification