Method and tool for customization of speech synthesizer databases using hierarchical generalized speech templates
First Claim
1. A speech synthesizer customization system comprising:
- a template management tool for generating templates based on customization data from a user and replicated dynamic synthesis data from a text-to-speech synthesizer, the replicated dynamic synthesis data being arranged in a dynamic data structure having hierarchical levels, wherein each template defines a condition under which the template is used to override the speech synthesis data;
a user database supplementing a standard database of the synthesizer;
said tool populating the user database with the templates such that the templates enable the user database to uniformly override subsequently generated speech synthesis data at all hierarchical levels of the dynamic data structure.
2 Assignments
0 Petitions
Accused Products
Abstract
A speech synthesizer customization system provides a mechanism for generating a hierarchical customized user database. The customization system has a template management tool for generating the templates based on customization data from a user and associated replicated dynamic synthesis data from a text-to-speech (TTS) synthesizer. The replicated dynamic synthesis data is arranged in a dynamic data structure having hierarchical levels. The customization system further includes a user database that supplements a standard database of the synthesizer. The tool populates the user database with the templates such that the templates enable the user database to uniformly override subsequently generated speech synthesis data at all hierarchical levels of the dynamic data structure.
173 Citations
24 Claims
-
1. A speech synthesizer customization system comprising:
-
a template management tool for generating templates based on customization data from a user and replicated dynamic synthesis data from a text-to-speech synthesizer, the replicated dynamic synthesis data being arranged in a dynamic data structure having hierarchical levels, wherein each template defines a condition under which the template is used to override the speech synthesis data;
a user database supplementing a standard database of the synthesizer;
said tool populating the user database with the templates such that the templates enable the user database to uniformly override subsequently generated speech synthesis data at all hierarchical levels of the dynamic data structure. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
a template generator for processing the replicated dynamic synthesis data based on the customization data;
an output interface for graphically displaying the replicated dynamic synthesis data to the user; and
one or more input interfaces for obtaining the customization data from the user.
-
-
6. The customization system of claim 5 wherein the input interfaces include a command interpreter operatively coupled between a keyboard device input and the template generator.
-
7. The customization system of claim 5 wherein the input interfaces include a graphics tools module operatively coupled between a mouse device input and the template generator.
-
8. The customization system of claim 5 wherein the input interfaces include a sound processing module operatively coupled between a microphone device input and the template generator.
-
9. The customization system of claim 8 wherein the sound processing module includes:
-
an input waveform submodule for generating an input waveform based on data obtained from the microphone device input;
a pitch extraction submodule for generating pitch data based on the input waveform;
a formant analysis submodule for generating formant data based on the input waveform; and
a phoneme labeling submodule for automatically labeling phonemes based on the input waveform.
-
-
10. A user database comprising:
-
a plurality of templates for overriding speech synthesis data of a text-to-speech synthesizer, wherein each template defines a condition under which the template is used to override the speech synthesis data;
said speech synthesis data being arranged in a dynamic data structure having hierarchical levels; and
a hierarchical data structure organizing the templates such that the templates enable the user database to uniformly override subsequently generated speech synthesis data at all hierarchical levels of the dynamic data structure. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22)
-
-
23. A method for customizing a text-to-speech synthesizer, the method comprising the steps of:
-
(a) generating templates based on customization data from a user and replicated dynamic synthesis data from the synthesizer, wherein each template defines a condition under which the template is used to override the dynamic synthesis data and an action to be executed in order to override data;
(b) supplementing a standard database of the synthesizer with a user database; and
(c) populating the user database with the templates such that the templates enable the user database to uniformly override subsequently generated speech synthesis data at a plurality of hierarchical levels of the dynamic data structure. - View Dependent Claims (24)
-
Specification