Method and tool for customization of speech synthesizer databases using hierarchical generalized speech templates

US 6,513,008 B2
Filed: 03/15/2001
Issued: 01/28/2003
Est. Priority Date: 03/15/2001
Status: Expired due to Term

First Claim

Patent Images

1. A speech synthesizer customization system comprising:

a template management tool for generating templates based on customization data from a user and replicated dynamic synthesis data from a text-to-speech synthesizer, the replicated dynamic synthesis data being arranged in a dynamic data structure having hierarchical levels, wherein each template defines a condition under which the template is used to override the speech synthesis data;

a user database supplementing a standard database of the synthesizer;

said tool populating the user database with the templates such that the templates enable the user database to uniformly override subsequently generated speech synthesis data at all hierarchical levels of the dynamic data structure.

View all claims

2 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A speech synthesizer customization system provides a mechanism for generating a hierarchical customized user database. The customization system has a template management tool for generating the templates based on customization data from a user and associated replicated dynamic synthesis data from a text-to-speech (TTS) synthesizer. The replicated dynamic synthesis data is arranged in a dynamic data structure having hierarchical levels. The customization system further includes a user database that supplements a standard database of the synthesizer. The tool populates the user database with the templates such that the templates enable the user database to uniformly override subsequently generated speech synthesis data at all hierarchical levels of the dynamic data structure.

173 Citations

24 Claims

1. A speech synthesizer customization system comprising:
- a template management tool for generating templates based on customization data from a user and replicated dynamic synthesis data from a text-to-speech synthesizer, the replicated dynamic synthesis data being arranged in a dynamic data structure having hierarchical levels, wherein each template defines a condition under which the template is used to override the speech synthesis data;
  
  a user database supplementing a standard database of the synthesizer;
  
  said tool populating the user database with the templates such that the templates enable the user database to uniformly override subsequently generated speech synthesis data at all hierarchical levels of the dynamic data structure.
- View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
- - 2. The customization system of claim 1 wherein each template defines an action to be executed in order to override the speech synthesis data.
  - 3. The customization system of claim 1 wherein the condition corresponds to a hierarchical level of a linguistic tree structure.
  - 4. The customization system of claim 1 wherein the condition corresponds to a hierarchical level of an acoustic tree structure.
  - 5. The customization system of claim 1 wherein the tool includes:
6. The customization system of claim 5 wherein the input interfaces include a command interpreter operatively coupled between a keyboard device input and the template generator.
7. The customization system of claim 5 wherein the input interfaces include a graphics tools module operatively coupled between a mouse device input and the template generator.
8. The customization system of claim 5 wherein the input interfaces include a sound processing module operatively coupled between a microphone device input and the template generator.
9. The customization system of claim 8 wherein the sound processing module includes:
- an input waveform submodule for generating an input waveform based on data obtained from the microphone device input;
  
  a pitch extraction submodule for generating pitch data based on the input waveform;
  
  a formant analysis submodule for generating formant data based on the input waveform; and
  
  a phoneme labeling submodule for automatically labeling phonemes based on the input waveform.

10. A user database comprising:
- a plurality of templates for overriding speech synthesis data of a text-to-speech synthesizer, wherein each template defines a condition under which the template is used to override the speech synthesis data;
  
  said speech synthesis data being arranged in a dynamic data structure having hierarchical levels; and
  
  a hierarchical data structure organizing the templates such that the templates enable the user database to uniformly override subsequently generated speech synthesis data at all hierarchical levels of the dynamic data structure.
- View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22)
- - 11. The user database of claim 10 wherein each template defines a condition under which the template is used to override the speech synthesis data and an action to be executed in order to override data.
  - 12. The user database of claim 10 wherein the condition corresponds to a sentence level of a linguistic tree structure.
  - 13. The user database of claim 10 wherein the condition corresponds to a clause level of a linguistic tree structure.
  - 14. The user database of claim 10 wherein the condition corresponds to a phrase level of a linguistic tree structure.
  - 15. The user database of claim 10 wherein the condition corresponds to a word level of a linguistic tree structure.
  - 16. The user database of claim 10 wherein the condition corresponds to a morpheme level of a linguistic tree structure.
  - 17. The user database of claim 10 wherein the condition corresponds to a phoneme level of a linguistic tree structure.
  - 18. The user database of claim 10 wherein the condition corresponds to an utterance level of an acoustic tree structure.
  - 19. The user database of claim 10 wherein the condition corresponds to a prosodic phrase level of an acoustic tree structure.
  - 20. The user database of claim 10 wherein the condition corresponds to a prosodic word level of an acoustic tree structure.
  - 21. The user database of claim 10 wherein the condition corresponds to a syllable level of an acoustic tree structure.
  - 22. The user database of claim 10 wherein the condition corresponds to an allophone level of an acoustic tree structure.

23. A method for customizing a text-to-speech synthesizer, the method comprising the steps of:
- (a) generating templates based on customization data from a user and replicated dynamic synthesis data from the synthesizer, wherein each template defines a condition under which the template is used to override the dynamic synthesis data and an action to be executed in order to override data;
  
  (b) supplementing a standard database of the synthesizer with a user database; and
  
  (c) populating the user database with the templates such that the templates enable the user database to uniformly override subsequently generated speech synthesis data at a plurality of hierarchical levels of the dynamic data structure.
- View Dependent Claims (24)
- - 24. The method of claim 23 further including the step of iteratively repeating steps (a) through (c) until a desired synthesizer output is obtained.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Panasonic Intellectual Property Corporation of America (Panasonic Holdings Corporation)
Original Assignee
Matsushita Electric Industrial Company Limited (Panasonic Holdings Corporation)
Inventors
Pearson, Steve, Veprek, Peter, Junqua, Jean-Claude
Primary Examiner(s)
Dorvil, Richemond

Application Number

US09/808,132
Publication Number

US 20020133348A1
Time in Patent Office

684 Days
Field of Search

704/260, 704/258, 704/268, 704/267, 704/275, 704/249, 704/254, 704/257, 704/9, 704/207, 704/209, 704/264, 704/270
US Class Current

704/260
CPC Class Codes

G10L 13/033 Voice editing, e.g. manipul...

Method and tool for customization of speech synthesizer databases using hierarchical generalized speech templates

First Claim

2 Assignments

0 Petitions

Accused Products

Abstract

173 Citations

24 Claims

Specification

Solutions

Use Cases

Quick Links

Method and tool for customization of speech synthesizer databases using hierarchical generalized speech templates

First Claim

2 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

173 Citations

24 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links