Formant-based speech synthesizer employing demi-syllable concatenation with independent cross fade in the filter parameter and source domains

US 6,144,939 A
Filed: 11/25/1998
Issued: 11/07/2000
Est. Priority Date: 11/25/1998
Status: Expired

First Claim

Patent Images

1. A concatenative speech synthesizer, comprising:

a database containing (a) demi-syllable waveform data associated with a plurality of demi-syllables and (b) filter parameter data associated with said plurality of demi-syllables;

a unit selection system for extracting selected demi-syllable waveform data and filter parameters from said database that correspond to an input string to be synthesized;

a waveform cross fade mechanism for joining pairs of extracted demi-syllable waveform data into syllable waveform signals;

a filter parameter cross fade mechanism for defining a set of syllable-level filter data by interpolating said extracted filter parameters; and

a filter module receptive of said set of syllable-level filter data and operative to process said syllable waveform signals to generate synthesized speech.

View all claims

1 Assignment

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

The concatenative speech synthesizer employs demi-syllable subword units to generate speech. The synthesizer is based on a source-filter model that uses source signals that correspond closely to the human glottal source and that uses filter parameters that correspond closely to the human vocal tract. Concatenation of the demi-syllable units is facilitated by two separate cross fade techniques, one applied in the time domain to the demi-syllable source signal waveforms, and one applied in the frequency domain by interpolating the corresponding filter parameters of the concatenated demi-syllables. The dual cross fade technique results in natural sounding synthesis that avoids time-domain glitches without degrading or smearing characteristic resonances in the filter domain.

198 Citations

7 Claims

1. A concatenative speech synthesizer, comprising:
- a database containing (a) demi-syllable waveform data associated with a plurality of demi-syllables and (b) filter parameter data associated with said plurality of demi-syllables;
  
  a unit selection system for extracting selected demi-syllable waveform data and filter parameters from said database that correspond to an input string to be synthesized;
  
  a waveform cross fade mechanism for joining pairs of extracted demi-syllable waveform data into syllable waveform signals;
  
  a filter parameter cross fade mechanism for defining a set of syllable-level filter data by interpolating said extracted filter parameters; and
  
  a filter module receptive of said set of syllable-level filter data and operative to process said syllable waveform signals to generate synthesized speech.
- View Dependent Claims (2, 3, 4, 5, 6, 7)
- - 2. The synthesizer of claim 1 wherein said waveform cross fade mechanism operates in the time domain.
  - 3. The synthesizer of claim 1 wherein said filter parameter cross fade mechanism operates in the frequency domain.
  - 4. The synthesizer of claim 1 wherein said waveform cross fade mechanism performs a linear cross fade upon two demi-syllables over a predefined duration corresponding to a syllable.
  - 5. The synthesizer of claim 1 wherein said filter parameter cross fade mechanism interpolates between the respective extracted filter parameters of two demi-syllables.
  - 6. The synthesizer of claim 1 wherein said filter parameter cross fade mechanism performs linear interpolation between the respective extracted filter parameters of two demi-syllables.
  - 7. The synthesizer of claim 1 wherein said filter parameter cross fade mechanism performs sigmoidal interpolation between the respective extracted filter parameters of two demi-syllables.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Matsushita Electric Industrial Company Limited (Panasonic Holdings Corporation)
Original Assignee
Matsushita Electric Industrial Company Limited (Panasonic Holdings Corporation)
Inventors
Niedzielski, Nancy, Pearson, Steve, Kibre, Nicholas
Primary Examiner(s)
Smits, Talivaldis I.
Assistant Examiner(s)
Nolan, Daniel A.

Application Number

US09/200,327
Time in Patent Office

713 Days
Field of Search

704/200, 704/258, 704/262, 704/259, 704/265, 704/267, 84/600, 84/51
US Class Current

704/258
CPC Class Codes

G10L 13/07 Concatenation rules

Formant-based speech synthesizer employing demi-syllable concatenation with independent cross fade in the filter parameter and source domains

First Claim

1 Assignment

0 Petitions

Accused Products

Abstract

198 Citations

7 Claims

Specification

Solutions

Use Cases

Quick Links

Formant-based speech synthesizer employing demi-syllable concatenation with independent cross fade in the filter parameter and source domains

First Claim

1 Assignment

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

198 Citations

7 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links