
Text-to-speech synthesizer having formant-rule and speech-parameter synthesis modes

  • US 5,204,905 A
  • Filed: 05/29/1990
  • Issued: 04/20/1993
  • Est. Priority Date: 05/29/1989
  • Status: Expired due to Fees
First Claim

1. A text-to-speech synthesizer comprising:

  • analyzer means for decomposing a sequence of input characters into phoneme components and classifying the decomposed phoneme components as a first group of phoneme components if each phoneme component is to be synthesized by a speech parameter and classifying said phoneme components as a second group of phoneme components if each phoneme component is to be synthesized by a formant rule;

    first memory means for storing speech parameters derived from natural human speech, said speech parameters corresponding to the phoneme components of said first group and being retrievable from said first memory means in response to each of the phoneme components of the first group;

    second memory means for storing formant rules for generating formant transition patterns, said formant rules corresponding to the phoneme components of said second group and being retrievable from said second memory means in response to each of the phoneme components of the second group;

    means for retrieving a speech parameter from said first memory means in response to one of the phoneme components of the first group;

    means for retrieving a formant rule from said second memory means in response to one of said phoneme components of the second group and deriving a formant transition pattern from the retrieved formant rule;

    parameter converter means for converting a formant of said derived formant transition pattern into a corresponding speech parameter; and

    speech synthesizer means for synthesizing a human speech utterance from the speech parameter retrieved from said first memory means and synthesizing a human speech utterance from the speech parameter converted by said parameter converter means, wherein said speech parameters stored in said first memory means are represented by auto-regressive (AR) parameters, and said formant of said derived formant transition patterns are represented by frequency and bandwidth values, wherein said parameter converter means comprises:

    means for converting the frequency value of said formant into a value equal to C = cos(2πF/fs), where F is said frequency value and fs represents a sampling frequency, and converting the bandwidth value of said formant into a value equal to R = exp(-πB/fs), where B is the bandwidth value;

    means for generating a first signal representative of a value 2×C×R and a second signal representative of a value R²;

    unit impulse generator for generating a unit impulse; and

    a series of second-order transversal filters connected in series from said unit impulse generator to said speech synthesizer means, each of said second-order transversal filters including a tapped delay line, first and second tap-weight multipliers connected respectively to successive taps of said tapped delay line, and an adder for summing the outputs of said multipliers with said unit impulse, said first and second multipliers multiplying signals at said successive taps with said first and second signals, respectively.
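
The parameter converter recited above can be illustrated with a minimal sketch. It assumes that the first signal is 2×C×R, that each second-order transversal section computes y[n] = x[n] - (2CR)·x[n-1] + R²·x[n-2] (the claim does not state the signs, so the minus sign on the first tap is an assumption taken from the usual two-pole resonator form), and that the cascade output for a unit impulse is read as the AR coefficient sequence. The function names and the example formant values are hypothetical and do not come from the patent.

```python
import math


def formant_to_signals(freq_hz, bandwidth_hz, fs_hz):
    """Map one formant (F, B) to the two tap weights (2*C*R, R**2)."""
    c = math.cos(2.0 * math.pi * freq_hz / fs_hz)   # C = cos(2*pi*F/fs)
    r = math.exp(-math.pi * bandwidth_hz / fs_hz)   # R = exp(-pi*B/fs)
    return 2.0 * c * r, r * r


def transversal_section(x, first, second):
    """One second-order transversal (FIR) section.

    Assumed sign convention: y[n] = x[n] - first*x[n-1] + second*x[n-2].
    """
    y = []
    d1 = d2 = 0.0  # two-tap delay line: x[n-1] and x[n-2]
    for sample in x:
        y.append(sample - first * d1 + second * d2)
        d2, d1 = d1, sample
    return y


def formants_to_ar(formants, fs_hz):
    """Drive a cascade of sections with a unit impulse.

    `formants` is a list of (frequency_hz, bandwidth_hz) pairs. The returned
    list holds the 2*len(formants) + 1 coefficients of the product polynomial,
    interpreted here as AR coefficients with a leading 1.
    """
    length = 2 * len(formants) + 1
    signal = [1.0] + [0.0] * (length - 1)  # unit impulse
    for freq_hz, bandwidth_hz in formants:
        first, second = formant_to_signals(freq_hz, bandwidth_hz, fs_hz)
        signal = transversal_section(signal, first, second)
    return signal


if __name__ == "__main__":
    # Hypothetical formant values at an 8 kHz sampling frequency.
    print(formants_to_ar([(500.0, 100.0), (1500.0, 200.0)], 8000.0))
```

Because each section is a three-tap filter, cascading them on a unit impulse amounts to multiplying the per-formant second-order polynomials, which is why the output can be read directly as one coefficient set for the synthesizer.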
