Method and apparatus for converting voice characteristics of synthesized speech

  • US 4,624,012 A
  • Filed: 05/06/1982
  • Issued: 11/18/1986
  • Est. Priority Date: 05/06/1982
  • Status: Expired due to Fees
First Claim

1. A text-to-speech synthesis system for producing audible synthesized human speech of any one of a plurality of voice sounds simulating child-like, adult, aged and sexual characteristics from digital characters comprising:

    text reader means adapted to be exposed to text material and responsive thereto for generating information signals indicative of the substantive content thereof;

    converter means for receiving said information signals from said text reader means and generating digital character signals representative thereof;

    means for receiving said digital character signals from said converter means;

    memory means storing digital speech data including digital speech instructional rules and digital speech data representative of sound unit code signals;

    data processing means for searching said digital speech data stored in said memory means to locate digital speech data representative of a sound unit code corresponding to said digital character signals received from said converter means;

    speech memory means storing digital speech data representative of a plurality of sound units;

    concatenating controller means operably coupled to said speech memory means for selectively combining digital speech data representative of a plurality of sound units in a serial sequence to provide concatenated digital speech data representative of a word;

    speech synthesis controller means coupled to said data processing means and to said speech memory means for receiving digital speech signals representative of a sound unit code corresponding to said digital character signals and selectively accessing digital speech data representative of sound units corresponding to said sound unit code from said speech memory means;

    speech synthesizer means operably coupled to said concatenating controller means and said speech synthesis controller means for receiving selectively accessed serial sequences of digital speech data from said concatenating controller means to provide audio signals corresponding thereto and representative of synthesized human speech;

    voice characteristics conversion means interposed between said concatenating controller means and said speech synthesizer means and being coupled therebetween independently of the coupling between said concatenating controller means and said speech synthesizer means, said voice characteristics conversion means being operably coupled to said speech synthesis controller means and being responsive thereto to selectively modify the voice characteristics of said serially sequenced digital speech data output from said concatenating controller means, said voice characteristics conversion means including means for making a voice character selection of the synthesized speech to be derived from the digital speech data as selectively accessed from said speech memory means so as to simulate a voice sound differing in character with respect to the voice sound of the synthesized speech from the digital speech data of said speech memory means in the voice characteristics pertaining to the apparent age and/or sex of the speaker;

    said digital speech data as selectively accessed from said speech memory means having a predetermined pitch period, a predetermined vocal tract model and a predetermined speech rate;

    speech parameter control means for modifying the pitch period and speech rate in response to inputs from said voice character selection means to produce a modified pitch period and a modified speech rate, said speech parameter control means including sample rate control circuit means responsive to inputs from said voice character selection means for adjusting the sampling period of said digital speech data selectively accessed from said speech memory means in a manner altering the digital speech formants contained therein to a preselected degree and providing adjusted sampling period signals as an output;

    speech data reconstructing means operably associated with said speech parameter control means for combining the modified pitch period and the modified speech rate with the predetermined vocal tract model into a synthesized speech data format of speech data modified with respect to the original speech data from said speech memory means;

    said speech synthesizer means being coupled to said speech data reconstructing means and to the output of said sample rate control circuit means for receiving the modified speech data and the adjusted sampling period signals therefrom in providing said audio signals representative of human speech from the modified speech data; and

    audio means coupled to said speech synthesizer means for converting said audio signals into audible synthesized human speech in any one of a plurality of voice sounds from said digital speech data stored in said speech memory means as determined by said voice characteristics conversion means.
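
The voice characteristics conversion recited in the claim can be illustrated with a minimal sketch. It is not the patented implementation. It assumes the stored speech data is a sequence of parametric frames, each carrying a pitch period and a set of vocal tract coefficients, and that a voice character selection maps to three scale factors. The names Frame, VoiceProfile and convert_voice, the scale factors, and the 125 microsecond base sampling period are all hypothetical and do not come from the patent text. The sketch modifies the pitch period and the speech rate, leaves the vocal tract model untouched, and returns an adjusted sampling period alongside the reconstructed frames.

```python
from dataclasses import dataclass, replace
from typing import List, Tuple


@dataclass(frozen=True)
class Frame:
    """One frame of stored speech data (hypothetical layout)."""
    pitch_period: int                     # in samples; 0 marks an unvoiced frame
    vocal_tract: Tuple[float, ...]        # vocal tract model, never modified here


@dataclass(frozen=True)
class VoiceProfile:
    """Hypothetical voice character selection expressed as scale factors."""
    pitch_scale: float          # multiplies the pitch period (< 1 raises pitch)
    rate_scale: float           # multiplies the speech rate (> 1 speaks faster)
    sample_period_scale: float  # multiplies the sampling period (< 1 raises formants)


def convert_voice(
    frames: List[Frame],
    profile: VoiceProfile,
    base_sample_period_us: float = 125.0,  # assumed value, i.e. 8 kHz sampling
) -> Tuple[List[Frame], float]:
    """Return (modified frames, adjusted sampling period in microseconds)."""
    adjusted_period = base_sample_period_us * profile.sample_period_scale
    if not frames:
        return [], adjusted_period

    # Modify the pitch period of voiced frames; unvoiced frames stay at 0.
    pitched = [
        replace(f, pitch_period=max(1, round(f.pitch_period * profile.pitch_scale)))
        if f.pitch_period > 0
        else f
        for f in frames
    ]

    # Modify the speech rate by dropping or repeating whole frames.
    n_out = max(1, round(len(pitched) / profile.rate_scale))
    last = len(pitched) - 1
    rated = [pitched[min(last, int(i * profile.rate_scale))] for i in range(n_out)]

    # "Reconstruction": each output frame pairs the modified pitch and timing
    # with the original, unmodified vocal tract coefficients.
    return rated, adjusted_period
```

In this sketch a profile such as VoiceProfile(0.5, 2.0, 0.8) halves the pitch period, doubles the speech rate, and shortens the sampling period by 20 percent. A real synthesizer would consume the adjusted sampling period as its output clock, which is what shifts the formants.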
