Method for prosody generation by unit selection from an imitation speech database
Abstract
A method is provided for prosody generation by unit selection from an imitation speech database. A rule based method of text to speech conversion is used to produce a set of intonation events by selecting syllables on which there would be either a pitch peak or dip (or a combination), and produces the parameters to generate a pitch curve of the event. The synthetic pitch curve shape generated by the rule based method is then utilized to select the best matching units from an imitation speech database of a speaker's prosody, which are then concatenated to produce the final prosody.
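The selection and concatenation steps described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the `ProsodyUnit` type, the fixed-length resampling, and the mean-removed squared-error distance are all assumptions chosen for illustration, since the abstract does not say how the "best matching" unit is scored.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class ProsodyUnit:
    """Hypothetical database entry: one imitated intonation event."""
    label: str
    pitch: List[float]  # pitch (F0) samples in Hz


def resample(curve: Sequence[float], n: int) -> List[float]:
    """Linearly resample a pitch curve to n points."""
    if len(curve) == 1 or n == 1:
        return [curve[0]] * n
    out = []
    for i in range(n):
        pos = i * (len(curve) - 1) / (n - 1)
        lo = int(pos)
        hi = min(lo + 1, len(curve) - 1)
        frac = pos - lo
        out.append(curve[lo] * (1 - frac) + curve[hi] * frac)
    return out


def shape_distance(a: Sequence[float], b: Sequence[float], n: int = 16) -> float:
    """Mean squared difference between two curves after removing each
    curve's mean, so that shape matters rather than pitch level
    (an assumed metric, not taken from the patent)."""
    ra, rb = resample(a, n), resample(b, n)
    ma, mb = sum(ra) / n, sum(rb) / n
    return sum(((x - ma) - (y - mb)) ** 2 for x, y in zip(ra, rb)) / n


def select_unit(target: Sequence[float], database: List[ProsodyUnit]) -> ProsodyUnit:
    """Pick the database unit whose pitch shape is closest to the
    synthetic (rule based) target curve."""
    return min(database, key=lambda u: shape_distance(target, u.pitch))


def generate_prosody(targets: List[Sequence[float]],
                     database: List[ProsodyUnit]) -> List[float]:
    """Select one unit per synthetic intonation event and concatenate
    the selected pitch curves into a final contour."""
    final: List[float] = []
    for target in targets:
        final.extend(select_unit(target, database).pitch)
    return final
```

In this sketch a synthetic peak such as `[110, 150, 110]` selects whichever stored unit has the most similar rise and fall, regardless of its absolute pitch level. A practical system would likely also smooth the joins between concatenated units, which the sketch omits.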
12 Claims
1. A computer implemented method for prosody generation, comprising the steps of:
preparing an imitation speech database using recordings of natural human speech;
converting text to synthesized speech using a rule based speech synthesizer;
selecting prosody units from said imitation speech database to match said synthesized speech; and
concatenating said selected prosody units and generating a final prosody.
Dependent claims: 2, 3, 4, 5
6. A computer implemented method for prosody generation, comprising the steps of:
preparing an imitation speech prosody database including:
converting training text to synthesized speech using a rule based computer synthesizer;
recording human speech imitating said synthesized speech;
time aligning said recorded human speech with said synthesized speech and extracting features from said recorded speech for syllables in which intonation events occur and generating said imitation speech prosody database; and
generating speech prosody from text including:
converting text to synthesized speech using a rule based synthesizer;
selecting prosody units from said imitation speech prosody database to match said synthesized speech; and
concatenating said selected prosody units and generating a final prosody.
Dependent claims: 7, 8, 9, 10, 11
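The database preparation steps of claim 6 (time aligning the recorded imitation with the synthesized speech, then extracting features for syllables that carry intonation events) can be sketched as follows. The claim does not name an alignment technique; dynamic time warping over pitch values is used here purely as an assumption, and the function names, the frame-indexed event spans, and the choice of the raw pitch slice as the extracted "feature" are all hypothetical.

```python
from typing import Dict, List, Tuple


def dtw_path(synth: List[float], rec: List[float]) -> List[Tuple[int, int]]:
    """Align two pitch sequences with dynamic time warping and return
    the warping path as (synth_index, rec_index) pairs."""
    n, m = len(synth), len(rec)
    inf = float("inf")
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(synth[i - 1] - rec[j - 1])
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    # Backtrack from the end of both sequences to recover the path.
    path = []
    i, j = n, m
    while i > 0 and j > 0:
        path.append((i - 1, j - 1))
        steps = [(cost[i - 1][j - 1], i - 1, j - 1),
                 (cost[i - 1][j], i - 1, j),
                 (cost[i][j - 1], i, j - 1)]
        _, i, j = min(steps)
    path.reverse()
    return path


def build_database(synth: List[float], rec: List[float],
                   events: List[Tuple[str, int, int]]) -> Dict[str, List[float]]:
    """For each intonation event, given as (label, start, end) frame
    indices in the synthesized speech (end exclusive), store the slice
    of recorded pitch that the alignment maps onto that span."""
    path = dtw_path(synth, rec)
    database: Dict[str, List[float]] = {}
    for label, start, end in events:
        rec_frames = [j for i, j in path if start <= i < end]
        database[label] = rec[min(rec_frames): max(rec_frames) + 1]
    return database
```

For example, if the synthesized peak spans two frames but the speaker holds the imitated peak for three frames, the alignment maps the two synthetic frames onto all three recorded frames, and those three frames are what gets stored for the event.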
12. A speech generation processor for processing input text to speech, comprising:
an imitation speech database including prosodic units from imitation speech;
a rule based synthesizer module for generating synthesized speech curves for input text;
an imitation speech prosody selection module for selecting prosodic units from said imitation speech database with said synthesized speech curves and concatenating said selected prosodic units together for speech generation; and
an audible device for receiving a speech generation signal from said imitation speech prosody selection module and generating audible speech.
Specification