Pitch control in artificial speech
First Claim
Patent Images
1. A method of minimizing distortion due to prosody-related pitch changes in artificial speech, comprising the steps of:
- a) digitally storing waveform samples defining pitch-period waveforms for voiced sounds of said artificial speech;
b) dialing out said samples at a selectable rate to generate said artificial speech;
c) deleting selected samples of said waveforms or adding samples to said waveforms to vary the length of said waveforms in order to vary the prosody-related pitch of said speech;
d) smoothing the transitions between said length-varied waveforms; and
e) varying said dialout rate simultaneously with said deletion or addition of samples to further vary the prosody-related pitch of said speech.
3 Assignments
0 Petitions
Accused Products
Abstract
Substantial pitch variations in artificial speech produced by dialing out a sequence of stored digital waveforms are made possible without significant distortion by varying pitch both by truncation or extension of pitch period waveforms, and by varying the dialout rate. In another aspect of the invention, pitch changes are made more natural by distributing each pitch change evenly over a large number of pitch periods during voiced phonemes.
-
Citations
7 Claims
-
1. A method of minimizing distortion due to prosody-related pitch changes in artificial speech, comprising the steps of:
-
a) digitally storing waveform samples defining pitch-period waveforms for voiced sounds of said artificial speech; b) dialing out said samples at a selectable rate to generate said artificial speech; c) deleting selected samples of said waveforms or adding samples to said waveforms to vary the length of said waveforms in order to vary the prosody-related pitch of said speech; d) smoothing the transitions between said length-varied waveforms; and e) varying said dialout rate simultaneously with said deletion or addition of samples to further vary the prosody-related pitch of said speech. - View Dependent Claims (2, 7)
-
-
3. A method of improving the naturalness of pitch changes in artificial speech, comprising the steps of:
-
a) generating a code train containing a sequence of phoneme codes and pitch codes defining, respectively, voiced and unvoiced speech phonemes to be produced and target pitch levels for said voiced phonemes, said voiced phonemes being composed of a large plurality of pitch periods, and each target level being associated with a specific pitch period of a voiced phoneme having a specific sequential relation to the pitch code identifying that target level; b) producing, in accordance with said train of phoneme codes and pitch codes, a train of concatenated waveforms representing pitch period of phonemes defined by said phoneme codes at pitch levels defined by said pitch codes; c) converting said waveform train into artificial speech; d) determining, whenever the pitch level of a pitch period is equal to a target level, the next target level defined by the next pitch code and the number of pitch periods to said specific pitch period associated with said next target level; and e) changing the pitch value of each successive pitch period by an amount appropriate for reaching said next target level at said specific pitch period. - View Dependent Claims (4, 5, 6)
-
Specification