Pitch control in artificial speech

US 5,163,110 A
Filed: 08/13/1990
Issued: 11/10/1992
Est. Priority Date: 08/13/1990
Status: Expired due to Fees

Abstract

Substantial pitch variations in artificial speech produced by dialing out a sequence of stored digital waveforms are made possible without significant distortion by varying pitch both by truncation or extension of pitch period waveforms, and by varying the dialout rate. In another aspect of the invention, pitch changes are made more natural by distributing each pitch change evenly over a large number of pitch periods during voiced phonemes.

7 Claims

1. A method of minimizing distortion due to prosody-related pitch changes in artificial speech, comprising the steps of:

a) digitally storing waveform samples defining pitch-period waveforms for voiced sounds of said artificial speech;

b) dialing out said samples at a selectable rate to generate said artificial speech;

c) deleting selected samples of said waveforms or adding samples to said waveforms to vary the length of said waveforms in order to vary the prosody-related pitch of said speech;

d) smoothing the transitions between said length-varied waveforms; and

e) varying said dialout rate simultaneously with said deletion or addition of samples to further vary the prosody-related pitch of said speech.

2. The method of claim 1, in which said deleting or adding is done only in the most quiet portion of each of said waveforms.

7. The method of claim 1, in which substantially one third of each pitch variation is produced by varying said dialout rate, and two thirds are produced by deleting or adding samples.
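The combination recited in claims 1, 2 and 7 can be illustrated with a minimal Python sketch. It is not the patented implementation: all function names are hypothetical, the one-third/two-thirds split is assumed to be applied in the log domain (so the two factors multiply back to the requested pitch ratio), the "most quiet portion" is assumed to be the window with the smallest summed absolute amplitude, and the smoothing of step d) is not shown.

```python
def split_pitch_change(pitch_ratio, rate_share=1 / 3):
    """Split a pitch ratio into (dialout-rate factor, length factor).

    Assumption: the split is multiplicative, so
    rate_factor * length_factor == pitch_ratio.
    """
    rate_factor = pitch_ratio ** rate_share
    length_factor = pitch_ratio ** (1 - rate_share)
    return rate_factor, length_factor


def quietest_index(samples, window):
    """Start index of the window with the smallest summed absolute amplitude."""
    best_start, best_energy = 0, float("inf")
    for start in range(len(samples) - window + 1):
        energy = sum(abs(s) for s in samples[start:start + window])
        if energy < best_energy:
            best_start, best_energy = start, energy
    return best_start


def resize_pitch_period(samples, new_length):
    """Delete or add samples in the quietest part of one pitch period."""
    if new_length < 1:
        raise ValueError("new_length must be at least 1")
    samples = list(samples)
    delta = new_length - len(samples)
    if delta == 0:
        return samples
    if delta < 0:
        # Truncation: drop a contiguous run of samples from the quietest window.
        count = -delta
        start = quietest_index(samples, count)
        return samples[:start] + samples[start + count:]
    # Extension: repeat the quietest single sample (an assumed padding rule).
    start = quietest_index(samples, 1)
    return samples[:start] + [samples[start]] * delta + samples[start:]


def apply_pitch_change(samples, base_rate_hz, pitch_ratio):
    """Return (resized pitch period, new dialout rate) for a pitch ratio."""
    rate_factor, length_factor = split_pitch_change(pitch_ratio)
    # A shorter period at the same dialout rate gives a higher pitch.
    new_length = max(1, round(len(samples) / length_factor))
    return resize_pitch_period(samples, new_length), base_rate_hz * rate_factor
```

Because the pitch of a voiced sound is the dialout rate divided by the pitch-period length in samples, shortening the period and raising the rate compound, which is why neither adjustment has to carry the whole change on its own.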

3. A method of improving the naturalness of pitch changes in artificial speech, comprising the steps of:

a) generating a code train containing a sequence of phoneme codes and pitch codes defining, respectively, voiced and unvoiced speech phonemes to be produced and target pitch levels for said voiced phonemes, said voiced phonemes being composed of a large plurality of pitch periods, and each target level being associated with a specific pitch period of a voiced phoneme having a specific sequential relation to the pitch code identifying that target level;

b) producing, in accordance with said train of phoneme codes and pitch codes, a train of concatenated waveforms representing pitch period of phonemes defined by said phoneme codes at pitch levels defined by said pitch codes;

c) converting said waveform train into artificial speech;

d) determining, whenever the pitch level of a pitch period is equal to a target level, the next target level defined by the next pitch code and the number of pitch periods to said specific pitch period associated with said next target level; and

e) changing the pitch value of each successive pitch period by an amount appropriate for reaching said next target level at said specific pitch period.

4. The method of claim 3, in which said specific pitch period is a predetermined pitch period of the first voiced phoneme defined by a phoneme code following the pitch code defining said next target level.

5. The method of claim 4, in which said specific pitch period is at the center of said first voiced phoneme.

6. The method of claim 3, in which said pitch value remains constant during unvoiced phonemes.
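Steps d) and e) of claim 3 can be sketched as a per-period interpolation. The Python below is only an illustration under stated assumptions: `pitch_track` is a hypothetical name, the change is assumed to be spread in equal steps (the abstract speaks of distributing each pitch change evenly), pitch values are treated as plain numbers in arbitrary units, and unvoiced phonemes (claim 6) are not modeled.

```python
def pitch_track(start_pitch, targets):
    """Return one pitch value per pitch period.

    `targets` is a list of (periods_to_target, target_pitch) pairs, where
    periods_to_target counts the pitch periods from the current one to the
    period at which the target level must be reached.
    """
    track = [start_pitch]
    current = start_pitch
    for periods, target in targets:
        if periods < 1:
            raise ValueError("periods_to_target must be at least 1")
        # Equal step per pitch period (assumed linear distribution).
        step = (target - current) / periods
        for i in range(1, periods):
            track.append(current + step * i)
        # Land exactly on the target at the designated pitch period.
        track.append(target)
        current = target
    return track
```

For example, `pitch_track(100.0, [(4, 110.0), (2, 104.0)])` rises by 2.5 per period for four periods and then falls by 3 per period for two, so no single pitch period carries an abrupt jump.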

Current Assignee: Sierra Entertainment, Inc. (Vivendi SE)
Original Assignee: First Byte
Inventors: Arthur, William J.; Sprague, Richard P.
Primary Examiner(s): MacDonald, Allen R.
Assistant Examiner(s): Knepper, David D.
Application Number: US07/566,963
Time in Patent Office: 820 Days
Field of Search: 381/36-40, 381/49, 381/50, 381/51-53, 395/2
US Class Current: 704/200
CPC Class Codes: G10L 13/10 Prosody rules derived from ...
