Low data rate speech encoding employing syllable pitch patterns

US 4,802,223 A
Filed: 11/03/1983
Issued: 01/31/1989
Est. Priority Date: 11/03/1983
Status: Expired due to Term

First Claim

Patent Images

1. A speech encoding apparatus comprising:

input means for receiving speech including one or more words of human language;

analysis means connected to said input means for analyzing said received speech, generating a sequence of phonological linguistic unit indicia corresponding to said received speech, grouping said phonological linguistic unit indicia into syllables, and generating pitch track data corresponding to said received speech;

pitch pattern memory means storing a plurality of predetermined pitch patterns;

pitch pattern recognizer means connected to said analysis means and to said pitch pattern memory means for selecting a pitch pattern from said plurality of predetermined pitch patterns for each syllable grouping of phonological linguistic unit indicia as generated by said analysis means, said pitch pattern being selected in dependence upon said pitch track data corresponding to each syllable grouping of phonological linguistic unit indicia; and

transmission means connected to said analysis means and said pitch pattern recognizer means for transmitting said phonological linguistic unit indicia and pitch pattern indicia corresponding to said selected pitch patterns.

View all claims

1 Assignment

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

The present invention is a speech encoding technique useful in low data rate speech. Spoken input is analyzed to determine its basic phonological linguistic units and syllables. The pitch track for each syllable is compared with each of a predetermined set of pitch patterns. A pitch pattern forming the best match to the actual pitch track is selected for each syllable. Phonological linguistic unit indicia and pitch pattern indicia are transmitted to a speech synthesis apparatus. This synthesis apparatus matches the pitch pattern indicia to syllable groupings of the phonological linguistic unit indicia. During speech synthesis, sounds are produced corresponding to the phonological linguistic unit indicia with their primary pitch controlled by the pitch pattern indicia of the corresponding syllable. This achieves some measure of approximation to the primary pitch of the original spoken input at a low data rate. In the preferred embodiment, each pitch pattern includes an initial pitch slope, which may be zero indicating no change in pitch, a final pitch slope and a turning point between these two slopes.

180 Citations

10 Claims

1. A speech encoding apparatus comprising:
- input means for receiving speech including one or more words of human language;
  
  analysis means connected to said input means for analyzing said received speech, generating a sequence of phonological linguistic unit indicia corresponding to said received speech, grouping said phonological linguistic unit indicia into syllables, and generating pitch track data corresponding to said received speech;
  
  pitch pattern memory means storing a plurality of predetermined pitch patterns;
  
  pitch pattern recognizer means connected to said analysis means and to said pitch pattern memory means for selecting a pitch pattern from said plurality of predetermined pitch patterns for each syllable grouping of phonological linguistic unit indicia as generated by said analysis means, said pitch pattern being selected in dependence upon said pitch track data corresponding to each syllable grouping of phonological linguistic unit indicia; and
  
  transmission means connected to said analysis means and said pitch pattern recognizer means for transmitting said phonological linguistic unit indicia and pitch pattern indicia corresponding to said selected pitch patterns.
- View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
- - 2. A speech encoding apparatus as claimed in claim 1, wherein:
    - said analysis means generated phonological linguistic unit indicia corresponding to phonemes of said received speech.
  - 3. A speech encoding apparatus as claimed in claim 1, wherein:
    - said analysis means generates phonological linguistic unit indicia corresponding to allophones of said received speech.
  - 4. A speech encoding apparatus as claimed in claim 1, wherein:
    - said analysis means generates phonological linguistic unit indicia corresponding to diphones of said received speech.
  - 5. A speech encoding apparatus as claimed in claim 1, wherein:
    - said pitch pattern recognizer means includes comparison means connected to said analysis means and said pitch pattern memory means for comparing the pitch track data for each syllable grouping of phonological linguistic unit indicia with each of said pitch patterns of said pitch pattern memory means and generating a measure of the similarity of said pitch track data to each of said pitch patterns, and selection means for selecting the pitch pattern from said plurality of predetermined pitch patterns having the best measure of similarity for each syllable grouping of phonological linguistic unit indicia.
  - 6. A speech encoding apparatus as claimed in claim 5, wherein:
    - said analysis means generates said pitch track data in a plurality of frames of data for each syllable; and
      
      said comparison means further includes first recomparison means for comparing said pitch track data omitting the first frame of data for each syllable with each of said pitch patterns and generating a measure of similarity, and second recomparison means for comparing said pitch track data omitting the last frame of data for each syllable with each of said pitch patterns and generating a measure of similarity.
  - 7. A speech encoding apparatus as claimed in claim 6, wherein:
    - said comparison means further includes third recomparison means for comparing said pitch track data omitting the first and last frames of data for each syllable with each of said pitch patterns and generating a measure of similarity.
  - 8. A speech encoding apparatus as claimed in claim 5, wherein:
    - said pitch pattern memory means in storing said plurality of predetermined pitch patterns includes therein a plurality of predetermined pitch slopes from which an initial pitch slope a final pitch slope, which an initial pitch slope, a final pitch slope, and a turning point may be selected for each of said plurality of pitch patterns.
  - 9. A speech encoding apparatus as claimed in claim 1, wherein:
    - said transmission means further includes means for transmitting an indication of the grouping of said phonological linguistic unit indicia into syllables.
  - 10. A speech encoding apparatus as claimed in claim 1, wherein:
    - said transmission means comprises machine readable optical bar codes.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Texas Instruments, Inc.
Original Assignee
Texas Instruments, Inc.
Inventors
Lin, Kun-Shan, Reimer, Jay B.
Primary Examiner(s)
KEMENY, EMANUEL

Application Number

US06/548,262
Time in Patent Office

1,916 Days
Field of Search

381/51-53, 381/36-40, 364/513.5
US Class Current

704/207
CPC Class Codes

G10L 19/0018 Speech coding using phoneti...

Low data rate speech encoding employing syllable pitch patterns

First Claim

1 Assignment

0 Petitions

Accused Products

Abstract

180 Citations

10 Claims

Specification

Use Cases

Quick Links

Others

Low data rate speech encoding employing syllable pitch patterns

First Claim

1 Assignment

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

180 Citations

10 Claims

Specification

Subscription Required

Use Cases

Quick Links

Others