System and method for converting text-to-voice

US 20020103648A1
Filed: 03/27/2001
Published: 08/01/2002
Est. Priority Date: 10/19/2000
Status: Active Grant

First Claim

Patent Images

1. A method for converting text to concatenated voice by utilizing a digital voice library and a set of playback rules, the digital voice library including a plurality of speech items and a corresponding plurality of voice recordings wherein each speech item corresponds to at least one available voice recording wherein multiple voice recordings that correspond to a single speech item represent various inflections of that single speech item, the method including receiving text data, converting the text data into a sequence of speech items in accordance with the digital voice library, the method further comprising:

establishing multiple voice recordings in the digital voice library that correspond to a single inflection of a single speech item, for a plurality of inflections of a plurality of speech items, that represent various ligatures for the single inflection of the single speech item with adjacent speech items.

View all claims

4 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A method for converting text to concatenated voice by utilizing a digital voice library and a set of playback rules is provided. The digital voice library includes a plurality of speech items and a corresponding plurality of voice recordings. Each speech item corresponds to at least one available voice recording. Multiple voice recordings that correspond to a single speech item represent various inflections of that single speech item. The method includes receiving text data, converting the text data into a sequence of speech items in accordance with the digital voice library. The method further includes establishing multiple voice recordings in the digital voice library that correspond to a single inflection of a single speech item, for a plurality of inflections of a plurality of speech items, that represent various ligatures for the single inflection of the single speech item with adjacent speech items.

Citations

16 Claims

1. A method for converting text to concatenated voice by utilizing a digital voice library and a set of playback rules, the digital voice library including a plurality of speech items and a corresponding plurality of voice recordings wherein each speech item corresponds to at least one available voice recording wherein multiple voice recordings that correspond to a single speech item represent various inflections of that single speech item, the method including receiving text data, converting the text data into a sequence of speech items in accordance with the digital voice library, the method further comprising:
- establishing multiple voice recordings in the digital voice library that correspond to a single inflection of a single speech item, for a plurality of inflections of a plurality of speech items, that represent various ligatures for the single inflection of the single speech item with adjacent speech items.
- View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
- - 2. The method of claim 1 wherein the multiple voice recordings in the digital voice library that correspond to a single inflection of a single speech item, for a plurality of inflections of a plurality of speech items, represent various ending ligatures for ending phonemes of the single inflection of the single speech item with beginning phonemes of adjacent speech items.
  - 3. The method of claim 1 wherein the multiple voice recordings in the digital voice library that correspond to a single inflection of a single speech item, for a plurality of inflections of a plurality of speech items, represent various beginning ligatures for beginning phonemes of the single inflection of the single speech item with ending phonemes of adjacent speech items.
  - 4. The method of claim 1 wherein the multiple voice recordings in the digital voice library that correspond to a single inflection of a single speech item, for a plurality of inflections of a plurality of speech items, represent various beginning and ending ligatures for beginning and ending phonemes of the single inflection of the single speech item with ending and beginning phonemes of adjacent speech items.
  - 5. The method of claim 4 wherein the ligatures include ligatures associated with vowel staging.
  - 6. The method of claim 5 wherein the ligatures include ligatures associated with vowel staging, consonant staging, and fricative consonant staging.
  - 7. The method of claim 1 wherein the ligatures include ligatures associated with vowel staging.
  - 8. The method of claim 7 wherein the ligatures include ligatures associated with vowel staging, consonant staging, and fricative consonant staging.

9. A method for converting text to concatenated voice by utilizing a digital voice library and a set of playback rules, the digital voice library including a plurality of speech items and a corresponding plurality of voice recordings wherein each speech item corresponds to at least one available voice recording wherein multiple voice recordings that correspond to a single speech item represent various inflections of that single speech item, the method including receiving text data, converting the text data into a sequence of speech items in accordance with the digital voice library, the method further comprising:
- establishing multiple voice recordings in the digital voice library that correspond to a single inflection of a single speech item, for a plurality of inflections of a plurality of speech items, that represent various ligatures for the single inflection of the single speech item with adjacent speech items;
  
  determining a desired inflection for each speech item in the sequence of speech items based on the set of playback rules;
  
  determining a sequence of voice recordings by determining a voice recording for each speech item based on the desired inflection for the particular speech item, the available voice recordings that correspond to the particular speech item, and the ligatures for the particular speech item with adjacent speech items; and
  
  generating voice data based on the sequence of voice recordings by concatenating adjacent recordings in the sequence of voice recordings.
- View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
- - 10. The method of claim 9 wherein the multiple voice recordings in the digital voice library that correspond to a single inflection of a single speech item, for a plurality of inflections of a plurality of speech items, represent various ending ligatures for ending phonemes of the single inflection of the single speech item with beginning phonemes of adjacent speech items, and wherein determining the sequence of voice recordings by determining a voice recording for each speech item is further based on ending ligatures for ending phonemes of the particular speech item with beginning phonemes of adjacent speech items.
  - 11. The method of claim 9 wherein the multiple voice recordings in the digital voice library that correspond to a single inflection of a single speech item, for a plurality of inflections of a plurality of speech items, represent various beginning ligatures for beginning phonemes of the single inflection of the single speech item with ending phonemes of adjacent speech items, and wherein determining the sequence of voice recordings by determining a voice recording for each speech item is further based on beginning ligatures for beginning phonemes of the particular speech item with ending phonemes of adjacent speech items.
  - 12. The method of claim 9 wherein the multiple voice recordings in the digital voice library that correspond to a single inflection of a single speech item, for a plurality of inflections of a plurality of speech items, represent various beginning and ending ligatures for beginning and ending phonemes of the single inflection of the single speech item with ending and beginning phonemes of adjacent speech items, and wherein determining the sequence of voice recordings by determining a voice recording for each speech item is further based on beginning and ending ligatures for beginning and ending phonemes of the particular speech item with ending and beginning phonemes of adjacent speech items.
  - 13. The method of claim 12 wherein the ligatures include ligatures associated with vowel staging.
  - 14. The method of claim 13 wherein the ligatures include ligatures associated with vowel staging, consonant staging, and fricative consonant staging.
  - 15. The method of claim 9 wherein the ligatures include ligatures associated with vowel staging.
  - 16. The method of claim 15 wherein the ligatures include ligatures associated with vowel staging, consonant staging, and fricative consonant staging.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Qwest Communications International Incorporated (Lumen Technologies, Inc.)
Original Assignee
Qwest Communications International Incorporated (Lumen Technologies, Inc.)
Inventors
Case, Eliot M., Weirauch, Judith L.

Granted Patent

US 6,871,178 B2
Time in Patent Office

Days
Field of Search
US Class Current

704/260
CPC Class Codes

G10L 13/04 Details of speech synthesis...

G10L 13/08 Text analysis or generation...

System and method for converting text-to-voice

First Claim

4 Assignments

0 Petitions

Accused Products

Abstract

Citations

16 Claims

Specification

Solutions

Use Cases

Quick Links

System and method for converting text-to-voice

First Claim

4 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

16 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links