System and method for converting text-to-voice

US 6,862,568 B2
Filed: 03/27/2001
Issued: 03/01/2005
Est. Priority Date: 10/19/2000
Status: Active Grant

First Claim

Patent Images

1. A method for converting text to concatenated voice by utilizing a digital voice library and a set of playback rules, the digital voice library including a plurality of voice recordings with each recording having a starting sonic feature and an ending sonic feature, the method including receiving text data, converting the text data into a sequence of voice recordings in accordance with the digital voice library and the set of playback rules, the method further comprising:

generating voice data based on the sequence of voice recordings by concatenating adjacent recordings in the sequence of voice recordings, wherein concatenating a first recording and a second recording adjacent to the first recording includes manipulating the ending sonic feature of the first recording to determine a first recording switch point, manipulating the starting sonic feature of the second recording to determine a second recording switch point, and synchronizing the first recording switch point and the second recording switch point;

wherein the starting and ending sonic features of the voice recordings are classified into a number of different categories including a noise, an impulse, and a tone;

wherein the ending sonic feature of the first recording is a tone and the starting sonic feature of the second recording is a tone, and wherein synchronizing the first recording switch point and the second recording switch point further includes synchronizing the tones, and switching on peaks of the tones; and

wherein the recordings overlap, and wherein synchronizing during the overlap includes multiplexing.

View all claims

4 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A method for converting text to concatenated voice by utilizing a digital voice library and a set of playback rules is provided. The method comprises generating voice data based on a sequence of voice recordings by concatenating adjacent recordings in the sequence of voice recordings. Concatenating a first recording and a second recording adjacent to the first recording includes manipulating the ending sonic feature of the first recording to determine a first recording switch point, manipulating the starting sonic feature of the second recording to determine a second recording switch point, and synchronizing the first recording switch point and the second recording switch point.

Citations

8 Claims

1. A method for converting text to concatenated voice by utilizing a digital voice library and a set of playback rules, the digital voice library including a plurality of voice recordings with each recording having a starting sonic feature and an ending sonic feature, the method including receiving text data, converting the text data into a sequence of voice recordings in accordance with the digital voice library and the set of playback rules, the method further comprising:
- generating voice data based on the sequence of voice recordings by concatenating adjacent recordings in the sequence of voice recordings, wherein concatenating a first recording and a second recording adjacent to the first recording includes manipulating the ending sonic feature of the first recording to determine a first recording switch point, manipulating the starting sonic feature of the second recording to determine a second recording switch point, and synchronizing the first recording switch point and the second recording switch point;
  
  wherein the starting and ending sonic features of the voice recordings are classified into a number of different categories including a noise, an impulse, and a tone;
  
  wherein the ending sonic feature of the first recording is a tone and the starting sonic feature of the second recording is a tone, and wherein synchronizing the first recording switch point and the second recording switch point further includes synchronizing the tones, and switching on peaks of the tones; and
  
  wherein the recordings overlap, and wherein synchronizing during the overlap includes multiplexing.

2. A method for converting text to concatenated voice by utilizing a digital voice library and a set of playback rules, the digital voice library including a plurality of voice recordings with each recording having a starting sonic feature and an ending sonic feature, the method including receiving text data, converting the text data into a sequence of voice recordings in accordance with the digital voice library and the set of playback rules, the method further comprising:
- generating voice data based on the sequence of voice recordings by concatenating adjacent recordings in the sequence of voice recordings, wherein concatenating a first recording and a second recording adjacent to the first recording includes manipulating the ending sonic feature of the first recording to determine a first recording switch point, manipulating the starting sonic feature of the second recording to determine a second recording switch point, and synchronizing the first recording switch point and the second recording switch point;
  
  wherein the starting and ending sonic features of the voice recordings are classified into a number of different categories including a noise, an impulse, and a tone; and
  
  wherein the ending sonic feature of the first recording is a noise and the starting sonic feature of the second recording is a noise, and wherein synchronizing the first recording switch point and the second recording switch point includes switching anywhere within the noise such that not more than fifty percent of duration of either noises is cut.

3. A method for converting text to concatenated voice by utilizing a digital voice library and a set of playback rules, the digital voice library including a plurality of voice recordings with each recording having a starting sonic feature and an ending sonic feature, the method including receiving text data, converting the text data into a sequence of voice recordings in accordance with the digital voice library and the set of playback rules, the method further comprising:
- generating voice data based on the sequence of voice recordings by concatenating adjacent recordings in the sequence of voice recordings, wherein concatenating a first recording and a second recording adjacent to the first recording includes manipulating the ending sonic feature of the first recording to determine a first recording switch point, manipulating the starting sonic feature of the second recording to determine a second recording switch point, and synchronizing the first recording switch point and the second recording switch point;
  
  wherein the starting and ending sonic features of the voice recordings are classified into a number of different categories including a noise, an impulse, and a tone;
  
  wherein the ending sonic feature of the first recording is a tone and the starting sonic feature of the second recording is an impulse, and wherein synchronizing the first recording switch point and the second recording switch point further includes switching on a peak of the tone and on an impulse of the impulse; and
  
  wherein the tone and the impulse overlap, and wherein synchronizing during the overlap includes multiplexing.

4. A method for converting text to concatenated voice by utilizing a digital voice library and a set of playback rules, the digital voice library including a plurality of voice recordings with each recording having a starting sonic feature and an ending sonic feature, the method including receiving text data, converting the text data into a sequence of voice recordings in accordance with the digital voice library and the set of playback rules, the method further comprising:
- generating voice data based on the sequence of voice recordings by concatenating adjacent recordings in the sequence of voice recordings, wherein concatenating a first recording and a second recording adjacent to the first recording includes manipulating the ending sonic feature of the first recording to determine a first recording switch point, manipulating the starting sonic feature of the second recording to determine a second recording switch point, and synchronizing the first recording switch point and the second recording switch point;
  
  wherein the starting and ending sonic features of the voice recordings are classified into a number of different categories including a noise, an impulse, and a tone; and
  
  wherein the ending sonic feature of the first recording is a noise and the starting sonic feature of the second recording is an impulse, and wherein synchronizing the first recording switch point and the second recording switch point further includes switching anywhere within the noise such that not more than fifty percent of the noise is cut, and switching on an impulse of the impulse.

5. A method for converting text to concatenated voice by utilizing a digital voice library and a set of playback rules, the digital voice library including a plurality of voice recordings with each recording having a starting sonic feature and an ending sonic feature, the method including receiving text data, converting the text data into a sequence of voice recordings in accordance with the digital voice library and the set of playback rules, the method further comprising:
- generating voice data based on the sequence of voice recordings by concatenating adjacent recordings in the sequence of voice recordings, wherein concatenating a first recording and a second recording adjacent to the first recording includes manipulating the ending sonic feature of the first recording to determine a first recording switch point, manipulating the starting sonic feature of the second recording to determine a second recording switch point, and synchronizing the first recording switch point and the second recording switch point;
  
  wherein the starting and ending sonic features of the voice recordings are classified into a number of different categories including a noise, an impulse, and a tone; and
  
  wherein the ending sonic feature of the first recording is a noise and the starting sonic feature of the second recording is an tone, and wherein synchronizing the first recording switch point and the second recording switch point further includes switching anywhere within the noise such that not more than fifty percent of the noise is cut, and switching on a peak of the tone.

6. A method for converting text to concatenated voice by utilizing a digital voice library and a set of playback rules, the digital voice library including a plurality of voice recordings with each recording having a starting sonic feature and an ending sonic feature, the method including receiving text data, converting the text data into a sequence of voice recordings in accordance with the digital voice library and the set of playback rules, the method further comprising;
- generating voice data based on the sequence of voice recordings by concatenating adjacent recordings in the sequence of voice recordings, wherein concatenating a first recording and a second recording adjacent to the first recording includes manipulating the ending sonic feature of the first recording to determine a first recording switch point, manipulating the starting sonic feature of the second recording to determine a second recording switch point, and synchronizing the first recording switch point and the second recording switch point;
  
  wherein the starting and ending sonic features of the voice recordings are classified into a number of different categories including a noise, an impulse, and a tone;
  
  wherein the ending sonic feature of the first recording is an impulse and the starting sonic feature of the second recording is a tone, and wherein synchronizing the first recording switch point and the second recording switch point further includes switching at a peak of the tone and an end of the impulse; and
  
  wherein the impulse and the tone overlap, and wherein synchronizing during the overlap includes multiplexing.

7. A method for converting text to concatenated voice by utilizing a digital voice library and a set of playback rules, the digital voice library including a plurality of voice recordings with each recording having a starting sonic feature and an ending sonic feature, the method including receiving text data, converting the text data into a sequence of voice recordings in accordance with the digital voice library and the set of playback rules, the method further comprising:
- generating voice data based on the sequence of voice recordings by concatenating adjacent recordings in the sequence of voice recordings, wherein concatenating a first recording and a second recording adjacent to the first recording includes manipulating the ending sonic feature of the first recording to determine a first recording switch point, manipulating the starting sonic feature of the second recording to determine a second recording switch point, and synchronizing the first recording switch point and the second recording switch point;
  
  wherein the starting and ending sonic features of the voice recordings are classified into a number of different categories including a noise, an impulse, and a tone; and
  
  wherein the ending sonic feature of the first recording is an impulse and the starting sonic feature of the second recording is a noise, and wherein synchronizing the first recording switch point and the second recording switch point further includes switching anywhere within the noise such that not more than fifty percent of duration of the noise is cut, and switching an end of the impulse.

8. A method for converting text to concatenated voice by utilizing a digital voice library and a set of playback rules, the digital voice library including a plurality of voice recordings with each recording having a starting sonic feature and an ending sonic feature, the method including receiving text data, converting the text data into a sequence of voice recordings in accordance with the digital voice library and the set of playback rules, the method further comprising:
- generating voice data based on the sequence of voice recordings by concatenating adjacent recordings in the sequence of voice recordings, wherein concatenating a first recording and a second recording adjacent to the first recording includes manipulating the ending sonic feature of the first recording to determine a first recording switch point, manipulating the starting sonic feature of the second recording to determine a second recording switch point, and synchronizing the first recording switch point and the second recording switch point;
  
  wherein the starting and ending sonic features of the voice recordings are classified into a number of different categories including a noise, an impulse, and a tone; and
  
  wherein the ending sonic feature of the first recording is an tone and the starting sonic feature of the second recording is a noise, and wherein synchronizing the first recording switch point and the second recording switch point further includes switching anywhere within the noise such that not more than fifty percent of duration of the noise is cut, and switching at a peak of the tone.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Qwest Communications International Incorporated (Lumen Technologies, Inc.)
Original Assignee
Qwest Communications International Incorporated (Lumen Technologies, Inc.)
Inventors
Case, Eliot M.
Primary Examiner(s)
Young, W. R.
Assistant Examiner(s)
Wozniak, James S.

Application Number

US09/818,208
Publication Number

US 20020077822A1
Time in Patent Office

1,435 Days
Field of Search

704/260, 704/258, 704/268, 704/278, 704/265, 704/267
US Class Current

704/260
CPC Class Codes

G10L 13/04 Details of speech synthesis...

G10L 13/07 Concatenation rules

System and method for converting text-to-voice

First Claim

4 Assignments

0 Petitions

Accused Products

Abstract

Citations

8 Claims

Specification

Solutions

Use Cases

Quick Links

System and method for converting text-to-voice

First Claim

4 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

8 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links