Personalized text-to-speech services

US 8,918,322 B1
Filed: 06/20/2007
Issued: 12/23/2014
Est. Priority Date: 06/30/2000
Status: Expired due to Term

First Claim

Patent Images

1. A method comprising:

receiving, from a sender, a textual message generated by a spoken dialog system, the textual message having a fixed text portion and a variable text portion;

selecting, based on voice characteristics of the sender and the sender speaking a particular set of lines, a speech template from a plurality of speech templates, the speech template comprising information representing characteristics of an individual'"'"'s voice, wherein each speech template in the plurality of speech templates is personalized to the individual and in a distinct language from other speech templates in the plurality of speech templates;

accessing pre-recorded speech from storage, the pre-recorded speech corresponding to the fixed text portion of the textual message;

generating variable speech corresponding to the variable text portion of the textual message; and

merging the pre-recorded speech and the variable speech in an order defined by the speech template.

View all claims

6 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A personalized text-to-speech (pTTS) system provides a method for converting text data to speech data utilizing a pTTS template representing the voice characteristics of an individual. A memory stores executable program code that converts text data to speech data. Text data represents a textual message directed to a system user and speech data represents a spoken form of text data having the characteristics of an individual'"'"'s voice. A processor executes the program code, and a storage device stores a pTTS template and may store speech data. The pTTS system can be used to provide various services that provide immediate spoken presentation of the speech data converted from text data and/or combine stored speech data with generated speech data for spoken presentation.

14 Citations

View as Search Results

19 Claims

1. A method comprising:
- receiving, from a sender, a textual message generated by a spoken dialog system, the textual message having a fixed text portion and a variable text portion;
  
  selecting, based on voice characteristics of the sender and the sender speaking a particular set of lines, a speech template from a plurality of speech templates, the speech template comprising information representing characteristics of an individual'"'"'s voice, wherein each speech template in the plurality of speech templates is personalized to the individual and in a distinct language from other speech templates in the plurality of speech templates;
  
  accessing pre-recorded speech from storage, the pre-recorded speech corresponding to the fixed text portion of the textual message;
  
  generating variable speech corresponding to the variable text portion of the textual message; and
  
  merging the pre-recorded speech and the variable speech in an order defined by the speech template.
- View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
- - 2. The method according to claim 1, wherein selecting of the speech template is further based on an attribute that is an identifier of the sender.
  - 3. The method according to claim 1, wherein the individual'"'"'s voice is associated with an individual who is not the sender.
  - 4. The method according to claim 1, wherein:
    - accessing the pre-recorded speech is based on an attribute of the sender, and wherein each of a plurality of speech segments of the pre-recorded speech has characteristics of a unique individual'"'"'s voice.
  - 5. The method according to claim 4, wherein the attribute is one of age and gender.
  - 6. The method according to claim 1, wherein the speech template represents the characteristics of the voice of one of a parent, sibling, relative, teacher, and friend of the recipient.
  - 7. The method according to claim 6, wherein a user receives the spoken version of the textual message with one of a telephone and telephone application programming interface equipped device coupled across a telephone network to a computer.
  - 8. The method according to claim 1, wherein the textual message comprises one of an e-mail message and a manuscript text.
  - 9. The method according to claim 1, further comprising:
    - receiving a voice sample from a user; and
      
      generating a user specific speech template for the user based on the voice sample.
  - 10. The method of claim 1, wherein the individual'"'"'s voice is associated with an individual who is also the sender.

11. A system comprising:
- a processor; and
  
  a computer-readable storage medium having instructions stored which, when executed by the processor, cause the processor to perform operations comprising;
  
  receiving, from a sender, a textual message generated by a spoken dialog system, the textual message having a fixed text portion and a variable text portion;
  
  selecting, based on voice characteristics of the sender and the sender speaking a particular set of lines, a speech template from a plurality of speech templates, the speech template comprising information representing characteristics of an individual'"'"'s voice, wherein each speech template in the plurality of speech templates is personalized to the individual and in a distinct language from other speech templates in the plurality of speech templates;
  
  accessing pre-recorded speech from storage, the pre-recorded speech corresponding to the fixed text portion of the textual message;
  
  generating variable speech corresponding to the variable text portion of the textual message; and
  
  merging the pre-recorded speech and the variable speech in an order defined by the speech template.
- View Dependent Claims (12, 13, 14, 15, 16)
- - 12. The system according to claim 11, wherein selecting of the speech template further comprises selecting the speech template based on an attribute that is an identifier of the sender.
  - 13. The system according to claim 11, wherein:
    - accessing the pre-recorded speech further comprises accessing the pre-recorded speech based on an attribute of the user, and wherein each of a plurality of speech segments of the pre-recorded speech has characteristics of a unique individual'"'"'s voice.
  - 14. The system according to claim 11, the computer-readable storage medium having additional instructions stored which result in the operations further comprising:
    - receiving a voice sample from a user; and
      
      generating a user specific speech template for the user based on the voice sample.
  - 15. The system of claim 11, wherein the individual'"'"'s voice is associated with an individual who is also the sender.
  - 16. The system of claim 11, wherein the individual'"'"'s voice is associated with an individual who is not the sender.

17. A computer-readable device having instructions stored, which, when executed by a computing device, cause the computing device to perform operations comprising:
- receiving, from a sender, a textual message generated by a spoken dialog system, the textual message having a fixed text portion and a variable text portion;
  
  selecting, based on voice characteristics of the sender and the sender speaking a particular set of lines, a speech template from a plurality of speech templates, the speech template comprising information representing characteristics of an individual'"'"'s voice, wherein each speech template in the plurality of speech templates is personalized to the individual and in a distinct language from other speech templates in the plurality of speech templates;
  
  accessing pre-recorded speech from storage, the pre-recorded speech corresponding to the fixed text portion of the textual message;
  
  generating variable speech corresponding to the variable text portion of the textual message; and
  
  merging the pre-recorded speech and the variable speech in an order defined by the speech template.
- View Dependent Claims (18, 19)
- - 18. The computer-readable storage device of claim 17, wherein the individual'"'"'s voice is associated with an individual who is also the sender.
  - 19. The computer-readable storage device of claim 17, wherein the individual'"'"'s voice is associated with an individual who is not the sender.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Cerence Operating Company (Cerence Inc.)
Original Assignee
AT&T Intellectual Property II LP (AT&T, Inc.)
Inventors
Acker, Edmund Gale, Burg, Frederick Murray
Primary Examiner(s)
Armstrong, Angela A

Application Number

US11/765,773
Time in Patent Office

2,743 Days
Field of Search

704/260, 704/270.1, 704/258, 704/270
US Class Current

704/260
CPC Class Codes

G10L 13/02   Methods for producing synth...

G10L 13/033   Voice editing, e.g. manipul...

G10L 19/00   Speech or audio signals ana...

Personalized text-to-speech services

First Claim

6 Assignments

0 Petitions

Accused Products

Abstract

14 Citations

19 Claims

Specification

Solutions

Use Cases

Quick Links

Personalized text-to-speech services

First Claim

6 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

14 Citations

19 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links