Personal message service with enhanced text to speech synthesis

US 7,027,568 B1
Filed: 10/10/1997
Issued: 04/11/2006
Est. Priority Date: 10/10/1997
Status: Expired due to Fees

Abstract

A server in a network gathers textual information, such as news items, E-mail and the like. From that information, the server develops or identifies messages for use by individual subscribers. The same server that accumulates the text messages or another server in the network converts the textual information in each message to a sequence of speech synthesizer instructions. The converted messages, containing the sequences of speech synthesizer instructions, are transmitted to each identified subscriber's terminal device. A synthesizer in the terminal generates an audio waveform signal, representing the speech information, in response to the instructions. In the preferred embodiment, the terminals utilize concatenative type speech synthesizers, each of which has an associated vocabulary of stored fundamental sound samples. The instructions identify the sound samples, in order. The instructions also provide parameters for controlling characteristics of the signal generated during waveform synthesis for each sound sample in each sequence. For example, the instructions may specify the pitch, duration, amplitude, attack envelope and decay envelope for each sample. The division of the text to speech synthesis processing between the server and the terminals places the cost of the front end processing in the server, which is a shared resource. As a result, the hardware and software of the terminal may be relatively simple and inexpensive. Also, it is possible to upgrade the quality of the synthesis by upgrading the server software, without modifying the terminals.
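
The terminal-side half of this division of labor can be illustrated with a minimal Python sketch. It is not the patented implementation: the `SynthInstruction` record, the `render` and `synthesize` functions, the treatment of pitch as a playback-rate multiplier, and the linear attack and decay ramps are all assumptions made for illustration. The abstract only states that each instruction identifies a stored sample and may specify pitch, duration, amplitude, attack envelope and decay envelope.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class SynthInstruction:
    """One hypothetical instruction: which stored sample to play and how."""
    sample_id: str    # key into the terminal's vocabulary of sound samples
    pitch: float      # assumed playback-rate multiplier (1.0 = as stored)
    duration: int     # number of output samples to emit
    amplitude: float  # peak gain, 0.0 to 1.0
    attack: int       # output samples over which the gain ramps up
    decay: int        # output samples over which the gain ramps down


def render(instr: SynthInstruction, vocabulary: Dict[str, List[float]]) -> List[float]:
    """Render one instruction from the locally stored vocabulary."""
    source = vocabulary[instr.sample_id]
    out = []
    for n in range(instr.duration):
        # Nearest-neighbour resampling; loop the stored sample if it is short.
        value = source[int(n * instr.pitch) % len(source)]
        gain = instr.amplitude
        if instr.attack and n < instr.attack:
            gain *= n / instr.attack          # linear attack ramp (assumed)
        remaining = instr.duration - 1 - n
        if instr.decay and remaining < instr.decay:
            gain *= remaining / instr.decay   # linear decay ramp (assumed)
        out.append(value * gain)
    return out


def synthesize(instructions: List[SynthInstruction],
               vocabulary: Dict[str, List[float]]) -> List[float]:
    """Concatenate the rendered samples in the order the instructions give."""
    waveform: List[float] = []
    for instr in instructions:
        waveform.extend(render(instr, vocabulary))
    return waveform
```

In this sketch the terminal needs only a lookup table and a small rendering loop; everything that decides which samples to play, and how, arrives in the instruction sequence.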

293 Citations

27 Claims

1. A system comprising:
- a server coupled to a data communication network, said server being programmed to execute sequences of program instructions for;
  
  (a) obtaining textual information for forming messages for a plurality of subscribers,
  
  (b) performing a significant portion of a text to speech process to convert the textual information of at least one of the messages to speech synthesizer instructions, and
  
  (c) transmitting the speech synthesizer instructions over the data communication network; and
  
  a subscriber terminal for receiving the speech synthesizer instructions via the data communication network, said subscriber terminal comprising a speech synthesizer and a vocabulary of speech sounds for synthesizing a speech waveform signal representing the at least one message from the speech synthesizer instructions.
  - 2. A system as in claim 1, wherein the server includes means for transmitting the speech synthesizer instructions over a packet switched data network.
  - 3. A system as in claim 1, wherein the terminal further comprises a programmable central processing unit and an interface coupled to the programmable central processing unit for communication via the data network.
  - 4. A system as in claim 3, wherein the interface comprises a modem.
  - 5. A system as in claim 4, wherein the modem comprises a wireless network data modem.
  - 6. A system as in claim 5, wherein the wireless network data modem comprises a cellular digital packet data (CDPD) modem.
  - 7. A system as in claim 1, further comprising a mail system for receiving mail messages for subscribers and supplying the mail messages as the textual information to the server for conversion and transmission to the subscriber terminal.
  - 8. A system as in claim 7, further comprising a news information server, said server being programmed to execute sequences of program instructions for:
    - storing profile information regarding news topics of interest to individual subscribers;
      
      receiving and storing news items from one or more sources;
      
      comparing the stored news items to the stored profile information to identify news items of interest to each individual subscriber;
      
      addressing mail messages containing text information representing the items of interest to subscribers mail boxes in the mail system; and
      
      transmitting the mail messages containing text information representing the items of interest to the mail system.
  - 9. A system as in claim 1, further comprising a unified message management platform for receiving mail messages for subscribers in a plurality of different formats including text format, and at least one other format, converting mail messages from the at least one other format to the text format, and supplying the text format mail messages to the server as the textual information for conversion and transmission to the subscriber terminal.
  - 10. A system as in claim 1, wherein the server also is programmed to execute sequences of program instructions for:
    - storing profile information regarding news topics of interest to individual subscribers;
      
      receiving and storing news items from one or more sources; and
      
      comparing the stored news items to the stored profile information to identify news items of interest to each individual subscriber, wherein said textual information of at least one of the messages comprises one of the identified news items.
  - 11. A system as in claim 1, wherein the speech synthesizer comprises:
    - a memory storing a plurality of fundamental sound samples, in digitized form; and
      
      a concatenative speech synthesizer responsive to the instructions, for processing samples from the memory in an order specified by the instructions and to control parameters of each of the processed samples in a manner specified in the instructions, to thereby generate the speech waveform signal.
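
The profile-matching steps recited in claims 8 and 10 (store profiles, receive news items, compare them to identify items of interest) might be sketched as follows. The claims do not say how the comparison is done; the keyword-overlap rule, the `match_news` function and the dictionary shapes here are hypothetical choices for illustration only.

```python
import re
from typing import Dict, List, Set


def match_news(profiles: Dict[str, Set[str]], items: List[str]) -> Dict[str, List[str]]:
    """Return, per subscriber, the news items that mention a topic of interest.

    profiles maps a subscriber name to a set of topic keywords.
    Matching is a plain case-insensitive keyword overlap (an assumption).
    """
    matches: Dict[str, List[str]] = {subscriber: [] for subscriber in profiles}
    for text in items:
        # Lowercase and split on non-alphanumerics so punctuation is ignored.
        words = set(re.findall(r"[a-z0-9]+", text.lower()))
        for subscriber, topics in profiles.items():
            if words & {topic.lower() for topic in topics}:
                matches[subscriber].append(text)
    return matches
```

Each matched item would then become the textual information of a message addressed to that subscriber, ready for conversion to speech synthesizer instructions.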

12. A network server, comprising:
- a computer coupled to a data communication network, said computer being programmed to execute sequences of program instructions for;
  
  (a) obtaining textual information for messages for a plurality of subscribers;
  
  (b) performing a significant portion of a text to speech process to convert the textual information of the messages to speech synthesizer instructions each speech synthesizer instruction identifying a fundamental sound and at least one control parameter for controlling generation of a waveform corresponding to the fundamental sound in a vocabulary of fundamental sounds stored in a subscriber terminal; and
  
  (c) transmitting sequences of the speech synthesizer instructions, representing the messages, over the data communication network to subscriber terminals for waveform generation in response thereto.
  - 13. A network server as in claim 12, wherein the server also is programmed to execute sequences of program instructions for:
    - storing profile information regarding news topics of interest to individual subscribers; and
      
      receiving and storing news items from one or more sources;
      
      comparing the stored news items to the stored profile information to identify news items of interest to each individual subscriber, wherein said textual information of at least one of the messages comprises one of the identified news items.
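
Step (c) of claim 12 transmits sequences of instructions, each identifying a fundamental sound and at least one control parameter. The claim does not define a wire format, so the layout below is purely hypothetical: a 16-bit count followed by fixed-size records holding a sample index, a pitch offset, a duration and an amplitude. It is a sketch of how compact such a message could be, not a description of the actual system.

```python
import struct
from typing import List, Tuple

# Hypothetical record: (sample_index, pitch_offset, duration_ms, amplitude)
Instruction = Tuple[int, int, int, int]

_HEADER = struct.Struct("!H")     # number of instructions, network byte order
_RECORD = struct.Struct("!HhHB")  # uint16, int16, uint16, uint8 = 7 bytes


def encode_message(instructions: List[Instruction]) -> bytes:
    """Pack an instruction sequence into bytes for transmission."""
    parts = [_HEADER.pack(len(instructions))]
    for instr in instructions:
        parts.append(_RECORD.pack(*instr))
    return b"".join(parts)


def decode_message(data: bytes) -> List[Instruction]:
    """Recover the instruction sequence on the terminal side."""
    (count,) = _HEADER.unpack_from(data, 0)
    offset = _HEADER.size
    instructions = []
    for _ in range(count):
        instructions.append(_RECORD.unpack_from(data, offset))
        offset += _RECORD.size
    return instructions
```

With this assumed layout each instruction costs seven bytes, far less than the audio samples it stands for, which is consistent with the motivation for sending instructions rather than waveforms.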

14. A communication terminal device, comprising:
- a data interface for receiving data from a communication network;
  
  a programmable central processing unit for processing the received data to capture speech synthesizer instructions contained in the received data;
  
  a memory storing a vocabulary comprising a plurality of fundamental sound samples, in digitized form; and
  
  a concatenative speech synthesizer responsive to the instructions, for processing samples from the memory in an order specified by the instructions and to control parameters of a waveform signal synthesized from the processed samples in a manner specified in the instructions.
  - 15. A terminal as in claim 14, wherein the interface comprises a modem.
  - 16. A terminal as in claim 15, wherein the modem comprises a wireless network data modem.
  - 17. A terminal as in claim 16, wherein the wireless network data modem comprises a cellular digital packet data (CDPD) modem.
  - 18. A terminal as in claim 14, further comprising:
    - a keyboard for supplying user inputs to the programmable central processing unit; and
      
      a display for displaying information provided by the programmable central processing unit.
  - 26. A communication terminal as recited in claim 14, wherein said speech synthesizer instructions are in the form of MIDI (Musical Instrument Digital Interface) commands.
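
Claim 26 recites speech synthesizer instructions in the form of MIDI commands but does not say how sounds map onto them. One possible mapping, shown here only as an assumption, treats each stored sound sample as a MIDI note number and the amplitude as the note velocity. The status bytes are standard MIDI (Note On is `0x90` plus the channel, Note Off is `0x80` plus the channel); the `to_midi_events` function and its millisecond timestamps are hypothetical.

```python
from typing import List, Tuple

NOTE_ON = 0x90   # MIDI Note On status byte, channel in the low nibble
NOTE_OFF = 0x80  # MIDI Note Off status byte, channel in the low nibble


def to_midi_events(instructions: List[Tuple[int, int, float]],
                   channel: int = 0) -> List[Tuple[int, bytes]]:
    """Turn (sample_index, duration_ms, amplitude) triples into timed messages.

    Returns (time_ms, message_bytes) pairs. Mapping a sample index to the
    note number and amplitude to velocity is an illustrative assumption.
    """
    if not 0 <= channel <= 15:
        raise ValueError("MIDI channel must be 0..15")
    events = []
    clock_ms = 0
    for sample_index, duration_ms, amplitude in instructions:
        if not 0 <= sample_index <= 127:
            raise ValueError("MIDI data bytes are limited to 0..127")
        # Keep velocity >= 1: a Note On with velocity 0 is conventionally
        # treated as a Note Off.
        velocity = max(1, min(127, round(amplitude * 127)))
        events.append((clock_ms, bytes([NOTE_ON | channel, sample_index, velocity])))
        clock_ms += duration_ms
        events.append((clock_ms, bytes([NOTE_OFF | channel, sample_index, 0])))
    return events
```

Under this mapping a single channel addresses at most 128 sounds, so a larger vocabulary would need some further scheme that the claim leaves open.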

19. A method of providing personalized information services, comprising:
- storing subscriber profiles relating to topics of interest to a plurality of individual subscribers;
  
  receiving items of information from a plurality of sources;
  
  comparing the items of information to the subscriber profiles to identify items of interest to particular subscribers;
  
  converting textual information relating to at least some of the identified items of interest to sequences of speech synthesizer instructions;
  
  transmitting each of the sequences of instructions to one or more terminals, each terminal being utilized by a subscriber;
  
  storing received sequences of instructions in respective subscriber terminals;
  
  in response to one of the sequences of instructions, retrieving sound samples from a memory comprising a vocabulary in a subscriber terminal in an order specified by the one sequence of instructions and adjusting process parameters for the retrieved samples in a manner specified by the one sequence of instructions, to thereby generate a speech waveform signal representative of one of the identified items of interest.
  - 20. A method as in claim 19, wherein the step of converting textual information relating to at least some of the identified items of interest to sequences of speech synthesizer instructions comprises:
    - computing linguistic parameter specifications from input text data;
      
      converting the linguistic parameters into synthesizer control parameters, said synthesizer control parameters identifying the samples in an order corresponding to the input text data and specifying the manner of adjusting the process parameters for the identified samples.
  - 21. A method as in claim 19, wherein the step of transmitting comprises transmitting at least some of the sequences of instructions over a wireless data link to a plurality of the respective subscriber terminals.
  - 22. A method as in claim 19, wherein the step of transmitting comprises transmitting at least some of the sequences of instructions via a packet switched data network.
  - 23. A method as in claim 22, wherein the public switched packet data network comprises the Internet.
  - 24. A method as in claim 19, wherein at least one of the respective subscriber terminals comprises a portable device with wireless data communication capability enabling wireless reception of sequences of the instructions.
  - 25. A method as in claim 19, wherein at least one of the respective subscriber terminals comprises a personal computer for coupling to a public data network enabling reception of sequences of the instructions via the public data network.
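
The two-stage conversion of claim 20 (linguistic parameters computed from text, then converted to synthesizer control parameters) can be pictured with a toy pipeline. Everything concrete here is assumed for illustration: the tiny `LEXICON`, the phoneme labels, the single `phrase_final` feature, and the rule that lowers pitch and lengthens the final sound. A real front end would carry far richer linguistic detail.

```python
import re
from dataclasses import dataclass
from typing import List

# Hypothetical pronunciation lexicon; a real system would be far larger.
LEXICON = {
    "hello": ["HH", "EH", "L", "OW"],
    "world": ["W", "ER", "L", "D"],
}


@dataclass(frozen=True)
class LinguisticUnit:
    phoneme: str
    phrase_final: bool  # the only linguistic feature modelled in this sketch


@dataclass(frozen=True)
class ControlParams:
    sample_id: str    # which stored sample the terminal should retrieve
    pitch: float      # assumed relative pitch
    duration_ms: int  # assumed duration


def linguistic_analysis(text: str) -> List[LinguisticUnit]:
    """Stage 1: compute linguistic parameters from the input text."""
    words = re.findall(r"[a-z]+", text.lower())
    units = []
    for i, word in enumerate(words):
        if word not in LEXICON:
            raise KeyError(f"no pronunciation for {word!r}")
        phonemes = LEXICON[word]
        for j, phoneme in enumerate(phonemes):
            is_final = i == len(words) - 1 and j == len(phonemes) - 1
            units.append(LinguisticUnit(phoneme, is_final))
    return units


def to_control_params(units: List[LinguisticUnit]) -> List[ControlParams]:
    """Stage 2: convert linguistic parameters into synthesizer control parameters."""
    params = []
    for unit in units:
        # Toy prosody rule: lower and lengthen the phrase-final sound.
        pitch = 0.9 if unit.phrase_final else 1.0
        duration_ms = 120 if unit.phrase_final else 80
        params.append(ControlParams(unit.phoneme, pitch, duration_ms))
    return params
```

The output preserves the order of the input text, matching the claim's requirement that the control parameters identify the samples in an order corresponding to the input text data.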

27. A system comprising:
- a server coupled to a data communication network, said server being programmed to execute sequences of program instructions for;
  
  (a) obtaining textual information for forming messages for a plurality of subscribers,
  
  (b) performing a significant portion of a text to speech process to convert the textual information of at least one of the messages to speech synthesizer instructions in the form of MIDI (Musical Instrument Digital Interface) commands, and
  
  (c) transmitting the speech synthesizer instructions over the data communication network; and
  
  a subscriber terminal for receiving the speech synthesizer instructions via the data communication network, said subscriber terminal comprising a speech synthesizer and a vocabulary of speech sounds for synthesizing a speech waveform signal representing the at least one message from the speech synthesizer instructions.

Current Assignee: Google LLC (Alphabet Inc.)
Original Assignee: Verizon Services Corporation (Verizon Communications Inc.)
Inventors: McAllister, Alexander I.; Curry, James E.; Simpson, David
Primary Examiner(s): Hoosain, Allan
Application Number: US08/948,328
Time in Patent Office: 3,105 Days
Field of Search: 379/67.1, 379/88.13, 379/88.17, 379/93.24, 379/93.25, 379/100.07, 379/900, 379/101.01, 379/88.07, 379/88.08, 379/88.12, 379/88.16, 379/88.18, 704/270, 704/275
US Class Current: 379/88.16
CPC Class Codes:
G10L 13/00   Speech synthesis; Text to s...
H04M 1/72436   for text messaging, e.g. sh...
H04M 2201/60   Medium conversion
H04M 3/493   Interactive information ser...
H04M 3/4938   comprising a voice browser ...