Personal message service with enhanced text to speech synthesis
First Claim
1. A system comprising:
- a server coupled to a data communication network, said server being programmed to execute sequences of program instructions for;
(a) obtaining textual information for forming messages for a plurality of subscribers,(b) performing a significant portion of a text to speech process to convert the textual information of at least one of the messages to speech synthesizer instructions, and(c) transmitting the speech synthesizer instructions over the data communication network; and
a subscriber terminal for receiving the speech synthesizer instructions via the data communication network, said subscriber terminal comprising a speech synthesizer and a vocabulary of speech sounds for synthesizing a speech waveform signal representing the at least one message from the speech synthesizer instructions.
5 Assignments
0 Petitions
Accused Products
Abstract
A server in a network gathers textual information, such as news items, E-mail and the like. From that information, the server develops or identifies messages for use by individual subscribers. The same server that accumulates the text messages or another server in the network converts the textual information in each message to a sequence of speech synthesizer instructions. The converted messages, containing the sequences of speech synthesizer instructions, are transmitted to each identified subscriber'"'"'s terminal device. A synthesizer in the terminal generates an audio waveform signal, representing the speech information, in response to the instructions. In the preferred embodiment, the terminals utilize concatenative type speech synthesizers, each of which has an associated vocabulary of stored fundamental sound samples. The instructions identify the sound samples, in order. The instructions also provide parameters for controlling characteristics of the signal generated during waveform synthesis for each sound sample in each sequence. For example, the instructions may specify the pitch, duration, amplitude, attack envelope and decay envelope for each sample. The division of the text to speech synthesis processing between the server and the terminals places the cost of the front end processing in the server, which is a shared resource. As a result, the hardware and software of the terminal may be relatively simple and inexpensive. Also, it is possible to upgrade the quality of the synthesis by upgrading the server software, without modifying the terminals.
293 Citations
27 Claims
-
1. A system comprising:
-
a server coupled to a data communication network, said server being programmed to execute sequences of program instructions for; (a) obtaining textual information for forming messages for a plurality of subscribers, (b) performing a significant portion of a text to speech process to convert the textual information of at least one of the messages to speech synthesizer instructions, and (c) transmitting the speech synthesizer instructions over the data communication network; and a subscriber terminal for receiving the speech synthesizer instructions via the data communication network, said subscriber terminal comprising a speech synthesizer and a vocabulary of speech sounds for synthesizing a speech waveform signal representing the at least one message from the speech synthesizer instructions. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A network server, comprising:
-
a computer coupled to a data communication network, said computer being programmed to execute sequences of program instructions for; (a) obtaining textual information for messages for a plurality of subscribers; (b) performing a significant portion of a text to speech process to convert the textual information of the messages to speech synthesizer instructions each speech synthesizer instruction identifying a fundamental sound and at least one control parameter for controlling generation of a waveform corresponding to the fundamental sound in a vocabulary of fundamental sounds stored in a subscriber terminal; and (c) transmitting sequences of the speech synthesizer instructions, representing the messages, over the data communication network to subscriber terminals for waveform generation in response thereto. - View Dependent Claims (13)
-
-
14. A communication terminal device, comprising:
-
a data interface for receiving data from a communication network; a programmable central processing unit for processing the received data to capture speech synthesizer instructions contained in the received data; a memory storing a vocabulary comprising a plurality of fundamental sound samples, in digitized form; and a concatenative speech synthesizer responsive to the instructions, for processing samples from the memory in an order specified by the instructions and to control parameters of a waveform signal synthesized from the processed samples in a manner specified in the instructions. - View Dependent Claims (15, 16, 17, 18, 26)
-
-
19. A method of providing personalized information services, comprising:
-
storing subscriber profiles relating to topics of interest to a plurality of individual subscribers; receiving items of information from a plurality of sources; comparing the items of information to the subscriber profiles to identify items of interest to particular subscribers; converting textual information relating to at least some of the identified items of interest to sequences of speech synthesizer instructions; transmitting each of the sequences of instructions to one or more terminals, each terminal being utilized by a subscriber; storing received sequences of instructions in respective subscriber terminals; in response to one of the sequences of instructions, retrieving sound samples from a memory comprising a vocabulary in a subscriber terminal in an order specified by the one sequence of instructions and adjusting process parameters for the retrieved samples in a manner specified by the one sequence of instructions, to thereby generate a speech waveform signal representative of one of the identified items of interest. - View Dependent Claims (20, 21, 22, 23, 24, 25)
-
-
27. A system comprising:
-
a server coupled to a data communication network, said server being programmed to execute sequences of program instructions for; (a) obtaining textual information for forming messages for a plurality of subscribers, (b) performing a significant portion of a text to speech process to convert the textual information of at least one of the messages to speech synthesizer instructions in the form of MIDI (Musical Instrument Digital Interface) commands, and (c) transmitting the speech synthesizer instructions over the data communication network; and a subscriber terminal for receiving the speech synthesizer instructions via the data communication network, said subscriber terminal comprising a speech synthesizer and a vocabulary of speech sounds for synthesizing a speech waveform signal representing the at least one message from the speech synthesizer instructions.
-
Specification