TEXT PRE-PROCESSING FOR TEXT-TO-SPEECH GENERATION
First Claim
1. A system for pre-processing text for text-to-speech (TTS) generation, comprising:
- a first memory adapted to store a text information database;
a second memory adapted to store grammar rules;
a receiver adapted to receive update data regarding the grammar rules and relay the received update data to the second memory;
an audio output device; and
a TTS engine operatively coupled to the first and second memories, the receiver, and the audio output device, the TTS engine being adapted to;
retrieve at least one text entry from the text information database;
apply the updated grammar rules to the at least one text entry, and thereby pre-process the at least one text entry;
generate speech based at least in part on the least one pre-processed text entry; and
send the generated speech to the audio output device;
wherein the audio output device plays the generated speech.
1 Assignment
0 Petitions
Accused Products
Abstract
A system and method are provided for improved speech synthesis, wherein text data is pre-processed according to updated grammar rules or a selected group of grammar rules. In one embodiment, the TTS system comprises a first memory adapted to store a text information database, a second memory adapted to store grammar rules, and a receiver adapted to receive update data regarding the grammar rules. The system also includes a TTS engine adapted to retrieve at least one text entry from the text information database, pre-process the at least one text entry by applying the updated grammar rules to the at least one text entry, and generate speech based at least in part on the least one pre-processed text entry.
-
Citations
20 Claims
-
1. A system for pre-processing text for text-to-speech (TTS) generation, comprising:
-
a first memory adapted to store a text information database; a second memory adapted to store grammar rules; a receiver adapted to receive update data regarding the grammar rules and relay the received update data to the second memory; an audio output device; and a TTS engine operatively coupled to the first and second memories, the receiver, and the audio output device, the TTS engine being adapted to; retrieve at least one text entry from the text information database; apply the updated grammar rules to the at least one text entry, and thereby pre-process the at least one text entry; generate speech based at least in part on the least one pre-processed text entry; and send the generated speech to the audio output device; wherein the audio output device plays the generated speech. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A system for pre-processing text for text-to-speech (TTS) generation, comprising:
-
a memory adapted to store a text information database and grammar rules; a receiver to receive a request for the TTS generation; an audio output device; and a TTS engine operatively coupled to the memory, the receiver, and the audio output device, the TTS engine being adapted to; retrieve at least one text entry from the text information database according to the received request; retrieve a subset of rules from the grammar rules according to the received request; apply the retrieved rules to the at least one text entry, and thereby pre-process the at least one text entry; generate speech based at least in part on the least one pre-processed text entry; and send the generated speech to the audio output device; wherein the audio output device plays the generated speech in response to the received request for the TTS generation. - View Dependent Claims (10, 11, 12, 13, 14)
-
-
15. A method for pre-processing text for a text-to-speech (TTS) engine according to grammar rules, comprising:
-
receiving update data regarding the grammar rules; updating the grammar rules according to the received update data; receiving a request for TTS generation; retrieving at least one text entry from a text information database; applying the updated grammar rules to the at least one text entry to pre-process the at least one text entry; and providing an audio output with TTS phonetics based at least in part on the at least one pre-processed text entry. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification