Allophonic text-to-speech generator
Abstract
The allophonic text-to-speech generator (ATTG) 10 includes a CPU 100. The CPU has a random access memory 102 and a read only memory 104 for holding the operating system, application programs, and data for the CPU 100. A keyboard 110 provides a user with control over the CPU 100. A database 130 holds phonetic transcripts of words. Such databases are well-known in the field of telephone directory assistance. A second database 140 maps allophonic text to parse and pre-recorded allophones. The CPU 100 converts a phonetic transcript of a word into an allophonic text string in accordance with a rules program 120. Then the CPU 100 extracts the audio allophone files of the allophonic string and concatenates the audio files to form the new word in the same voice as the other words formed from the allophones in database 140.
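The flow described in the abstract can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the patented implementation: the dictionary contents, the phoneme and allophone symbols, the single aspiration rule, and the names `PHONETIC_DB`, `AUDIO_DB`, `to_allophones`, and `synthesize` are all hypothetical stand-ins for database 130, database 140, and rules program 120.

```python
# Hypothetical stand-in for database 130: word -> phonetic transcript.
PHONETIC_DB = {
    "pat": ["p", "ae", "t"],
    "spat": ["s", "p", "ae", "t"],
}

# Hypothetical stand-in for database 140: allophonic text -> recorded audio.
# Short byte strings take the place of real recordings.
AUDIO_DB = {
    "ph": b"\x01\x01",  # assumed symbol for an aspirated p
    "p": b"\x02",
    "s": b"\x03",
    "ae": b"\x04\x04",
    "t": b"\x05",
}


def to_allophones(phonemes):
    """Toy rules program: aspirate a word-initial p, leave all else unchanged."""
    allophones = []
    for index, phoneme in enumerate(phonemes):
        if phoneme == "p" and index == 0:
            allophones.append("ph")
        else:
            allophones.append(phoneme)
    return allophones


def synthesize(word):
    """Look up the transcript, convert it to allophones, and join the audio."""
    phonemes = PHONETIC_DB[word]
    allophones = to_allophones(phonemes)
    return b"".join(AUDIO_DB[a] for a in allophones)
```

Because every word is assembled from the same stored allophone recordings, a newly built word comes out in the same voice as the others, which is the property the abstract emphasizes.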
6 Claims
1. A text processor for a text-to-speech synthesizer comprising:
a computer including a central processing unit having random access memory and read only memory for holding an operating system program and one or more application programs;
a phonetic text database for storing phonetic transcriptions corresponding to phonemes;
means for accessing the phonetic database to retrieve phonetic text characters corresponding to a desired word;
program means for converting the phonetic text characters into allophonic text characters to generate a string of allophonic text characters corresponding to the desired word;
an audio database comprising pre-recorded allophones stored in accordance with the allophonic text representative of each of said allophones;
means for extracting from the audio database the allophonic audio signals that correspond to the string of allophonic text in the desired word; and
means for concatenating the allophonic audio signals together to generate a new audio file corresponding to the desired word.
4. A method for building speech from text with a computer including a central processing unit having random access memory and read only memory for holding an operating system program and one or more application programs, comprising the steps of:
inputting phonetic text characters corresponding to a desired spoken word;
mapping the phonetic text characters to allophonic text characters to generate a string of allophonic text characters;
providing a file of prerecorded audio signals comprising allophonic audio signals corresponding to the allophonic text characters;
extracting from the file of prerecorded audio signals the allophonic audio signals that correspond to the string of allophonic text characters; and
concatenating the allophonic audio signals together and generating an output audio signal representative of the input orthographic text.
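The final step of the method, concatenating the extracted allophonic audio signals into one output signal, might look like the following sketch. It assumes each allophone is held as an in-memory WAV clip and uses Python's standard `wave` module; the helper names `make_clip` and `concatenate_clips`, the 8000 Hz mono 16-bit format, and the constant-valued placeholder samples are assumptions made for illustration and do not come from the patent.

```python
import io
import wave


def make_clip(n_frames, sample_value, rate=8000):
    """Build a tiny mono 16-bit WAV in memory (placeholder for a recording)."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as writer:
        writer.setnchannels(1)
        writer.setsampwidth(2)
        writer.setframerate(rate)
        sample = sample_value.to_bytes(2, "little", signed=True)
        writer.writeframes(sample * n_frames)
    return buf.getvalue()


def concatenate_clips(clips):
    """Join WAV clips end to end; all clips must share one audio format."""
    if not clips:
        raise ValueError("at least one clip is required")
    out = io.BytesIO()
    fmt = None
    with wave.open(out, "wb") as writer:
        for clip in clips:
            with wave.open(io.BytesIO(clip), "rb") as reader:
                this_fmt = (
                    reader.getnchannels(),
                    reader.getsampwidth(),
                    reader.getframerate(),
                )
                if fmt is None:
                    # The first clip fixes the output format.
                    fmt = this_fmt
                    writer.setnchannels(fmt[0])
                    writer.setsampwidth(fmt[1])
                    writer.setframerate(fmt[2])
                elif this_fmt != fmt:
                    raise ValueError("clips must share channels, width, and rate")
                writer.writeframes(reader.readframes(reader.getnframes()))
    return out.getvalue()
```

Plain end-to-end joining is the simplest reading of "concatenating"; the sketch does no smoothing at clip boundaries and rejects clips whose formats differ rather than resampling them.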
Specification