Synthesizing speech by converting phonemes to digital waveforms
First Claim
1. A method of converting an input signal representing a text in phonemes into an output digital waveform signal convertible into an acoustic synthesized speech waveform corresponding to said text, wherein said method makes use of a two-part database having an access section based on strings of phonemes and an output section containing digital waveforms corresponding to the linked access sections, wherein said method comprises:
- matching a segment of said input signal to select the best match of strings contained in the access section, said best match including an exact match for at least one internal phoneme, and discarding at least the first and last phonemes of said best match to identify a shorter string of phonemes which is an exact match for a portion of said input signal.
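The match-and-discard step of the claim can be illustrated with a minimal Python sketch. It is not the patented implementation. The function names `best_match` and `trim_context`, the scoring rule (count of position-wise equal phonemes), and the toy phoneme symbols are all assumptions made for illustration. The sketch also simplifies the claim by requiring every internal phoneme to match exactly, whereas the claim requires only at least one.

```python
def best_match(segment, database):
    """Return (start, score) for the best-matching database window, or None.

    Hypothetical scoring: the number of positions where phonemes are equal.
    Only windows whose internal phonemes (all but the first and last) equal
    those of the segment are considered, a simplification of the claim.
    """
    n = len(segment)
    best = None
    for start in range(len(database) - n + 1):
        window = database[start:start + n]
        if window[1:-1] != segment[1:-1]:
            continue  # internal phonemes must match exactly
        score = sum(a == b for a, b in zip(window, segment))
        if best is None or score > best[1]:
            best = (start, score)  # ties keep the earliest window
    return best


def trim_context(start, length):
    """Discard the first and last phonemes of a match.

    Returns (begin, end) database indices of the interior, end exclusive.
    """
    return start + 1, start + length - 1


# Toy access section: one long phoneme string (symbols are illustrative only).
database = "dh ax k ae t s ae t aa n".split()
segment = ["b", "ae", "t"]

match = best_match(segment, database)
if match is not None:
    begin, end = trim_context(match[0], len(segment))
    shorter = database[begin:end]  # exact match for a portion of the input
```

The outer phonemes act only as context for choosing the window. After they are discarded, the remaining string matches the corresponding part of the input exactly.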
Abstract
Synthetic speech is generated by production of a digital waveform from a text in phonemes. A linked database is used which comprises an extended text in phonemes and its equivalent in the form of a digital waveform. The two portions of the database are linked by a parameter which establishes equivalent points in both the phoneme text and the digital waveform. The input text (in phonemes) is analyzed to locate a matching portion in the phoneme portion of the database. This matching utilizes exact equivalence of phonemes where this is possible; otherwise a relation between phonemes is utilized. The selection process identifies input phonemes in context, whereby improved conversions are obtained. Having analyzed the input text into matching strings in the input form of the database, beginning and ending parameters for the sections are established. The output is produced by abutting sections of the digital waveform defined by the beginning and ending parameters.
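The abutting step described in the abstract can be sketched as follows, under stated assumptions. The linking parameter is modeled here as a hypothetical `boundaries` table that gives the sample offset at which each database phoneme begins. The abstract does not specify this layout, and the function name `abut_sections` and the toy data are illustrative only.

```python
def abut_sections(waveform, boundaries, spans):
    """Concatenate the waveform sections selected by phoneme-index spans.

    boundaries[i] is the assumed sample offset where database phoneme i
    begins; one extra final entry marks the end of the last phoneme.
    spans is a list of (begin, end) phoneme indices, end exclusive.
    """
    output = []
    for begin, end in spans:
        # Map phoneme indices to sample offsets via the linking table.
        output.extend(waveform[boundaries[begin]:boundaries[end]])
    return output


# Toy data: 100 samples covering 5 phonemes (values are placeholders).
waveform = list(range(100))
boundaries = [0, 10, 25, 40, 60, 100]

# Two matched sections: phoneme 1 alone, then phonemes 3 and 4.
samples = abut_sections(waveform, boundaries, [(1, 2), (3, 5)])
```

Each span corresponds to the beginning and ending parameters of one matched section, and the output is simply those sections placed end to end.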
3 Claims
Specification