Synthesising speech by converting phonemes to digital waveforms
First Claim
1. A database for use as a component of a speech engine said database comprising:
- an output section containing an extended digital waveform,an access section containing signals representing said extended digital waveform in phonemes, anda common address parameter identifying common points in both sections, whereby identification of a segment of the signals representing said extended digital waveform in phonemes in the access section establishes beginning and ending values for the parameter and hence identifies the corresponding segment of the extended digital waveform.
0 Assignments
0 Petitions
Accused Products
Abstract
Synthetic speech is generated by production of a digital waveform from a text in phonemes. A linked database is used which comprises an extended text in phonemes and its equivalent in the form of a digital waveform. The two portions of the database are linked by a parameter which establishes equivalent points in both the phoneme text and the digital waveform. The input text (in phonemes) is analyzed to locate a matching portion in the phoneme portion of the database. This matching utilizes exact equivalence of phonemes where this is possible; otherwise relation between phonemes is utilized. The selection process identifies input phonemes in context whereby improved conversions are obtained. Having analyzed the input exit into matching strings in the input form of the database beginning and ending parameters for the sections are established. The output text is produced by abutting sections of the digital waveform and defined by the beginning and ending parameters.
21 Citations
5 Claims
-
1. A database for use as a component of a speech engine said database comprising:
-
an output section containing an extended digital waveform, an access section containing signals representing said extended digital waveform in phonemes, and a common address parameter identifying common points in both sections, whereby identification of a segment of the signals representing said extended digital waveform in phonemes in the access section establishes beginning and ending values for the parameter and hence identifies the corresponding segment of the extended digital waveform. - View Dependent Claims (2, 3)
-
-
4. A database for use as a component of a speech engine said database comprising:
-
an output section containing an extended digital waveform, an access section containing signals representing said extended digital waveform in phonemes, and a common address parameter identifying common points in both sections. whereby identification of a segment of the signals representing said extended digital waveform in phonemes in the access section establishes beginning and ending values for the parameter and hence identifies the corresponding segment of the extended digital waveform; the access portion contains windows of five phoneme length, said access section having a hierarchical higher level accessed by the center phoneme of a window to identify the second and fourth phonemes of a windows whereby entries in the higher hierarchical level are equivalent to strings of three phonemes, and said access portion also comprises a lower hierarchical level accessed by a string of three phonemes to identify the first and fifth phonemes whereby entries in the lower hierarchical level are equivalent to strings of five phonemes. - View Dependent Claims (5)
-
Specification