Supporting a concatenative text-to-speech synthesis
First Claim
1. A method of generating a speech database as a basis for a concatenative text-to-speech synthesis, said method comprising:
- performing a speech processing including a segmental parametric speech encoding of speech data based on a parametric modeling of speech and resulting in compressed parameterized speech segments; and
assembling said compressed parameterized speech segments in a speech database.
2 Assignments
0 Petitions
Accused Products
Abstract
The invention relates to a support of a concatenative TTS synthesis. In order to generate a speech database as a basis for the TTS synthesis, first, a speech processing including a segmental parametric speech encoding of speech data based on a parametric modeling of speech is performed, which results in compressed parameterized speech segments. Then, the compressed parameterized speech segments are assembled in a speech database. In order to synthesize output speech, compressed parameterized speech segments are selected from the speech database based on an available text and decompressed to regain parameterized speech segments. The parameterized speech segments are then concatenated in a parameter domain. The output speech is synthesized based on these concatenated parametric speech segments.
-
Citations
25 Claims
-
1. A method of generating a speech database as a basis for a concatenative text-to-speech synthesis, said method comprising:
-
performing a speech processing including a segmental parametric speech encoding of speech data based on a parametric modeling of speech and resulting in compressed parameterized speech segments; and
assembling said compressed parameterized speech segments in a speech database. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A database generator for generating a speech database as a basis for a concatenative text-to-speech synthesis, said database generator comprising:
-
processing means adapted to perform a speech processing including a segmental parametric speech encoding of speech data based on a parametric modeling of speech and resulting in compressed parameterized speech segments; and
processing means adapted to assemble said compressed parameterized speech segments in a speech database. - View Dependent Claims (10)
-
-
11. A software program product in which a software code for generating a speech database as a basis for a concatenative text-to-speech synthesis is stored, said software code realizing the following steps when being executed in a processing unit of an electronic device:
-
performing a speech processing including a segmental parametric speech encoding of speech data based on a parametric modeling of speech and resulting in compressed parameterized speech segments; and
assembling said compressed parameterized speech segments in a speech database.
-
-
12. A method enabling a concatenative text-to-speech synthesis based on a speech database comprising compressed parameterized speech segments obtained in a speech processing, said speech processing including a segmental parametric speech encoding of speech data using a parametric modeling of speech, said method comprising:
-
selecting compressed parameterized speech segments from said speech database based on an available text;
decompressing said selected compressed parameterized speech segments to regain parameterized speech segments;
concatenating said parameterized speech segments in a parameter domain; and
synthesizing output speech based on said concatenated parametric speech segments. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20, 21)
-
-
22. A text-to-speech synthesizer enabling a concatenative text-to-speech synthesis based on a speech database, said text-to-speech synthesizer comprising:
-
a memory storing a speech database comprising compressed parameterized speech segments obtained in a speech processing, said speech processing including a segmental parametric speech encoding of speech data using a parametric modeling of speech;
processing means adapted to select compressed parameterized speech segments from said speech database based on an available text;
processing means adapted to decompress said selected compressed parameterized speech segments to regain parameterized speech segments;
processing means adapted to concatenate said parameterized speech segments in a parameter domain; and
processing means adapted to synthesize output speech based on said concatenated parametric speech segments. - View Dependent Claims (23, 25)
-
-
24. A software program product in which a software code is stored on a readable medium, the software code for enabling a concatenative text-to-speech synthesis based on a speech database comprising compressed parameterized speech segments obtained in a speech processing, said speech processing including a segmental parametric speech encoding of speech data using a parametric modeling of speech, said software code realizing the following steps being executed in a processing unit of an electronic device:
-
selecting compressed parameterized speech segments from said speech database based on an available text;
decompressing said selected compressed parameterized speech segments to regain parameterized speech segments;
concatenating said parameterized speech segments in a parameter domain; and
synthesizing output speech based on said concatenated parametric speech segments.
-
Specification