Speech synthesis with weighted parameters at phoneme boundaries
First Claim
1. A method of speech synthesis comprising the steps of:
- determining a set of control parameters required for the control of the synthesis of the speech;
storing said control parameters in either a matrix or as a sequence list of each polyphone;
defining a behavior of a given control parameter with respect to a time period around each phoneme boundary;
weighting each of said matrix or sequence list by an individual weight function;
forming a weighted mean value for joining polyphones by multiplication by a cosine function;
joining polyphones by use of said weighted mean values which are defined by associating two matrices or sequence lists;
matching a duration of each phoneme to a neighboring polyphone by quantizing the duration for one parameter sampling interval; and
synthesizing a speech signal from said phonemes.
2 Assignments
0 Petitions
Accused Products
Abstract
The invention relates to a method and an arrangement for speech synthesis and provides an automatic mechanism for simulating human speech. The method provides a number of control parameters for controlling a speech synthesis device. The invention solves the problem of coarticulation by using an interpolation mechanism. The control parameters are stored in a matrix or a sequence list for each polyphone. The behaviour of the respective parameter with time is defined around each phoneme boundary and polyphones are joined by forming a weighted mean value of the curves which are defined by their two associated matrices/sequences list. The invention also provides an arrangement for carrying out the method.
18 Citations
2 Claims
-
1. A method of speech synthesis comprising the steps of:
-
determining a set of control parameters required for the control of the synthesis of the speech; storing said control parameters in either a matrix or as a sequence list of each polyphone; defining a behavior of a given control parameter with respect to a time period around each phoneme boundary; weighting each of said matrix or sequence list by an individual weight function; forming a weighted mean value for joining polyphones by multiplication by a cosine function; joining polyphones by use of said weighted mean values which are defined by associating two matrices or sequence lists; matching a duration of each phoneme to a neighboring polyphone by quantizing the duration for one parameter sampling interval; and synthesizing a speech signal from said phonemes. - View Dependent Claims (2)
-
Specification