Apparatus for synthesizing speech by varying pitch
Abstract
The pitch of synthesized speech signals is varied by separating the speech signals into a spectral component and an excitation component. The latter is multiplied by a series of overlapping window functions synchronous, in the case of voiced speech, with pitch timing mark information corresponding at least approximately to instants of vocal excitation, to separate it into windowed speech segments which are added together again after the application of a controllable time-shift. The spectral and excitation components are then recombined. The multiplication employs at least two windows per pitch period, each having a duration of less than one pitch period. Alternatively each window has a duration of less than twice the pitch period between timing marks and is asymmetric about the timing mark.
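As an illustration of the windowing and overlap-add steps described in the abstract, the following Python sketch splits an excitation signal into windowed segments, two per pitch period, and adds them back together after a per-segment time shift. The function names, the trapezoidal window shape, the placement of window boundaries at each timing mark and at the midpoint between marks, and the `taper` parameter are all assumptions made for this example; the text above does not prescribe them.

```python
def window_weight(n, lo, hi, taper):
    """Trapezoidal window (assumed shape): ramps up around `lo`, flat, ramps down around `hi`."""
    rise = (n - lo + taper) / (2 * taper)
    fall = (hi + taper - n) / (2 * taper)
    return max(0.0, min(1.0, rise, fall))


def split_into_segments(excitation, marks, taper):
    """Split the excitation into windowed segments, two windows per pitch period.

    `marks` are sample indices of the pitch timing marks. Window boundaries are
    placed at every mark and at the midpoint between consecutive marks, so each
    window spans about half a period plus the overlap ramps, which stays below
    one pitch period as long as `taper` is under a quarter of the period.
    """
    bounds = []
    for a, b in zip(marks, marks[1:]):
        bounds += [a, (a + b) // 2]
    bounds.append(marks[-1])

    segments = []
    for lo, hi in zip(bounds, bounds[1:]):
        start = max(0, lo - taper)
        stop = min(len(excitation), hi + taper + 1)
        samples = [excitation[n] * window_weight(n, lo, hi, taper)
                   for n in range(start, stop)]
        segments.append((start, samples))
    return segments


def overlap_add(segments, shifts, length):
    """Add the segments together after moving each one by its own shift (in samples)."""
    out = [0.0] * length
    for (start, samples), shift in zip(segments, shifts):
        for i, value in enumerate(samples):
            n = start + shift + i
            if 0 <= n < length:
                out[n] += value
    return out


if __name__ == "__main__":
    excitation = [float((n * 7) % 11) for n in range(80)]
    marks = [10, 30, 50, 70]          # pitch period of 20 samples
    segments = split_into_segments(excitation, marks, taper=3)
    # Illustrative shifts only: later segments are pulled progressively earlier.
    shifted = overlap_add(segments, [0, 0, -2, -2, -4, -4], len(excitation))
    print(len(segments), len(shifted))
```

With all shifts set to zero, the overlapping ramps of neighbouring windows sum to one, so the segments add back to the original excitation between the first and last ramps. How the shifts are chosen for a given target pitch is left to the caller in this sketch.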
21 Claims
1. A speech synthesis apparatus including means controllable to vary a pitch of speech signals synthesized thereby, having:
(i) means for separating the speech signals into a spectral component and an excitation component;
(ii) means for multiplying the excitation component by a series of overlapping window functions synchronous, in the case of voiced speech, with pitch timing mark information corresponding at least approximately to instants of vocal excitation, to separate it into windowed segments;
(iii) means to apply a controllable time-shift to the segments and add the time-shifted segments together; and
(iv) means for recombining the spectral and excitation components;
wherein the multiplying means employs at least two windows per pitch period, each having a duration of less than one pitch period.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 12, 14, 15, 16
11. A speech synthesis apparatus including means controllable to vary a pitch of speech signals synthesized thereby, having:
(i) means for separating the speech signals into a spectral component and an excitation component;
(ii) means for controlling pitch of the excitation component by repeating or omitting pitch periods thereof and, respectively, temporally compressing or expanding said component by interpolating new signal samples from input signal samples; and
(iii) means for recombining the spectral and excitation components.
Dependent claims: 13
17. A speech synthesis apparatus including means for controlling a pitch of an input signal by multiplying the signal by a series of overlapping windows to separate it into segments and recombining the segments after subjecting the segments to a time shift, the windows being synchronous with timing marks representing instants of peak vocal excitation, wherein each window has a duration of less than twice a pitch period between timing marks and is asymmetric about the timing mark.
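The pitch control recited in element (ii) of claim 11 can be pictured with a short Python sketch: pitch periods of the excitation are repeated (or omitted), and the result is then temporally compressed (or expanded) back to the original length by interpolating new samples. The function names, the rule for choosing which periods to repeat or omit, and the use of linear interpolation are assumptions made for illustration only; the claim says only that new samples are interpolated from input samples.

```python
def resample_linear(samples, new_length):
    """Compress or expand `samples` to `new_length` by linear interpolation (assumed method)."""
    if len(samples) < 2 or new_length < 2:
        raise ValueError("need at least two input and two output samples")
    step = (len(samples) - 1) / (new_length - 1)
    out = []
    for i in range(new_length):
        pos = i * step
        k = min(int(pos), len(samples) - 2)   # index of the left neighbour
        frac = pos - k
        out.append(samples[k] * (1.0 - frac) + samples[k + 1] * frac)
    return out


def change_pitch(excitation, marks, ratio):
    """Scale the pitch of the excitation by `ratio` while keeping its duration.

    ratio > 1: pitch periods are repeated, then the signal is compressed.
    ratio < 1: pitch periods are omitted, then the signal is expanded.
    `marks` are sample indices delimiting the pitch periods.
    """
    periods = [excitation[a:b] for a, b in zip(marks, marks[1:])]
    target = max(1, round(len(periods) * ratio))
    # Hypothetical selection rule: map each output period back to a source period.
    chosen = [periods[min(int(i / ratio), len(periods) - 1)] for i in range(target)]
    joined = [x for period in chosen for x in period]
    original_length = marks[-1] - marks[0]
    return resample_linear(joined, original_length)


if __name__ == "__main__":
    marks = [0, 20, 40, 60, 80]
    # One unit pulse at the start of each 20-sample pitch period.
    excitation = [1.0 if n % 20 == 0 else 0.0 for n in range(80)]
    raised = change_pitch(excitation, marks, 2.0)    # periods repeated, then compressed
    lowered = change_pitch(excitation, marks, 0.5)   # periods omitted, then expanded
    print(len(raised), len(lowered))
```

In this sketch a ratio of 1 leaves the excitation unchanged, since no period is repeated or omitted and the interpolation positions fall exactly on the input samples.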
Specification