Methods for generating the voiced portion of speech signals
First Claim
1. A method for generating the voiced portion of a speech signal of the type generated by synthesis from voiced harmonics, the method comprising the steps of:
- receiving a signal containing information on a plurality of voiced harmonics, including information on first and second groups of said voiced harmonics;
- generating said first group of voiced harmonics using a time domain synthesis method;
- generating said second group of voiced harmonics using a frequency domain synthesis method; and
- combining said generated first and second groups of voiced harmonics to produce said voiced portion of a speech signal.
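The steps of this claim can be illustrated with a minimal sketch. It is not the patented implementation: it assumes a single stationary frame, a fundamental that falls exactly on a DFT bin, and a hypothetical `split` index that decides which harmonics form the first (time domain) group. All function and parameter names are illustrative.

```python
import cmath
import math


def synth_time_domain(harmonics, w0, n_samples):
    """Sum one cosine oscillator per harmonic; harmonics are (k, amplitude, phase)."""
    return [
        sum(a * math.cos(k * w0 * n + ph) for k, a, ph in harmonics)
        for n in range(n_samples)
    ]


def synth_freq_domain(harmonics, bins_per_harmonic, n_samples):
    """Place every harmonic in a spectrum, then apply a single inverse DFT."""
    spectrum = [0j] * n_samples
    for k, a, ph in harmonics:
        b = k * bins_per_harmonic  # assumed to stay below n_samples / 2
        # a*cos(x) = (a/2)*e^{jx} + (a/2)*e^{-jx}: fill the bin and its mirror.
        spectrum[b] += 0.5 * a * cmath.exp(1j * ph)
        spectrum[-b] += 0.5 * a * cmath.exp(-1j * ph)
    # Naive O(N^2) inverse DFT (no 1/N factor, amplitudes were placed directly).
    return [
        sum(
            X * cmath.exp(2j * math.pi * b * n / n_samples)
            for b, X in enumerate(spectrum)
        ).real
        for n in range(n_samples)
    ]


def synth_hybrid(harmonics, bins_per_harmonic, n_samples, split):
    """Harmonics with k <= split go to the time domain, the rest to the DFT path."""
    w0 = 2 * math.pi * bins_per_harmonic / n_samples
    first_group = [h for h in harmonics if h[0] <= split]
    second_group = [h for h in harmonics if h[0] > split]
    td = synth_time_domain(first_group, w0, n_samples)
    fd = synth_freq_domain(second_group, bins_per_harmonic, n_samples)
    # Combining step: the voiced output is the sample-wise sum of both groups.
    return [x + y for x, y in zip(td, fd)]
```

Under these assumptions the hybrid output matches a pure oscillator-bank synthesis to within rounding error, while only `split` oscillators are evaluated per sample. A real synthesizer would also have to handle parameters that change between frames, which this sketch ignores.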
Abstract
The pitch estimation method is improved. Sub-integer resolution pitch values are estimated in making the initial pitch estimate; the sub-integer pitch values are preferably estimated by interpolating intermediate variables between integer values. Pitch regions are used to reduce the amount of computation required in making the initial pitch estimate. Pitch-dependent resolution is used in making the initial pitch estimate, with higher resolution being used for smaller values of pitch. The accuracy of the voiced/unvoiced decision is improved by making the decision dependent on the energy of the current segment relative to the energy of recent prior segments; if the relative energy is low, the current segment favors an unvoiced decision; if high, it favors a voiced decision. Voiced harmonics are generated using a hybrid approach; some voiced harmonics are generated in the time domain, whereas the remaining harmonics are generated in the frequency domain; this preserves much of the computational savings of the frequency domain approach, while at the same time improving speech quality. Voiced harmonics generated in the frequency domain are generated with higher frequency accuracy; the harmonics are frequency scaled, transformed into the time domain with a Discrete Fourier Transform, interpolated and then time scaled.
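The energy-dependent voiced/unvoiced decision described in the abstract can be sketched as a threshold that moves with the relative energy of the current segment. The abstract does not give a formula, so everything specific here is an assumption made for illustration: the `periodicity` score in [0, 1], the exponential smoothing of recent energy, and the constants `base_threshold` and `slope_per_db`.

```python
import math


def smooth_energy(recent_energy, frame_energy, decay=0.9):
    """Running average standing in for 'energy of recent prior segments' (assumed form)."""
    return decay * recent_energy + (1.0 - decay) * frame_energy


def is_voiced(periodicity, frame_energy, recent_energy,
              base_threshold=0.5, slope_per_db=0.01):
    """Return True when the segment is declared voiced.

    periodicity: hypothetical score in [0, 1], higher means more periodic.
    Low relative energy raises the threshold (favoring unvoiced);
    high relative energy lowers it (favoring voiced).
    """
    eps = 1e-12  # guards against division by zero and log of zero
    rel_db = 10.0 * math.log10((frame_energy + eps) / (recent_energy + eps))
    threshold = base_threshold - slope_per_db * rel_db
    threshold = min(0.9, max(0.1, threshold))  # keep the threshold in a sane range
    return periodicity > threshold
```

With these illustrative constants, a segment 20 dB below the recent average needs a periodicity score above 0.7 to be declared voiced, while one 20 dB above it needs only 0.3.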
69 Citations
9 Claims
7. A method for generating the voiced portion of a speech signal of the type generated by synthesis from voiced harmonics, the method comprising the steps of:
- receiving a signal containing information on a plurality of voiced harmonics;
- linearly frequency scaling said information on said voiced harmonics according to the mapping ω₀ → 2π/L, where L is some small integer, to generate frequency-scaled harmonics;
- performing an L-point Inverse Discrete Fourier Transform (DFT) to simultaneously transform said frequency scaled harmonics into the time domain;
- performing interpolation and time scaling to generate said plurality of voiced harmonics; and
- combining said voiced harmonics to produce said voiced portion of a speech signal.
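The frequency scaling, inverse transform, and interpolation steps of claim 7 can be sketched as a table lookup. Mapping ω₀ to 2π/L puts harmonic k exactly on bin k of an L-point transform, so one inverse DFT yields one full period of the voiced waveform; reading that period at a fractional step of ω₀·L/(2π) table samples per output sample restores the true pitch. This is a sketch under stated assumptions, not the patent's procedure: linear interpolation, a single stationary frame, and all names are illustrative choices.

```python
import cmath
import math


def one_period_via_idft(harmonics, L):
    """Frequency-scale harmonic k onto bin k and run an L-point inverse DFT.

    harmonics: list of (k, amplitude, phase) with k < L / 2.
    Returns L samples spanning exactly one pitch period.
    """
    spectrum = [0j] * L
    for k, a, ph in harmonics:
        spectrum[k] += 0.5 * a * cmath.exp(1j * ph)
        spectrum[-k] += 0.5 * a * cmath.exp(-1j * ph)
    # Naive inverse DFT; a real system would use an FFT here.
    return [
        sum(X * cmath.exp(2j * math.pi * b * m / L) for b, X in enumerate(spectrum)).real
        for m in range(L)
    ]


def synth_by_time_scaling(harmonics, w0, L, n_samples):
    """Interpolate the one-period table at the rate implied by the true w0."""
    table = one_period_via_idft(harmonics, L)
    step = w0 * L / (2 * math.pi)  # table samples advanced per output sample
    out = []
    for n in range(n_samples):
        pos = (n * step) % L       # wrap around: the table is periodic
        i = int(pos)
        frac = pos - i
        # Linear interpolation between neighboring table entries (assumed scheme).
        out.append((1.0 - frac) * table[i] + frac * table[(i + 1) % L])
    return out
```

Because the interpolation is only linear, the result approximates direct oscillator synthesis; the error shrinks as L grows relative to the highest harmonic number, which is the trade-off behind choosing L.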
Specification