Speech synthesizing method achieved by the segmentation of the linear Formant transition region
First Claim
1. A method for synthesizing speech through a synthesizer system including a personal computer (PC), a PC interface, a speech synthesizer, a digital-to-analog (D/A) converter, a key-board, a memory, and a speaker, the method comprising the steps of:
- (a) segmenting linear Formant information, corresponding to phoneme information, into linear Formant transition region segments;
(b) storing Formant frequency information and Formant bandwidth information for points of transition between consecutive ones of the linear Formant transition region segments of step (a), and lengths of the linear Formant transition region segments established by the segmenting in step (a), into a data base in a memory, for each phoneme information;
(c) inputting information subsequent to the storing in step (b), the input information designating speech sound to be synthesized;
(d) reading out stored Formant frequency information, Formant bandwidth information and length of the linear Formant transition region segments corresponding to the input information of step (c), from the data base stored in the memory;
(e) calculating a digital Formant contour, by linearly interpolating between the read out Formant frequency information and Formant bandwidth information corresponding to first and second consecutive points of transition corresponding to one of the linear Formant transition region segments of step (d), the interpolating being calculated over the read out length of the first linear Formant transition region segment;
(f) filtering the digital Formant contour, through a plurality of bandpass filters classified by a characteristic Formant, to produce a digital speech signal representative of a filtered glottal pulse; and
(g) converting the digital speech signal representative of the filtered glottal pulse into an analog speech signal through the D/A converter and outputting the analog speech signal.
0 Assignments
0 Petitions
Accused Products
Abstract
A way of a synthesizing speech by the combination of a Speech coding mode and Formant analysis mode is achieved by segmenting a Formant transition region into portions, according to the linear characteristics of a frequency curve, and storing the Formant information of each portion. Therefrom frequency information of a sound is obtained. Formant information data of a Formant contour to produce speech, is calculated by a linear interpolation method. The frequency and the bandwidth, which are elements of the Formant contour calculated by a linear interpolation method, are sequentially filtered in order to produce a speech signal which is a digital speech signal. The digital speech signal is converted to an analog signal, amplified, and output through a external speaker.
11 Citations
16 Claims
-
1. A method for synthesizing speech through a synthesizer system including a personal computer (PC), a PC interface, a speech synthesizer, a digital-to-analog (D/A) converter, a key-board, a memory, and a speaker, the method comprising the steps of:
-
(a) segmenting linear Formant information, corresponding to phoneme information, into linear Formant transition region segments; (b) storing Formant frequency information and Formant bandwidth information for points of transition between consecutive ones of the linear Formant transition region segments of step (a), and lengths of the linear Formant transition region segments established by the segmenting in step (a), into a data base in a memory, for each phoneme information; (c) inputting information subsequent to the storing in step (b), the input information designating speech sound to be synthesized; (d) reading out stored Formant frequency information, Formant bandwidth information and length of the linear Formant transition region segments corresponding to the input information of step (c), from the data base stored in the memory; (e) calculating a digital Formant contour, by linearly interpolating between the read out Formant frequency information and Formant bandwidth information corresponding to first and second consecutive points of transition corresponding to one of the linear Formant transition region segments of step (d), the interpolating being calculated over the read out length of the first linear Formant transition region segment; (f) filtering the digital Formant contour, through a plurality of bandpass filters classified by a characteristic Formant, to produce a digital speech signal representative of a filtered glottal pulse; and (g) converting the digital speech signal representative of the filtered glottal pulse into an analog speech signal through the D/A converter and outputting the analog speech signal. - View Dependent Claims (2, 3)
-
-
4. A method of processing speech, comprising the steps of:
-
(a) segmenting a speech frequency signal at points of transition into a plurality of time segments, each segment having a time length and each point of transition including at least one Formant of the speech frequency signal; (b) storing, for each Formant at each point of transition, one Formant frequency information and one bandwidth information; and (c) storing, for each segment, time length information corresponding to the time length of the segment obtained in said step (a). - View Dependent Claims (5, 6, 7, 8, 9)
-
-
10. A method of synthesizing speech, comprising the steps of:
-
(a) storing Formant information data for each of a plurality of Formants of a speech frequency signal, the Formant information data characterizing discrete points of transition between consecutive time segments of the speech frequency signal, the Formant information data including, for each point of transition, a single Formant frequency information and a single bandwidth information; (b) reading, for a first Formant, the stored Formant frequency information for a first point of transition and for a second point of transition; and (c) interpolating a plurality of frequency values between the read Formant frequency information of the first point of transition and the read Formant frequency information of the second point of transition. - View Dependent Claims (11, 12, 13, 14, 15, 16)
-
Specification