METHOD, APPARATUS AND PROGRAM FOR SPEECH SYNTHESIS
1 Assignment
0 Petitions
Accused Products
Abstract
Apparatus and method for generating high quality synthesized speech having smooth waveform concatenation. The apparatus includes a pitch frequency calculation section, a pitch synchronization position calculation section, a unit waveform storage, a unit waveform selection section, a unit waveform generation section, and a waveform synthesis section. The unit waveform generation section includes a conversion ratio calculation section, a sampling rate conversion section, and a unit waveform re-selection section. The conversion ratio calculation section calculates a sampling rate conversion ratio from the pitch information and the position of pitch synchronization, and the sampling rate conversion section converts the sampling rate of the unit waveform, delivered as input, based on the sampling rate conversion ratio. The unit waveform re-selection section selects, from the sampling-rate-converted unit waveform, the unit waveform having a phase necessary to obtain a synthesized speech waveform which will exhibit smooth waveform concatenation.
-
Citations
59 Claims
-
1-34. -34. (canceled)
-
35. A speech synthesis apparatus for concatenating a plurality of unit waveforms to generate synthesized speech, said apparatus comprising:
-
a conversion section that converts sampling rate of said unit waveform; a decimation section that decimates the unit waveform that undergoes the conversion of the sampling rate to the sampling rate of a synthesized speech; and a waveform synthesis section that generates the synthesized speech using the decimated unit waveform; wherein said conversion section changes the conversion ratio of the sampling rate based on input prosodic information. - View Dependent Claims (36, 37)
-
-
38. A speech synthesis apparatus comprising:
-
a plurality of compressed unit waveform storages which store a plurality of compressed unit waveforms in association with conversion ratio of the sampling rate; a compressed unit waveform storage selection section that selects one of said compressed unit waveform storages, based on input prosodic information; a compressed unit waveform selection section that selects the compressed unit waveform from the selected one of said compressed unit waveform storage, based on said prosodic information and phonological information; a unit waveform decompression section that decompresses said compressed unit waveform to obtain the unit waveform, based on identification information of the selected compressed unit waveform storage; and a waveform synthesis section that generates the synthesized speech based on said prosodic information and the decompressed unit waveform. - View Dependent Claims (39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49)
-
-
50. A speech synthesis method for concatenating a plurality of unit waveforms to generate synthesized speech;
- said method comprising;
a step of performing conversion that increases sampling rate of said unit waveform; a step of decimating the unit waveform that undergoes the conversion of the sampling rate to the sampling rate of a synthesized speech; and a step of generating the synthesized speech using the decimated unit waveform; wherein said step of performing conversion changes the conversion ratio of the sampling rate based on input prosodic information. - View Dependent Claims (51, 52)
- said method comprising;
-
53. A speech synthesis method comprising:
-
a step of generating a plurality of compressed unit waveforms from a unit waveform storage in which unit waveforms are stored, and storing said compressed unit waveforms in a plurality of compressed unit waveform storages; a step of selecting one of said compressed unit waveform storages, based on the prosodic information; a step of selecting a compressed unit waveform, from the compressed unit waveform storage selected, based on the prosodic information and the phonological information; a step of decompressing the compressed unit waveform, based on the identification information of said unit waveform storage selected, to derive a unit waveform; and a step of generating the synthesized speech from said prosodic information and the decompressed unit waveform. - View Dependent Claims (54)
-
-
55. A program causing a computer, constituting a speech synthesis apparatus, to execute the processing of concatenating unit waveforms to generate a synthesized speech;
- wherein said program executes;
the processing of performing conversion that increases sampling rate of said unit waveform and changes the conversion ratio of the sampling rate based on input prosodic information; the processing of decimating the unit waveform that undergoes the conversion of the sampling rate to the sampling rate of a synthesized speech; and the processing of generating the synthesized speech using the decimated unit waveform. - View Dependent Claims (56, 57)
- wherein said program executes;
-
58. A program causing a computer, constituting a speech synthesis apparatus, to execute:
-
the processing of generating a plurality of compressed unit waveforms from a unit waveform storage in which unit waveforms are stored, and storing said compressed unit waveforms in a plurality of compressed unit waveform storages; the processing of selecting, based on the prosodic information, one of said compressed unit waveform storages; the processing of selecting a compressed unit waveform, from the compressed unit waveform storage selected, based on prosodic information and phonological information; the processing of decompressing the compressed unit waveform, based on the identification information of said unit waveform storage selected, to derive a unit waveform; and the processing of generating the synthesized speech from said prosodic information and the decompressed unit waveform. - View Dependent Claims (59)
-
Specification