Speech synthesizing method and apparatus for combining natural speech segments and synthesized speech segments
First Claim
1. A speech synthesizing method characterized by:
- storing natural speech segments prepared by cutting out prerecorded speech waveforms in each specific syllable chain, by a natural speech segment memory unit,storing speech segments which have been previously prepared bydividing N-dimensional space S, N being a positive integer, built up by a parameter vector P composed of N parameters into M regions AO to AM-1, M being a positive integer, and generates a parameter vector Pi corresponding to a desired position in a region Ai for all integers i changing from 0 to M-1, andgenerating a synthesized waveform according to the parameter vector Pi, andsynthesizing speech while connecting the natural speech segments and synthesized speech segments, in a connection synthesis unit.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and apparatus for synthesizing speech. According to one variation of the method and apparatus, a plurality of speech segment data units is prepared for all desired speech waveforms. Speech is then synthesized by reading out from memory the appropriate speech segment data units, and a desired pitch is obtained by overlapping the appropriate speech segment data units according to a pitch period interval. According to a second variation of the method and apparatus, speech segment data units are prepared for only initial speech waveforms and first pitch waveforms, and differential waveforms. With this variation, subsequent pitch waveforms for speech synthesis are generated by combining the first pitch waveform with the corresponding differential waveform. According to a third variation of the method and apparatus, a natural speech segment channel produces natural speech segment data units in the same manner as the first variation, and a synthesized speech segment channel produces speech segment data units according to a parameter method, such as a formant method. The natural speech segments and synthesized speech segments are then mixed to produce synthesized speech.
68 Citations
8 Claims
-
1. A speech synthesizing method characterized by:
-
storing natural speech segments prepared by cutting out prerecorded speech waveforms in each specific syllable chain, by a natural speech segment memory unit, storing speech segments which have been previously prepared by dividing N-dimensional space S, N being a positive integer, built up by a parameter vector P composed of N parameters into M regions AO to AM-1, M being a positive integer, and generates a parameter vector Pi corresponding to a desired position in a region Ai for all integers i changing from 0 to M-1, and generating a synthesized waveform according to the parameter vector Pi, and synthesizing speech while connecting the natural speech segments and synthesized speech segments, in a connection synthesis unit. - View Dependent Claims (2, 3, 4)
-
-
5. A speech synthesizing apparatus comprising a synthesized speech segment memory unit for storing natural speech segments prepared by cutting out prerecorded speech waveforms in each specific syllable chain,
a natural speech segment memory unit for storing speech segments prepared by the speech segment preparing method of claim 23, and a connection synthesis unit for synthesizing speech while connecting the natural speech segments and synthesized speech segments.
Specification