Voice synthesis apparatus using a plurality of phonetic piece data
First Claim
1. An apparatus for synthesizing a voice signal using a plurality of phonetic piece data each indicating a phonetic piece which contains at least two phoneme sections corresponding to different phonemes, the apparatus comprising;
- a voice synthesis processor configured to produce a voice signal for output of a sound wave from a sound output unit, wherein the voice synthesis processor includesa phonetic piece adjustment part that forms a target section from a first phonetic piece and a second phonetic piece so as to connect the first phonetic piece and the second phonetic piece to each other such that the target section is formed of a rear phoneme section of the first phonetic piece corresponding to a consonant phoneme and a front phoneme section of the second phonetic piece corresponding to the consonant phoneme, and that carries out an expansion process for expanding the target section by a target time length to form an adjustment section such that a central part of the target section is expanded at an expansion rate higher than that of a front part and a rear part of the target section, to thereby create synthesized phonetic piece data of the adjustment section having the target time length and corresponding to the consonant phoneme; and
a voice synthesis part that creates a voice signal from the synthesized phonetic piece data created by the phonetic piece adjustment part,wherein the phonetic piece adjustment part carries out the expansion process in case that the consonant phoneme of the target section belongs to one type including fricative sound and semivowel sound, and carries out another expansion process in case that the consonant phoneme of the target section belongs to another type including plosive sound, affricate sound, nasal sound and liquid sound for inserting an intermediate section between the rear phoneme section of the first phonetic piece and the front phoneme section of the second phonetic piece in the target section.
1 Assignment
0 Petitions
Accused Products
Abstract
A voice signal is synthesized using a plurality of phonetic piece data each indicating a phonetic piece containing at least two phoneme sections corresponding to different phonemes. In the apparatus, a phonetic piece adjustor forms a target section from first and second phonetic pieces so as to connect the first and second phonetic pieces to each other such that the target section includes a rear phoneme section of the first piece and a front phoneme section of the second piece, and expands the target section by a target time length to form an adjustment section such that a central part is expanded at an expansion rate higher than that of front and rear parts of the target section, to thereby create synthesized phonetic piece data having the target time length. A voice synthesizer creates a voice signal from the synthesized phonetic piece data.
25 Citations
6 Claims
-
1. An apparatus for synthesizing a voice signal using a plurality of phonetic piece data each indicating a phonetic piece which contains at least two phoneme sections corresponding to different phonemes, the apparatus comprising;
-
a voice synthesis processor configured to produce a voice signal for output of a sound wave from a sound output unit, wherein the voice synthesis processor includes a phonetic piece adjustment part that forms a target section from a first phonetic piece and a second phonetic piece so as to connect the first phonetic piece and the second phonetic piece to each other such that the target section is formed of a rear phoneme section of the first phonetic piece corresponding to a consonant phoneme and a front phoneme section of the second phonetic piece corresponding to the consonant phoneme, and that carries out an expansion process for expanding the target section by a target time length to form an adjustment section such that a central part of the target section is expanded at an expansion rate higher than that of a front part and a rear part of the target section, to thereby create synthesized phonetic piece data of the adjustment section having the target time length and corresponding to the consonant phoneme; and a voice synthesis part that creates a voice signal from the synthesized phonetic piece data created by the phonetic piece adjustment part, wherein the phonetic piece adjustment part carries out the expansion process in case that the consonant phoneme of the target section belongs to one type including fricative sound and semivowel sound, and carries out another expansion process in case that the consonant phoneme of the target section belongs to another type including plosive sound, affricate sound, nasal sound and liquid sound for inserting an intermediate section between the rear phoneme section of the first phonetic piece and the front phoneme section of the second phonetic piece in the target section. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A method of synthesizing a voice signal using a processor configured to process a plurality of phonetic piece data each indicating a phonetic piece which contains at least two phoneme sections corresponding to different phonemes and outputting the voice signal in the form of a sound wave from a sound output unit, the method comprising the acts of;
-
forming using the processor a target section from a first phonetic piece and a second phonetic piece so as to connect the first phonetic piece and the second phonetic piece to each other such that the target section is formed of a rear phoneme section of the first phonetic piece corresponding to a consonant phoneme and a front phoneme section of the second phonetic piece corresponding to the consonant phoneme; carrying out an expansion process for expanding the target section by a target time length to form an adjustment section such that a central part of the target section is expanded at an expansion rate higher than that of a front part and a rear part of the target section, to thereby create synthesized phonetic piece data of the adjustment section having the target time length and corresponding to the consonant phoneme; creating a voice signal from the synthesized phonetic piece data created by the phonetic piece adjustment part, and forwarding the voice signal to a sound output unit for generating a sound wave corresponding to the voice signal, wherein the phonetic piece data comprises a plurality of unit data corresponding to a plurality of frames arranged on a time axis, wherein in case that the target section corresponds to an consonant phoneme of the target section belongs to one type including fricative sound and semivowel sound, and carries out another expansion process in case that the consonant phoneme of the target section belongs to another type including plosive sound, affricate sound, nasal sound and liquid sound for inserting an intermediate section between the rear phoneme section of the first phonetic piece and the front phoneme section of the second phonetic piece in the target section, and wherein velocity, at which each frame in the target section corresponding to each frame in the adjustment section is changed according to passage of time in the adjustment section, is decreased from a front part to a central point of the adjustment section and increased from the central point to a rear part of the adjustment section.
-
Specification