Singing voice synthesizing apparatus, singing voice synthesizing method, and program for realizing singing voice synthesizing method
First Claim
1. A singing voice synthesizing apparatus comprising:
- a phoneme database that stores a plurality of voice fragment data formed of voice fragments each being a single phoneme or a phoneme chain of at least two concatenated phonemes, each of the plurality of voice fragment data comprising data of a deterministic component and data of a stochastic component;
an input device that inputs lyrics;
a readout device that reads out from said phoneme database the voice fragment data corresponding to the inputted lyrics;
a duration time adjusting device that adjusts time duration of the read-out voice fragment data so as to match a desired tempo and manner of singing;
an adjusting device that adjusts the deterministic component and the stochastic component of the read-out voice fragment so as to match a desired pitch; and
a synthesizing device that synthesizes a singing sound by sequentially concatenating the voice fragment data that have been adjusted by said duration time adjusting device and said adjusting device.
1 Assignment
0 Petitions
Accused Products
Abstract
A singing voice synthesizing apparatus is provided, which enables achievement of a natural sounding synthesized singing voice with a good level of comprehensibility. A phoneme database stores a plurality of voice fragment data formed of voice fragments each being a single phoneme or a phoneme chain of at least two concatenated phonemes, each of the plurality of voice fragment data comprising data of a deterministic component and data of a stochastic component. A readout device that reads out from the phoneme database the voice fragment data corresponding to inputted lyrics. A duration time adjusting device adjusts time duration of the read-out voice fragment data so as to match a desired tempo and manner of singing. An adjusting device adjusts the deterministic component and the stochastic component of the read-out voice fragment so as to match a desired pitch. A synthesizing device synthesizes a singing sound by sequentially concatenating the voice fragment data that have been adjusted by the duration time adjusting device and the adjusting device.
-
Citations
17 Claims
-
1. A singing voice synthesizing apparatus comprising:
-
a phoneme database that stores a plurality of voice fragment data formed of voice fragments each being a single phoneme or a phoneme chain of at least two concatenated phonemes, each of the plurality of voice fragment data comprising data of a deterministic component and data of a stochastic component;
an input device that inputs lyrics;
a readout device that reads out from said phoneme database the voice fragment data corresponding to the inputted lyrics;
a duration time adjusting device that adjusts time duration of the read-out voice fragment data so as to match a desired tempo and manner of singing;
an adjusting device that adjusts the deterministic component and the stochastic component of the read-out voice fragment so as to match a desired pitch; and
a synthesizing device that synthesizes a singing sound by sequentially concatenating the voice fragment data that have been adjusted by said duration time adjusting device and said adjusting device. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A singing voice synthesizing method comprising the steps of:
-
storing in a phoneme database a plurality of voice fragment data formed of voice fragments each being a single phoneme or a phoneme chain of at least two concatenated phonemes, each of said plurality of voice fragment data comprising data of a deterministic component and data of a stochastic component;
reading out from said phoneme database the voice fragment data corresponding to lyrics inputted by an input device;
adjusting time duration of the read-out voice fragment data so as to match a desired tempo and manner of singing;
adjusting the deterministic component and the stochastic component of the read-out voice fragment so as to match a desired pitch; and
synthesizing a singing sound by sequentially concatenating the voice fragment data that have been adjusted in respect of the time duration and the deterministic component and the stochastic component thereof.
-
-
16. A program for causing a computer to execute a singing voice synthesizing method comprising the steps of:
-
storing in a phoneme database a plurality of voice fragment data formed of voice fragments each being a single phoneme or a phoneme chain of at least two concatenated phonemes, each of said plurality of voice fragment data comprising data of a deterministic component and data of a stochastic component;
reading out from said phoneme database the voice fragment data corresponding to lyrics inputted by an input device;
adjusting time duration of the read-out voice fragment data so as to match a desired tempo and manner of singing;
adjusting the deterministic component and the stochastic component of the read-out voice fragment so as to match a desired pitch; and
synthesizing a singing sound by sequentially concatenating the voice fragment data that have been adjusted in respect of the time duration and the deterministic component and the stochastic component thereof.
-
-
17. A mechanically readable storage medium storing instructions for causing a machine to execute a singing voice synthesizing method comprising the steps of:
-
storing in a phoneme database a plurality of voice fragment data formed of voice fragments each being a single phoneme or a phoneme chain of at least two concatenated phonemes, each of said plurality of voice fragment data comprising data of a deterministic component and data of a stochastic component;
reading out from said phoneme database the voice fragment data corresponding to lyrics inputted by an input device;
adjusting time duration of the read-out voice fragment data so as to match a desired tempo and manner of singing;
adjusting the deterministic component and the stochastic component of the read-out voice fragment so as to match a desired pitch; and
synthesizing a singing sound by sequentially concatenating the voice fragment data that have been adjusted in respect of the time duration and the deterministic component and the stochastic component thereof.
-
Specification