Singing voice-synthesizing method and apparatus and storage medium
First Claim
1. A singing voice-synthesizing method comprising:
- inputting phonetic unit information representative of a phonetic unit, time information representative of a singing-starting time point, and singing length information representative of a singing length, in timing earlier than the singing-starting time point, for a singing phonetic unit including a sequence of a first phoneme and a second phoneme;
generating a phonetic unit transition time length formed by a generation time length of the first phoneme and a generation time length of the second phoneme, based on the inputted phonetic unit information;
determining a singing-starting time point and a singing duration time of the first phoneme and a singing-starting time point and a singing duration time of the second phoneme, based on the generated phonetic unit transition time length, the inputted time information and singing length information; and
starting generation of a first singing voice and a second singing voice formed by the first phoneme and the second phoneme at the singing-starting time point of the first phoneme and the singing-starting time point of the second phoneme, respectively, and continuing generation of the first singing voice and the second singing voice for the singing duration time of the first phoneme and the singing duration time of the second phoneme, respectively.
0 Assignments
0 Petitions
Accused Products
Abstract
There are provided a singing voice-synthesizing method and apparatus capable of performing synthesis of natural singing voices close to human singing voices based on performance data being input in real time. Performance data is inputted for each phonetic unit constituting a lyric, to supply phonetic unit information, singing-starting time point information, singing length information, etc. Each performance data is inputted in timing earlier than the actual singing-starting time point, and a phonetic unit transition time length is generated. By using the phonetic unit transition time, the singing-starting time point information, and the singing length information, the singing-starting time points and singing duration times of the first and second phonemes are determined. In the singing voice synthesis, for each phoneme, a singing voice is generated at the determined singing-starting time point and continues to be generated for the determined singing duration time.
-
Citations
13 Claims
-
1. A singing voice-synthesizing method comprising:
-
inputting phonetic unit information representative of a phonetic unit, time information representative of a singing-starting time point, and singing length information representative of a singing length, in timing earlier than the singing-starting time point, for a singing phonetic unit including a sequence of a first phoneme and a second phoneme;
generating a phonetic unit transition time length formed by a generation time length of the first phoneme and a generation time length of the second phoneme, based on the inputted phonetic unit information;
determining a singing-starting time point and a singing duration time of the first phoneme and a singing-starting time point and a singing duration time of the second phoneme, based on the generated phonetic unit transition time length, the inputted time information and singing length information; and
starting generation of a first singing voice and a second singing voice formed by the first phoneme and the second phoneme at the singing-starting time point of the first phoneme and the singing-starting time point of the second phoneme, respectively, and continuing generation of the first singing voice and the second singing voice for the singing duration time of the first phoneme and the singing duration time of the second phoneme, respectively. - View Dependent Claims (2)
-
-
3. A singing voice-synthesizing apparatus comprising:
-
an input section that inputs phonetic unit information representative of a phonetic unit, time information representative of a singing-starting time point, and singing length information representative of a singing length, in timing earlier than the singing-starting time point, for a phonetic unit including a sequence of a first phoneme and a second phoneme;
a storage section that stores a phonetic unit transition time length formed by a generation time length of the first phoneme and a generation time length of the second phoneme;
a readout section that reads out the phonetic unit transition time length from said storage section based on the phonetic unit information inputted by said input section;
a calculating section that calculates a singing-staring time point and a singing duration time of the first phoneme, and a singing-starting time point and a singing duration time of the second phoneme, based on the phonetic unit transition time length read by said readout section and the time information and the singing length information which have been inputted by said input section; and
a singing voice-synthesizing section that starts generation of a first singing voice and a second singing voice formed by the first phoneme and the second phoneme at the singing-starting time point of the first phoneme and the singing-starting time point of the second phoneme calculated by said calculating section, respectively, and continuing generation of the first singing voice and the second singing voice for the singing duration time of the first phoneme and the singing duration time of the second phoneme calculated by said calculating section, respectively. - View Dependent Claims (4)
-
-
5. (canceled)
-
6. (canceled)
-
7. (canceled)
-
8. (canceled)
-
9. (canceled)
-
10. (canceled)
-
11. A storage medium storing a program for executing a singing voice-synthesizing method, the program comprising:
-
an input module that inputs phonetic unit information representative of a phonetic unit, time information representative of a singing-starting time point, and singing length information representative of a singing length, in timing earlier than the singing-starting time point, for a singing phonetic unit including a sequence of a first phoneme and a second phoneme;
a phonetic unit transition time length-generating module that generates a phonetic unit transition time length formed by a generation time length of the first phoneme and a generation time length of the second phoneme, based on the inputted phonetic unit information;
a determining module that determines a singing-starting time point and a singing duration time of the first phoneme and a singing-starting time point and a singing duration time of the second phoneme, based on the generated phonetic unit transition time length, the inputted time information and singing length information; and
a singing voice-generating module that starts generation of a first singing voice and a second singing voice formed by the first phoneme and the second phoneme at the singing-starting time point of the first phoneme and the singing-starting time point of the second phoneme, respectively, and continuing generation of the first singing voice and the second singing voice for the singing duration time of the first phoneme and the singing duration time of the second phoneme, respectively.
-
-
12. (canceled)
-
13. A program code storage device comprising a storage medium and computer-readable program code, stored on said storage medium, having instructions which when executed cause:
-
inputting phonetic unit information representative of a phonetic unit, time information representative of a singing-starting time point, and singing length information representative of a singing length, in timing earlier than the singing-starting time point, for a singing phonetic unit including a sequence of a first phoneme and a second phoneme;
generating a phonetic unit transition time length formed by a generation time length of the first phoneme and a generation time length of the second phoneme, based on the inputted phonetic unit information;
determining a singing-starting time point and a singing duration time of the first phoneme and a singing-starting time point and a singing duration time of the second phoneme, based on the generated phonetic unit transition time length, the inputted time information and singing length information; and
initiating generation of a first singing voice and a second singing voice formed by the first phoneme and the second phoneme at the singing-starting time point of the first phoneme and the singing-starting time point of the second phoneme, respectively, and continuing generation of the first singing voice and the second singing voice for the singing duration time of the first phoneme and the singing duration time of the second phoneme, respectively.
-
Specification