Singing voice-synthesizing method and apparatus and storage medium

US 20060085196A1
Filed: 12/01/2005
Published: 04/20/2006
Est. Priority Date: 12/28/2000
Status: Active Grant

First Claim

Patent Images

1. A singing voice-synthesizing method comprising:

inputting phonetic unit information representative of a phonetic unit, time information representative of a singing-starting time point, and singing length information representative of a singing length, in timing earlier than the singing-starting time point, for a singing phonetic unit including a sequence of a first phoneme and a second phoneme;

generating a phonetic unit transition time length formed by a generation time length of the first phoneme and a generation time length of the second phoneme, based on the inputted phonetic unit information;

determining a singing-starting time point and a singing duration time of the first phoneme and a singing-starting time point and a singing duration time of the second phoneme, based on the generated phonetic unit transition time length, the inputted time information and singing length information; and

starting generation of a first singing voice and a second singing voice formed by the first phoneme and the second phoneme at the singing-starting time point of the first phoneme and the singing-starting time point of the second phoneme, respectively, and continuing generation of the first singing voice and the second singing voice for the singing duration time of the first phoneme and the singing duration time of the second phoneme, respectively.

View all claims

0 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

There are provided a singing voice-synthesizing method and apparatus capable of performing synthesis of natural singing voices close to human singing voices based on performance data being input in real time. Performance data is inputted for each phonetic unit constituting a lyric, to supply phonetic unit information, singing-starting time point information, singing length information, etc. Each performance data is inputted in timing earlier than the actual singing-starting time point, and a phonetic unit transition time length is generated. By using the phonetic unit transition time, the singing-starting time point information, and the singing length information, the singing-starting time points and singing duration times of the first and second phonemes are determined. In the singing voice synthesis, for each phoneme, a singing voice is generated at the determined singing-starting time point and continues to be generated for the determined singing duration time.

Citations

13 Claims

1. A singing voice-synthesizing method comprising:
- inputting phonetic unit information representative of a phonetic unit, time information representative of a singing-starting time point, and singing length information representative of a singing length, in timing earlier than the singing-starting time point, for a singing phonetic unit including a sequence of a first phoneme and a second phoneme;
  
  generating a phonetic unit transition time length formed by a generation time length of the first phoneme and a generation time length of the second phoneme, based on the inputted phonetic unit information;
  
  determining a singing-starting time point and a singing duration time of the first phoneme and a singing-starting time point and a singing duration time of the second phoneme, based on the generated phonetic unit transition time length, the inputted time information and singing length information; and
  
  starting generation of a first singing voice and a second singing voice formed by the first phoneme and the second phoneme at the singing-starting time point of the first phoneme and the singing-starting time point of the second phoneme, respectively, and continuing generation of the first singing voice and the second singing voice for the singing duration time of the first phoneme and the singing duration time of the second phoneme, respectively.
- View Dependent Claims (2)
- - 2. A singing voice-synthesizing method according to claim 1, wherein the determining includes setting the singing-starting time point of the first phoneme to a time point earlier than the singing-starting time point represented by the time information.

3. A singing voice-synthesizing apparatus comprising:
- an input section that inputs phonetic unit information representative of a phonetic unit, time information representative of a singing-starting time point, and singing length information representative of a singing length, in timing earlier than the singing-starting time point, for a phonetic unit including a sequence of a first phoneme and a second phoneme;
  
  a storage section that stores a phonetic unit transition time length formed by a generation time length of the first phoneme and a generation time length of the second phoneme;
  
  a readout section that reads out the phonetic unit transition time length from said storage section based on the phonetic unit information inputted by said input section;
  
  a calculating section that calculates a singing-staring time point and a singing duration time of the first phoneme, and a singing-starting time point and a singing duration time of the second phoneme, based on the phonetic unit transition time length read by said readout section and the time information and the singing length information which have been inputted by said input section; and
  
  a singing voice-synthesizing section that starts generation of a first singing voice and a second singing voice formed by the first phoneme and the second phoneme at the singing-starting time point of the first phoneme and the singing-starting time point of the second phoneme calculated by said calculating section, respectively, and continuing generation of the first singing voice and the second singing voice for the singing duration time of the first phoneme and the singing duration time of the second phoneme calculated by said calculating section, respectively.
- View Dependent Claims (4)
- - 4. A singing voice-synthesizing apparatus according to claim 3, wherein said input section inputs modifying information for modifying the generation time length of the first phoneme, and wherein said calculating section modifies the generation time length of the first phoneme in the phonetic unit transition time length read by said readout section according to the modifying information inputted by said input section, and then calculates the singing-starting time point and the singing duration time of the first phoneme and the singing-starting time point and the singing duration time of the second phoneme, based on the phonetic unit transition time length including the modified generation time length of the first phoneme.

5. (canceled)

6. (canceled)

7. (canceled)

8. (canceled)

9. (canceled)

10. (canceled)

11. A storage medium storing a program for executing a singing voice-synthesizing method, the program comprising:
- an input module that inputs phonetic unit information representative of a phonetic unit, time information representative of a singing-starting time point, and singing length information representative of a singing length, in timing earlier than the singing-starting time point, for a singing phonetic unit including a sequence of a first phoneme and a second phoneme;
  
  a phonetic unit transition time length-generating module that generates a phonetic unit transition time length formed by a generation time length of the first phoneme and a generation time length of the second phoneme, based on the inputted phonetic unit information;
  
  a determining module that determines a singing-starting time point and a singing duration time of the first phoneme and a singing-starting time point and a singing duration time of the second phoneme, based on the generated phonetic unit transition time length, the inputted time information and singing length information; and
  
  a singing voice-generating module that starts generation of a first singing voice and a second singing voice formed by the first phoneme and the second phoneme at the singing-starting time point of the first phoneme and the singing-starting time point of the second phoneme, respectively, and continuing generation of the first singing voice and the second singing voice for the singing duration time of the first phoneme and the singing duration time of the second phoneme, respectively.

12. (canceled)

13. A program code storage device comprising a storage medium and computer-readable program code, stored on said storage medium, having instructions which when executed cause:
- inputting phonetic unit information representative of a phonetic unit, time information representative of a singing-starting time point, and singing length information representative of a singing length, in timing earlier than the singing-starting time point, for a singing phonetic unit including a sequence of a first phoneme and a second phoneme;
  
  generating a phonetic unit transition time length formed by a generation time length of the first phoneme and a generation time length of the second phoneme, based on the inputted phonetic unit information;
  
  determining a singing-starting time point and a singing duration time of the first phoneme and a singing-starting time point and a singing duration time of the second phoneme, based on the generated phonetic unit transition time length, the inputted time information and singing length information; and
  
  initiating generation of a first singing voice and a second singing voice formed by the first phoneme and the second phoneme at the singing-starting time point of the first phoneme and the singing-starting time point of the second phoneme, respectively, and continuing generation of the first singing voice and the second singing voice for the singing duration time of the first phoneme and the singing duration time of the second phoneme, respectively.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Yamaha Corporation
Original Assignee
Yamaha Corporation
Inventors
Kayama, Hiraku, Celma, Oscar, Ortola, Jaume

Granted Patent

US 7,249,022 B2
Time in Patent Office

Days
Field of Search
US Class Current

704/267
CPC Class Codes

G10H 2210/201   Vibrato, i.e. rapid, repeti...

G10H 2250/455   Gensound singing voices, i....

G10L 13/02   Methods for producing synth...

G10L 13/033   Voice editing, e.g. manipul...

G10L 13/06   Elementary speech units use...

Singing voice-synthesizing method and apparatus and storage medium

First Claim

0 Assignments

0 Petitions

Accused Products

Abstract

Citations

13 Claims

Specification

Solutions

Use Cases

Quick Links

Singing voice-synthesizing method and apparatus and storage medium

First Claim

0 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

13 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links