Voice synthesis apparatus using a plurality of phonetic piece data

US 9,230,537 B2
Filed: 05/31/2012
Issued: 01/05/2016
Est. Priority Date: 06/01/2011
Status: Active Grant

Abstract

A voice signal is synthesized using a plurality of phonetic piece data each indicating a phonetic piece containing at least two phoneme sections corresponding to different phonemes. In the apparatus, a phonetic piece adjustor forms a target section from first and second phonetic pieces so as to connect the first and second phonetic pieces to each other such that the target section includes a rear phoneme section of the first piece and a front phoneme section of the second piece, and expands the target section by a target time length to form an adjustment section such that a central part is expanded at an expansion rate higher than that of front and rear parts of the target section, to thereby create synthesized phonetic piece data having the target time length. A voice synthesizer creates a voice signal from the synthesized phonetic piece data.

25 Citations

6 Claims

1. An apparatus for synthesizing a voice signal using a plurality of phonetic piece data each indicating a phonetic piece which contains at least two phoneme sections corresponding to different phonemes, the apparatus comprising:
- a voice synthesis processor configured to produce a voice signal for output of a sound wave from a sound output unit, wherein the voice synthesis processor includes:
  - a phonetic piece adjustment part that forms a target section from a first phonetic piece and a second phonetic piece so as to connect the first phonetic piece and the second phonetic piece to each other such that the target section is formed of a rear phoneme section of the first phonetic piece corresponding to a consonant phoneme and a front phoneme section of the second phonetic piece corresponding to the consonant phoneme, and that carries out an expansion process for expanding the target section by a target time length to form an adjustment section such that a central part of the target section is expanded at an expansion rate higher than that of a front part and a rear part of the target section, to thereby create synthesized phonetic piece data of the adjustment section having the target time length and corresponding to the consonant phoneme; and
  - a voice synthesis part that creates a voice signal from the synthesized phonetic piece data created by the phonetic piece adjustment part,

  wherein the phonetic piece adjustment part carries out the expansion process in case that the consonant phoneme of the target section belongs to one type including fricative sound and semivowel sound, and carries out another expansion process in case that the consonant phoneme of the target section belongs to another type including plosive sound, affricate sound, nasal sound and liquid sound for inserting an intermediate section between the rear phoneme section of the first phonetic piece and the front phoneme section of the second phonetic piece in the target section.
- Dependent claims (2, 3, 4, 5)
  - 2. The apparatus according to claim 1, wherein the phonetic piece adjustment part inserts a silence section as the intermediate section between the rear phoneme section of the first phonetic piece and the front phoneme section of the second phonetic piece in case that the consonant phoneme of the target section is plosive sound or affricate sound.
  - 3. The apparatus according to claim 1, wherein the phonetic piece adjustment part inserts the intermediate section containing repetition of a frame selected from the rear phoneme section of the first phonetic piece or the front phoneme section of the second phonetic piece in case that the consonant phoneme of the target section is nasal sound or liquid sound.
  - 4. The apparatus according to claim 3, wherein the phonetic piece adjustment part inserts the intermediate section containing repetition of the last frame of the rear phoneme section of the first phonetic piece.
  - 5. The apparatus according to claim 3, wherein the phonetic piece adjustment part inserts the intermediate section containing repetition of the top frame of the front phoneme section of the second phonetic piece.
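
The intermediate-section insertion described in claims 1 to 5 can be illustrated with a minimal Python sketch. It is not taken from the patent: the function name `insert_intermediate`, the `Frame` type, the string labels for consonant types, the all-zero representation of a silence frame, and the reading of the target time length as a count of extra frames are all assumptions made for illustration.

```python
from typing import List, Sequence

# Hypothetical per-frame unit data, e.g. a short list of spectral values.
Frame = List[float]

# Consonant types that, per claims 2 and 3, receive an inserted section.
INSERT_SILENCE = {"plosive", "affricate"}
REPEAT_FRAME = {"nasal", "liquid"}


def insert_intermediate(rear: Sequence[Frame], front: Sequence[Frame],
                        consonant_type: str, extra_frames: int,
                        use_last_of_rear: bool = True) -> List[Frame]:
    """Lengthen rear + front by inserting `extra_frames` frames between them."""
    if not rear or not front:
        raise ValueError("both phoneme sections must contain at least one frame")
    if extra_frames < 0:
        raise ValueError("extra_frames must be non-negative")

    if consonant_type in INSERT_SILENCE:
        # Claim 2: a silence section (assumed here to be all-zero frames).
        middle = [[0.0] * len(rear[0]) for _ in range(extra_frames)]
    elif consonant_type in REPEAT_FRAME:
        # Claims 3 to 5: repeat the last frame of the rear section
        # or the top frame of the front section.
        chosen = rear[-1] if use_last_of_rear else front[0]
        middle = [list(chosen) for _ in range(extra_frames)]
    else:
        # Fricatives and semivowels use the non-uniform expansion instead.
        raise ValueError(f"no intermediate section for type {consonant_type!r}")

    return [list(f) for f in rear] + middle + [list(f) for f in front]
```

Calling `insert_intermediate(rear, front, "nasal", 2)` would, under these assumptions, return the rear frames, two copies of the last rear frame, and then the front frames.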

6. A method of synthesizing a voice signal using a processor configured to process a plurality of phonetic piece data each indicating a phonetic piece which contains at least two phoneme sections corresponding to different phonemes and outputting the voice signal in the form of a sound wave from a sound output unit, the method comprising the acts of:
- forming using the processor a target section from a first phonetic piece and a second phonetic piece so as to connect the first phonetic piece and the second phonetic piece to each other such that the target section is formed of a rear phoneme section of the first phonetic piece corresponding to a consonant phoneme and a front phoneme section of the second phonetic piece corresponding to the consonant phoneme;
- carrying out an expansion process for expanding the target section by a target time length to form an adjustment section such that a central part of the target section is expanded at an expansion rate higher than that of a front part and a rear part of the target section, to thereby create synthesized phonetic piece data of the adjustment section having the target time length and corresponding to the consonant phoneme;
- creating a voice signal from the synthesized phonetic piece data created by the phonetic piece adjustment part, and
- forwarding the voice signal to a sound output unit for generating a sound wave corresponding to the voice signal,

  wherein the phonetic piece data comprises a plurality of unit data corresponding to a plurality of frames arranged on a time axis,

  wherein in case that the target section corresponds to a consonant phoneme of the target section belongs to one type including fricative sound and semivowel sound, and carries out another expansion process in case that the consonant phoneme of the target section belongs to another type including plosive sound, affricate sound, nasal sound and liquid sound for inserting an intermediate section between the rear phoneme section of the first phonetic piece and the front phoneme section of the second phonetic piece in the target section, and

  wherein velocity, at which each frame in the target section corresponding to each frame in the adjustment section is changed according to passage of time in the adjustment section, is decreased from a front part to a central point of the adjustment section and increased from the central point to a rear part of the adjustment section.
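
The velocity condition in claim 6 can be made concrete with a short Python sketch that maps each frame of the adjustment section back to a position in the target section. The cosine-shaped velocity profile, the `depth` parameter, the nearest-frame selection, and the function names `source_positions` and `expand` are illustrative assumptions, not the patent's own formulation. The only property carried over from the claim is that the read-out velocity falls toward the central point and rises again after it, so the central part is stretched more than the front and rear parts.

```python
import math
from typing import List, Sequence


def source_positions(n_in: int, n_out: int, depth: float = 0.8) -> List[float]:
    """Position in the target section (n_in frames) read by each of n_out output frames.

    Assumed warp: f(t) = t + depth * sin(2*pi*t) / (2*pi), whose slope
    1 + depth * cos(2*pi*t) is largest at both ends and smallest at t = 0.5.
    """
    if n_in < 2 or n_out < 2:
        raise ValueError("need at least two frames on each side")
    if not 0.0 <= depth < 1.0:
        raise ValueError("depth must be in [0, 1) to keep the mapping monotonic")
    positions = []
    for k in range(n_out):
        t = k / (n_out - 1)  # normalized time in the adjustment section
        warped = t + depth * math.sin(2.0 * math.pi * t) / (2.0 * math.pi)
        positions.append((n_in - 1) * warped)
    return positions


def expand(frames: Sequence, n_out: int, depth: float = 0.8) -> list:
    """Build an adjustment section of n_out frames by nearest-frame selection."""
    last = len(frames) - 1
    return [frames[min(last, int(round(p)))]
            for p in source_positions(len(frames), n_out, depth)]
```

With `depth = 0` the mapping reduces to uniform stretching. Larger values concentrate the added duration around the center of the section, while the first and last frames of the target section are still reproduced at the ends.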

Current Assignee: Yamaha Corporation
Original Assignee: Yamaha Corporation
Inventors: Saino, Keijiro
Primary Examiner(s): Desir, Pierre-Louis
Assistant Examiner(s): Kovacek, David M
Application Number: US13/485,303
Publication Number: US 20120310651A1
Time in Patent Office: 1,314 Days
Field of Search: 704/200-201, 704/258-269, 704/270, 704/278, 704/500-501, 704/E19.001-E19.049, 704/E13.001-E13.014
US Class Current: 1/1

CPC Class Codes

G10L 13/033   Voice editing, e.g. manipul...

G10L 13/07   Concatenation rules

G10L 21/049   characterised by the interc...

G10L 25/93   Discriminating between voic...
