SPEECH SYNTHESIZING DEVICE, COMPUTER PROGRAM PRODUCT, AND METHOD

US 20100250254A1
Filed: 09/15/2009
Published: 09/30/2010
Est. Priority Date: 03/25/2009
Status: Active Grant

First Claim

Patent Images

1. A speech synthesizing device comprising:

an acquiring unit configured to acquire a plurality of pattern sentences, which are similar to one another and each include a fixed segment and a non-fixed segment, and a substitution word, the fixed segment is not to be replaced with any other word, the non-fixed segment is to be replaced with another word, the substitution word is substituted for the non-fixed segment;

a sentence generating unit configured to generate a plurality of target sentences by replacing the non-fixed segment with the substitution word for each of the pattern sentences;

a first synthetic-sound generating unit configured to generate a first synthetic sound, which is a synthetic sound of the fixed segment, for each of the target sentences;

a second synthetic-sound generating unit configured to generate a second synthetic sound, which is a synthetic sound of the substitution word, for each of the target sentences;

a calculating unit configured to calculate a discontinuity value of a boundary between the first synthetic sound and the second synthetic sound, for each of the target sentences;

a selecting unit configured to select one of the target sentences having the smallest discontinuity value from the target sentences; and

a connecting unit configured to connect the first synthetic sound and the second synthetic sound of the target sentence selected.

View all claims

4 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

An acquiring unit acquires pattern sentences, which are similar to one another and include fixed segments and non-fixed segments, and substitution words that are substituted for the non-fixed segments. A sentence generating unit generates target sentences by replacing the non-fixed segments with the substitution words for each of the pattern sentences. A first synthetic-sound generating unit generates a first synthetic sound, a synthetic sound of the fixed segment, and a second synthetic-sound generating unit generates a second synthetic sound, a synthetic sound of the substitution word, for each of the target sentences. A calculating unit calculates a discontinuity value of a boundary between the first synthetic sound and the second synthetic sound for each of the target sentences and a selecting unit selects the target sentence having the smallest discontinuity value. A connecting unit connects the first synthetic sound and the second synthetic sound of the target sentence selected.

16 Citations

View as Search Results

9 Claims

1. A speech synthesizing device comprising:
- an acquiring unit configured to acquire a plurality of pattern sentences, which are similar to one another and each include a fixed segment and a non-fixed segment, and a substitution word, the fixed segment is not to be replaced with any other word, the non-fixed segment is to be replaced with another word, the substitution word is substituted for the non-fixed segment;
  
  a sentence generating unit configured to generate a plurality of target sentences by replacing the non-fixed segment with the substitution word for each of the pattern sentences;
  
  a first synthetic-sound generating unit configured to generate a first synthetic sound, which is a synthetic sound of the fixed segment, for each of the target sentences;
  
  a second synthetic-sound generating unit configured to generate a second synthetic sound, which is a synthetic sound of the substitution word, for each of the target sentences;
  
  a calculating unit configured to calculate a discontinuity value of a boundary between the first synthetic sound and the second synthetic sound, for each of the target sentences;
  
  a selecting unit configured to select one of the target sentences having the smallest discontinuity value from the target sentences; and
  
  a connecting unit configured to connect the first synthetic sound and the second synthetic sound of the target sentence selected.
- View Dependent Claims (4, 5)
- - 4. The device according to claim 1, wherein the calculating unit calculates the discontinuity value taking into account at least one of a spectrum distortion, a fundamental frequency distortion, and a phonological co-occurrence distortion at the boundary between the first synthetic sound and the second synthetic sound.
  - 5. The device according to claim 1, wherein the calculating unit calculates the discontinuity value taking into account a weight-assigned discontinuity value that is generated by assigning a weight to a calculated discontinuity value depending on a frequency with which the selecting unit selects the target sentence.

2. A speech synthesizing device comprising:
- an acquiring unit configured to acquire a pattern sentence, which includes a fixed segment that is not to be replaced with any other word and a non-fixed segment that is to be replaced with another word, and a substitution word that is substituted for the non-fixed segment;
  
  a first sentence generating unit configured to generate a target sentence by replacing the non-fixed segment with the substitution word;
  
  a second sentence generating unit configured to generate an alternative target sentence that has a higher similarity to the target sentence than a threshold;
  
  a first synthetic-sound generating unit configured to generate a first synthetic sound, which is a synthetic sound of the fixed segment, for the target sentence and the alternative target sentence;
  
  a second synthetic-sound generating unit configured to generate a second synthetic sound, which is a synthetic sound of the substitution word, for the target sentence and the alternative target sentence;
  
  a calculating unit configured to calculate a discontinuity value of a boundary between the first synthetic sound and the second synthetic sound, for the target sentence and the alternative target sentence;
  
  a selecting unit configured to select the target sentence or the alternative target sentence, whichever has the smaller discontinuity value; and
  
  a connecting unit configured to connect the first synthetic sound and the second synthetic sound of the target sentence or the alternative target sentence that is selected.
- View Dependent Claims (3)
- - 3. The device according to claim 2, wherein the second sentence generating unit generates the alternative target sentence by performing at least one of operations of changing a word order of the pattern sentence, replacing a word of the pattern sentence with a synonym, and replacing a phrase of the pattern sentence with a different phrase, in addition to replacing the non-fixed segment with the substitution word.

6. A computer program product having a computer readable medium including programmed instructions for synthesizing a speech that, when executed by a computer, causes the computer to perform:
- acquiring a plurality of pattern sentences, which are similar to one another and each include a fixed segment and a non-fixed segment, and a substitution word, the fixed segment is not to be replaced with any other word, the non-fixed segment is to be replaced with another word, the substitution word is substituted for the non-fixed segment;
  
  and a substitution word that is substituted for the non-fixed segment;
  
  generating a plurality of target sentences by replacing the non-fixed segment with the substitution word for each of the pattern sentences;
  
  generating a first synthetic sound, which is a synthetic sound of the fixed segment, for each of the target sentences;
  
  generating a second synthetic sound, which is a synthetic sound of the substitution word, for each of the target sentences;
  
  calculating a discontinuity value of a boundary between the first synthetic sound and the second synthetic sound, for each of the target sentences;
  
  selecting one of the target sentences having the smallest discontinuity value from the target sentences; and
  
  connecting the first synthetic sound and the second synthetic sound of the target sentence selected.

7. A computer program product having a computer readable medium including programmed instructions for synthesizing a speech that, when executed by a computer, causes the computer to perform:
- acquiring a pattern sentence, which includes a fixed segment that is not to be replaced with any other word and a non-fixed segment that is to be replaced with another word, and a substitution word that is to be substituted for the non-fixed segment;
  
  acquiring a pattern sentence, which includes a fixed segment that is not to be replaced with any other word and a non-fixed segment that is to be replaced with another word, and a substitution word that is to be substituted for the non-fixed segment;
  
  generating a target sentence by replacing the non-fixed segment with the substitution word;
  
  generating an alternative target sentence having a higher similarity to the target sentence than a threshold;
  
  generating a first synthetic sound, which is a synthetic sound of the fixed segment, for the target sentence and the alternative target sentence;
  
  generating a second synthetic sound, which is a synthetic sound of the substitution word, for the target sentence and the alternative target sentence;
  
  calculating a discontinuity value of a boundary between the first synthetic sound and the second synthetic sound, for the target sentence and the alternative target sentence;
  
  selecting the target sentence or the alternative target sentence, whichever has the smaller discontinuity value; and
  
  connecting the first synthetic sound and the second synthetic sound of the target sentence or the alternative target sentence that is selected.

8. A speech synthesizing method comprising:
- acquiring a plurality of pattern sentences, which are similar to one another and each include a fixed segment and a non-fixed segment, and a substitution word, the fixed segment is not to be replaced with any other word, the non-fixed segment is to be replaced with another word, the substitution word is substituted for the non-fixed segment;
  
  generating a plurality of target sentences by replacing the non-fixed segment with the substitution word for each of the pattern sentences;
  
  generating a first synthetic sound, which is a synthetic sound of the fixed segment, for each of the target sentences;
  
  generating a second synthetic sound, which is a synthetic sound of the substitution word, for each of the target sentences;
  
  calculating a discontinuity value of a boundary between the first synthetic sound and the second synthetic sound, for each of the target sentences;
  
  selecting one of the target sentences having the smallest discontinuity value from the target sentences; and
  
  connecting the first synthetic sound and the second synthetic sound of the target sentence selected.

9. A speech synthesizing method comprising:
- acquiring a pattern sentence, which includes a fixed segment that is not to be replaced with any other word and a non-fixed segment that is to be replaced with another word, and a substitution word that is to be substituted for the non-fixed segment;
  
  generating a target sentence by replacing the non-fixed segment with the substitution word;
  
  generating an alternative target sentence having a higher similarity to the target sentence than a threshold;
  
  generating a first synthetic sound, which is a synthetic sound of the fixed segment, for the target sentence and the alternative target sentence;
  
  generating a second synthetic sound, which is a synthetic sound of the substitution word, for the target sentence and the alternative target sentence;
  
  calculating a discontinuity value of a boundary between the first synthetic sound and the second synthetic sound, for the target sentence and the alternative target sentence;
  
  selecting the target sentence or the alternative target sentence, whichever has the smaller discontinuity value; and
  
  connecting the first synthetic sound and the second synthetic sound of the target sentence or the alternative target sentence that is selected.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Kabushiki Kaisha Toshiba (Toshiba Corporation), Toshiba Digital Solutions Corporation (Toshiba Corporation)
Original Assignee
Kabushiki Kaisha Toshiba (Toshiba Corporation)
Inventors
Mizutani, Nobuaki

Granted Patent

US 8,626,510 B2
Time in Patent Office

Days
Field of Search
US Class Current

704/260
CPC Class Codes

G10L 13/07 Concatenation rules

G10L 13/08 Text analysis or generation...

SPEECH SYNTHESIZING DEVICE, COMPUTER PROGRAM PRODUCT, AND METHOD

First Claim

4 Assignments

0 Petitions

Accused Products

Abstract

16 Citations

9 Claims

Specification

Solutions

Use Cases

Quick Links

SPEECH SYNTHESIZING DEVICE, COMPUTER PROGRAM PRODUCT, AND METHOD

First Claim

4 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

16 Citations

9 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links