Speech synthesis method and information providing apparatus

US 20070094029A1
Filed: 05/16/2006
Published: 04/26/2007
Est. Priority Date: 12/28/2004
Status: Abandoned Application

First Claim

Patent Images

1. A speech synthesis method comprising:

predicting a playback duration of synthesized speech to be generated based on text;

judging whether a constraint condition concerning a playback timing of the synthesized speech is satisfied or not, based on the predicted playback duration;

in the case where said judging shows that the constraint condition is not satisfied, shifting a playback starting timing of the synthesized speech of the text forward or backward, and modifying contents indicating time or distance in the text, in accordance with a duration by which the playback starting timing of the synthesized speech is shifted; and

generating synthesized speech based on the text with the modified contents, and playing back the synthesized speech.

View all claims

2 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

To provide a speech synthesis method of reading out units of synthesized speech without fail and in an easy to understand manner, even when playback of the units of synthesized speech are simultaneously requested. The duration prediction unit predicts the playback duration of synthesized speech to be synthesized based on text. The time constraint satisfaction judgment unit judges whether a constraint condition concerning the playback timing of the synthesized speech is satisfied or not, based on the predicted playback duration. If it judged that the constraint condition is not satisfied, the content modification unit shifts the playback starting timing of the synthesized speech of the text forward or backward, and modifies the contents of the text indicating time and distance in accordance with the shifted time. The synthesized speech generation unit generates synthesized speech based on the text having the modified contents and plays it back.

Citations

8 Claims

1. A speech synthesis method comprising:
- predicting a playback duration of synthesized speech to be generated based on text;
  
  judging whether a constraint condition concerning a playback timing of the synthesized speech is satisfied or not, based on the predicted playback duration;
  
  in the case where said judging shows that the constraint condition is not satisfied, shifting a playback starting timing of the synthesized speech of the text forward or backward, and modifying contents indicating time or distance in the text, in accordance with a duration by which the playback starting timing of the synthesized speech is shifted; and
  
  generating synthesized speech based on the text with the modified contents, and playing back the synthesized speech.
- View Dependent Claims (2, 3, 4)
- - 2. The speech synthesis method according to claim 1, wherein:
    - in the case where there are plural units of speech, said predicting includes predicting a playback duration of second synthesized speech, playback of the second synthesized speech needing to be completed before playback of first synthesized speech starts;
      
      said judging includes judging that the constraint condition is not satisfied, in the case where the predicted playback duration of the second synthesized speech indicates that the playback of the second synthesized speech is not completed before the playback of the first synthesized speech starts;
      
      said shifting includes delaying a playback starting timing of the first synthesized speech to a predicted playback completion time of the second synthesized speech, and said modifying includes modifying the contents of text based on which the first synthesized speech is generated, said shifting and modifying being performed in the case where said judging shows that the constraint condition is not satisfied; and
      
      said generating includes generating synthesized speech based on the text with the modified contents and playing back the synthesized speech, after completing the playback of the second synthesized speech.
  - 3. The speech synthesis method according to claim 2, wherein said modifying further includes reducing the playback duration of the second synthesized speech by summarizing the text based on which the second synthesized speech is generated, and delaying the playback starting timing of the first synthesized speech to a time at which the playback of the second synthesized speech with the reduced playback duration is completed.
  - 4. The speech synthesis method according to claim 1, wherein:
    - said predicting includes predicting a playback duration of synthesized speech, the playback of the synthesized speech needing to be completed by a preset time;
      
      said judging includes judging that the constraint condition is not satisfied, in the case where the predicted playback duration of the synthesized speech indicates that the playback of the second synthesized speech is not completed by the preset time;
      
      said shifting includes delaying the playback starting timing of the synthesized speech by a duration starting from the preset time indicated in the text based on which the synthesized speech is generated, and said modifying includes modifying the preset time in accordance with the duration by which the playback starting timing of the synthesized speech is delayed, said shifting and modifying being performed in the case where said judging shows that the constraint condition is not satisfied; and
      
      said generating includes generating synthesized speech based on the text with the modified contents and playing back the synthesized speech.

5. An information providing apparatus comprising:
- a duration prediction unit operable to predict a playback duration of synthesized speech to be generated based on text;
  
  a time constraint satisfaction judgment unit operable to judge whether a constraint condition concerning a playback timing of the synthesized speech is satisfied or not, based on the predicted playback duration;
  
  a content modification unit operable to shift a playback starting timing of the synthesized speech of the text forward or backward, and modify contents indicating time or distance in the text, in accordance with a duration by which the playback starting timing of the synthesized speech is shifted, in the case where said time constraint satisfaction judgment unit judges that the constraint condition is not satisfied; and
  
  a synthesized speech generation unit operable to generate synthesized speech based on the text with the modified contents, and play back the synthesized speech.
- View Dependent Claims (6, 7)
- - 6. The information providing apparatus according to claim 5, wherein:
    - said information providing apparatus is operable to function as a car navigation apparatus which provides a speech guidance concerning a route to a destination;
      
      said information providing apparatus further includes a speed obtainment unit operable to obtain a moving speed of a car;
      
      said duration prediction unit is operable to predict a playback duration of a second synthesized speech, the playback of the second synthesized speech needing to be completed before playback of a first synthesized speech is started;
      
      said time constraint satisfaction judgment unit is operable to judge that the constraint condition is not satisfied, in the case where the predicted playback duration of the second synthesized speech indicates that the playback of the second synthesized speech is not completed before the playback of the first synthesized speech starts;
      
      said content modification unit is operable to delay a playback starting timing of the first synthesized speech to a predicted time at which the playback of the second synthesized speech is completed, and modify a distance to a predetermined location in accordance with a moving distance corresponding to the delay of the playback starting timing of the first synthesized speech, in the case where said time constraint satisfaction judgment unit judges that the constraint condition is not satisfied, the predetermined location being indicated in the text based on which the first synthesized speech is generated and the moving distance being calculated from the moving speed obtained by said speed obtainment unit; and
      
      said synthesized speech generation unit is operable to generate the first synthesized speech based on the text with the modified contents and play back the first synthesized speech, after completing the playback of the second synthesized speech.
  - 7. The information providing apparatus according to claim 5, wherein:
    - said information providing apparatus is operable to function as a scheduler which reads out a schedule registered by a user using synthesized speech at a preset time which is before a start time of the schedule;
      
      said information providing apparatus further includes a registration unit operable to accept registration of the user'"'"'s schedule, the start time of the schedule and the preset time;
      
      said duration prediction unit is operable to predict a playback duration of synthesized speech, the playback of the synthesized speech needing to be played back by the preset time;
      
      said time constraint satisfaction judgment unit is operable to judge that the constraint condition is not satisfied, in the case where the predicted playback duration of the synthesized speech indicates that the playback of the synthesized speech is not completed by the preset time;
      
      said content modification unit is operable to delay a playback starting timing of the synthesized speech to a time which is earlier than the start time of the schedule, and modify a duration before the start time of the schedule in accordance with the duration by which the playback starting timing of the synthesized speech is delayed, in the case where said time constraint satisfaction judgment unit judges that the constraint condition is not satisfied, the time to be modified being indicated in the text based on which the synthesized speech is generated; and
      
      said synthesized speech generation unit is operable to generate synthesized speech based on the text with the modified contents and play back the synthesized speech.

8. A program intended for an information providing apparatus, said program causing a computer to execute:
- predicting a playback duration of synthesized speech to be generated based on text;
  
  judging whether a constraint condition concerning a playback timing of the synthesized speech is satisfied or not, based on the predicted playback duration;
  
  in the case where said judging shows that the constraint condition is not satisfied, shifting a playback starting timing of the synthesized speech of the text forward or backward, and modifying contents indicating time or distance in the text, in accordance with a duration by which the playback starting timing of the synthesized speech is shifted; and
  
  generating synthesized speech based on the text with the modified contents, and playing back the synthesized speech.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Panasonic Corporation (Panasonic Holdings Corporation)
Original Assignee
Panasonic Corporation (Panasonic Holdings Corporation)
Inventors
Hirose, Yoshifumi, Kamai, Takahiro, Saito, Natsuki, Kato, Yumiko

Application Number

US11/434,153
Publication Number

US 20070094029A1
Time in Patent Office

Days
Field of Search
US Class Current

704/260
CPC Class Codes

G10L 13/033 Voice editing, e.g. manipul...

Speech synthesis method and information providing apparatus

First Claim

2 Assignments

0 Petitions

Accused Products

Abstract

Citations

8 Claims

Specification

Solutions

Use Cases

Quick Links

Speech synthesis method and information providing apparatus

First Claim

2 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

8 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links