TRAINING AND APPLYING PROSODY MODELS

US 20130085760A1
Filed: 11/29/2012
Published: 04/04/2013
Est. Priority Date: 08/12/2008
Status: Active Grant

First Claim

Patent Images

1-26. -26. (canceled)

View all claims

1 Assignment

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

Techniques for training and applying prosody models for speech synthesis are provided. A speech recognition engine processes audible speech to produce text annotated with prosody information. A prosody model is trained with this annotated text. After initial training, the model is applied during speech synthesis to generate speech with non-standard prosody from input text. Multiple prosody models can be used to represent different prosody styles.

Citations

34 Claims

1-26. -26. (canceled)

27. A computer-implementable method for synthesizing audible speech, with varying prosody, from textual content, the method comprising:
- maintaining an inventory of prosody models with lexicons,selecting a subset of multiple prosody models from the inventory of prosody models;
  
  associating prosody models in the subset of multiple prosody models with different segments of a text based on phrases in the text statistically associated with the lexicons of the prosody models;
  
  applying the associated prosody models to the different segments of the text to produce prosody annotations for the text;
  
  considering annotations of the prosody annotations to reconcile conflicting prosody annotations due to multiple prosody models associated with a segment of the text; and
  
  synthesizing audible speech from the text and the reconciled prosody annotations.
- View Dependent Claims (28, 29, 30, 31, 32, 33, 34)
- - 28. The method of claim 27, wherein the reconciling is based on a reconciliation policy.
  - 29. The method of claim 28, wherein the reconciliation policy considers the annotations of the prosody annotations that comprise a prosody model identifier and a prosody model confidence for the prosody annotation.
  - 30. The method of claim 29, wherein annotations of the prosody annotations are represented by markup elements that indicate the scope of the tagged text.
  - 31. The method of claim 30, wherein the reconciliation eliminates conflicting annotations that result from applications of multiple models.
  - 32. The method of claim 31, wherein the selecting is based on input parameters.
  - 33. The method of claim 32, wherein the input parameters indicate geographical information.
  - 34. The method of claim 32, wherein the input parameters indicate the type, identity, or role of a speaker.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Morphism LLC
Original Assignee
Morphism LLC
Inventors
Stephens, James H. Jr.

Granted Patent

US 8,554,566 B2
Time in Patent Office

Days
Field of Search
US Class Current

704/260
CPC Class Codes

G10L 13/08   Text analysis or generation...

G10L 13/10   Prosody rules derived from ...

G10L 15/063   Training

G10L 15/1807   using prosody or stress

TRAINING AND APPLYING PROSODY MODELS

First Claim

1 Assignment

0 Petitions

Accused Products

Abstract

Citations

34 Claims

Specification

Solutions

Use Cases

Quick Links

TRAINING AND APPLYING PROSODY MODELS

First Claim

1 Assignment

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

34 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links