×

Methods and apparatus for predicting prosody in speech synthesis

  • US 9,286,886 B2
  • Filed: 01/24/2011
  • Issued: 03/15/2016
  • Est. Priority Date: 01/24/2011
  • Status: Active Grant
First Claim
Patent Images

1. A method comprising:

  • comparing an input text to a data set of text fragments to select a corresponding text fragment for at least a portion of the input text, wherein selecting the corresponding text fragment comprisesidentifying within the at least a portion of the input text a first sequence of words beginning with a first function word and including one or more words following the first function word,identifying a grammatical type of the first function word beginning the first sequence of words,constraining the identified first sequence of words within the at least a portion of the input text to be matched as a unit to a contiguous sequence of words in a text fragment in the data set, andselecting as the corresponding text fragment a text fragment including as the contiguous sequence of words a second sequence of words beginning with a second function word that is a different word from the first function word but is of the same grammatical type as the first function word, the corresponding text fragment being associated with spoken audio of at least the second sequence of words, wherein the second sequence of words within the corresponding text fragment includes at least one word not present in the first sequence of words within the at least a portion of the input text;

    determining an alignment of the corresponding text fragment with the at least a portion of the input text; and

    using a computer, synthesizing speech from the at least a portion of the input text, wherein the synthesizing comprises extracting prosody from the spoken audio of the second sequence of words, including from the at least one word not present in the first sequence of words, and applying the extracted prosody in synthesizing the speech using the alignment of the corresponding text fragment with the at least a portion of the input text.

View all claims
  • 7 Assignments
Timeline View
Assignment View
    ×
    ×