
Prosody generation using syllable-centered polynomial representation of pitch contours

  • US 8,886,539 B2
  • Filed: 03/17/2014
  • Issued: 11/11/2014
  • Est. Priority Date: 12/03/2012
  • Status: Active Grant
First Claim

1. A method for building databases for prosody generation in speech synthesis using one or more processors comprising:

    A) compile a text corpus of sentences containing all the prosody phenomena of interest;

    B) for each phrase in each said sentence, identify the phrase type;

    C) segment each sentence into syllables, identify the property and context information of each said syllable;

    D) read the sentences by a reference speaker to make a recording of voice signals;

    E) segment the voice signals of each sentence into syllables, each said syllable is aligned with a syllable in the text;

    F) identify the voiced section in each syllable of the voice recording;

    G) calculate pitch values in the said voiced section;

    H) generate a polynomial expansion of the pitch contour of each said voiced section in each syllable by least-squares fitting, comprising the use of Gegenbauer polynomials, which at least have a constant term representing the average pitch of the said syllable;

    I) for all phrases of a given type, generate a polynomial expansion of the values of said average pitch of all syllables in the said phrases using least-squares fitting, to generate an average global pitch contour of the given phrase type;

    J) form a set of syllable pitch parameters for each said syllable by subtracting the value of the global pitch profile at that point from the value of the average pitch of the said syllable together with the rest of polynomial expansion coefficients for the said syllable;

    K) correlate the syllable pitch parameters with the property and context information of the said syllable from an analysis of the text to form a database of syllable pitch parameters;

    L) correlate the intensity and duration parameters of a syllable to the property and context information of the said syllable from an analysis of the text to form a database of intensity and duration.
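
Steps H) through J) can be illustrated with a minimal sketch. The claim does not fix the Gegenbauer parameter, the polynomial degree, or the time normalization, so the following are assumptions made only for illustration: alpha = 0.5 (the Legendre case), time and syllable position mapped to [-1, 1], and a least-squares fit solved through the normal equations. All function names are hypothetical. Whether the constant coefficient equals the arithmetic mean of the pitch samples depends on alpha and on the sampling; here it is simply treated as the syllable's average pitch, as the claim describes.

```python
def gegenbauer_basis(x, degree, alpha=0.5):
    """Values C_0..C_degree of the Gegenbauer polynomials at x (three-term recurrence)."""
    vals = [1.0]
    if degree >= 1:
        vals.append(2.0 * alpha * x)
    for n in range(2, degree + 1):
        vals.append((2.0 * x * (n + alpha - 1.0) * vals[n - 1]
                     - (n + 2.0 * alpha - 2.0) * vals[n - 2]) / n)
    return vals


def solve(a, b):
    """Solve a small dense linear system by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (m[i][n] - sum(m[i][j] * x[j] for j in range(i + 1, n))) / m[i][i]
    return x


def lstsq_fit(xs, ys, degree, alpha=0.5):
    """Least-squares expansion coefficients of ys over xs (xs assumed to lie in [-1, 1])."""
    rows = [gegenbauer_basis(x, degree, alpha) for x in xs]
    k = degree + 1
    ata = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    aty = [sum(r[i] * y for r, y in zip(rows, ys)) for i in range(k)]
    return solve(ata, aty)


def evaluate(coeffs, x, alpha=0.5):
    """Evaluate an expansion at x, e.g. the global phrase contour at a syllable position."""
    basis = gegenbauer_basis(x, len(coeffs) - 1, alpha)
    return sum(c * b for c, b in zip(coeffs, basis))


def syllable_parameters(syl_coeffs, position, global_coeffs, alpha=0.5):
    """Step J: subtract the global contour value at the syllable's position from the
    syllable's constant (average-pitch) term and keep the remaining coefficients."""
    residual = syl_coeffs[0] - evaluate(global_coeffs, position, alpha)
    return [residual] + list(syl_coeffs[1:])
```

In this sketch, `lstsq_fit` would be called once per voiced section on (normalized time, pitch) pairs for step H, and once per phrase type on (normalized syllable position, average pitch) pairs for step I. The list returned by `syllable_parameters` is what steps K and L would then associate with each syllable's property and context information.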
