Annotating phonemes and accents for text-to-speech system

  • US 8,751,235 B2
  • Filed: 08/03/2009
  • Issued: 06/10/2014
  • Est. Priority Date: 07/12/2005
  • Status: Active Grant
First Claim

1. A computer-implemented method for processing an input text, the input text comprising an input character string, the method comprising acts of:

  • identifying a first segmentation of the input character string, the first segmentation forming a first candidate sequence of words corresponding to the input character string, wherein the first candidate sequence of words comprises at least one first word having at least one character and a first pronunciation;

  • determining, based at least in part on statistical information regarding phonemes and/or accents for pronouncing character strings, a first occurrence probability for the first candidate sequence of words, wherein the statistical information comprises information indicative of a frequency at which the at least one character is associated with the first pronunciation;

  • identifying a second segmentation of the input character string, the second segmentation being different from the first segmentation and forming a second candidate sequence of words corresponding to the input character string, wherein the second candidate sequence of words comprises at least one second word having the same at least one character as the first word but a second pronunciation that is different from the first pronunciation of the first word;

  • determining, based at least in part on the statistical information regarding phonemes and/or accents for pronouncing character strings, a second occurrence probability for the second candidate sequence of words, wherein the statistical information further comprises information indicative of a frequency at which the at least one character is associated with the second pronunciation; and

  • selecting, based at least in part on the first and second occurrence probabilities, a selected sequence of words from a plurality of candidate sequences of words comprising the first and second candidate sequences of words.
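
The acts recited in claim 1 can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the patent: the toy frequency table `COUNTS`, the made-up words and pronunciations, the function names, and the scoring rule, which multiplies per-word relative frequencies as a simple unigram model. The claim itself only requires that the occurrence probabilities be based at least in part on such frequency statistics.

```python
from itertools import product
from math import prod

# Hypothetical statistics: how often a character string was observed with a
# given pronunciation. Words and pronunciations are invented for this sketch.
COUNTS = {
    ("ab", "AB"): 6,
    ("a", "EY"): 3,
    ("a", "AH"): 1,   # same characters as ("a", "EY"), different pronunciation
    ("b", "BEE"): 2,
    ("c", "SEE"): 4,
    ("bc", "BIK"): 4,
}
TOTAL = sum(COUNTS.values())
LEXICON = {word for word, _ in COUNTS}


def segmentations(text):
    """Yield every split of `text` into words that appear in the lexicon."""
    if not text:
        yield []
        return
    for end in range(1, len(text) + 1):
        head = text[:end]
        if head in LEXICON:
            for rest in segmentations(text[end:]):
                yield [head] + rest


def candidates(text):
    """Yield candidate sequences of (word, pronunciation) pairs."""
    for words in segmentations(text):
        # Each word may have several recorded pronunciations.
        options = [[(w, p) for (cw, p) in COUNTS if cw == w] for w in words]
        for combo in product(*options):
            yield list(combo)


def occurrence_probability(candidate):
    """Assumed scoring rule: product of per-pair relative frequencies."""
    return prod(COUNTS[pair] / TOTAL for pair in candidate)


def select(text):
    """Pick the candidate sequence with the highest occurrence probability."""
    return max(candidates(text), key=occurrence_probability)


if __name__ == "__main__":
    for cand in candidates("abc"):
        print(cand, occurrence_probability(cand))
    print("selected:", select("abc"))
```

In this sketch, the candidates `[("a", "EY"), ("bc", "BIK")]` and `[("a", "AH"), ("b", "BEE"), ("c", "SEE")]` play the roles of the first and second candidate sequences: they come from different segmentations and share the word "a" with two different pronunciations. A plain product of unigram frequencies tends to favor segmentations with fewer words; a practical system would likely condition on neighboring words, which this sketch does not attempt.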
