System and method for supporting text-to-speech

  • US 7,921,014 B2
  • Filed: 07/09/2007
  • Issued: 04/05/2011
  • Est. Priority Date: 08/21/2006
  • Status: Active Grant
First Claim

1. A method of supporting text-to-speech synthesis, the method comprising:

    acquiring first frequency data set in a language processing unit, the first frequency data indicating appearance frequencies of readings corresponding to text wordings;

    recognizing speech produced by a user reading a learning text;

    generating first learning data by associating recognized readings from the speech with portions of the learning text, or by recognizing both wordings and readings of phrases from the speech;

    generating, based on the first learning data, second frequency data indicating appearance frequencies of readings corresponding to wordings of phrases from the speech;

    generating a plurality of frequency data candidates, each frequency data candidate indicating, for at least one combination of a plurality of continuously-written phrases, an appearance frequency of at least one combination of readings, the appearance frequency of the at least one combination of readings comprising a weighted average of an appearance frequency of the at least one combination of readings from the first frequency data with an appearance frequency of the at least one combination of readings from the second frequency data, wherein each of the plurality of frequency data candidates uses different weights for the weighted average;

    for each one of the plurality of frequency data candidates using different weights for the weighted average, using the language processing unit to generate a set of readings corresponding to the learning text using the one of the plurality of frequency data candidates, wherein the set of readings comprises a subset of readings that match readings of the first learning data, and calculating a ratio of the subset of readings to the set of readings, wherein a first frequency data candidate of the plurality of frequency data candidates has a highest calculated ratio;

    updating frequency data in the language processing unit using the first frequency data candidate with the highest calculated ratio; and

    setting the updated frequency data in the language processing unit.
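
The candidate-selection steps recited above can be illustrated with a minimal sketch. It is not the patented implementation. It assumes relative frequencies stored as nested dictionaries, treats each wording as a single key rather than a combination of continuously-written phrases, and stands in for the language processing unit with a simple most-frequent-reading lookup. All function names, the example wordings and readings, and the weights are hypothetical.

```python
from collections import Counter, defaultdict


def relative_frequencies(pairs):
    """Build {wording: {reading: relative frequency}} from (wording, reading) pairs."""
    counts = defaultdict(Counter)
    for wording, reading in pairs:
        counts[wording][reading] += 1
    return {
        wording: {r: c / sum(rc.values()) for r, c in rc.items()}
        for wording, rc in counts.items()
    }


def blend(first, second, weight):
    """Weighted average of two frequency tables; `weight` applies to `second`."""
    blended = {}
    for wording in set(first) | set(second):
        f = first.get(wording, {})
        s = second.get(wording, {})
        blended[wording] = {
            r: (1 - weight) * f.get(r, 0.0) + weight * s.get(r, 0.0)
            for r in set(f) | set(s)
        }
    return blended


def predict(freq, wording):
    """Stand-in for the language processing unit: pick the most frequent reading."""
    readings = freq.get(wording)
    if not readings:
        return None
    # Sorting first makes tie-breaking deterministic.
    return max(sorted(readings), key=readings.get)


def select_best(first, learning_data, weights):
    """Return (ratio, weight, candidate) for the candidate with the highest match ratio."""
    second = relative_frequencies(learning_data)
    best = None
    for weight in weights:
        candidate = blend(first, second, weight)
        matches = sum(
            1 for wording, reading in learning_data
            if predict(candidate, wording) == reading
        )
        ratio = matches / len(learning_data)
        if best is None or ratio > best[0]:
            best = (ratio, weight, candidate)
    return best


# Hypothetical first frequency data (already set in the language processing unit).
first = {
    "read": {"ri:d": 0.7, "red": 0.3},
    "lead": {"li:d": 0.6, "led": 0.4},
}
# Hypothetical first learning data: (wording, reading recognized from the user's speech).
learning_data = [("read", "red"), ("read", "red"), ("read", "ri:d"), ("lead", "led")]

ratio, weight, updated = select_best(first, learning_data, [0.0, 0.5, 1.0])
print(ratio, weight)  # 0.75 1.0
```

In this toy data the candidate built with weight 1.0 reproduces three of the four recognized readings, so it would be the one used to update the frequency data. With other data an intermediate weight could score highest, which is the reason for evaluating several candidates rather than fixing one weight in advance.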
