×

Devices and methods for speech unit reduction in text-to-speech synthesis systems

  • US 8,751,236 B1
  • Filed: 10/23/2013
  • Issued: 06/10/2014
  • Est. Priority Date: 10/23/2013
  • Status: Active Grant
First Claim
Patent Images

1. A method comprising:

  • receiving, at a device, a plurality of speech sounds that are each indicative of a different full pronunciation of a first linguistic term, wherein the first linguistic term includes a representation of one or more phonemes;

    determining, by the device, concatenation features of the plurality of speech sounds of the first linguistic term, wherein the concatenation features are indicative of an acoustic transition between a first speech sound and a second speech sound when the first speech sound and the second speech sound are concatenated, wherein the first speech sound is included in the plurality of speech sounds of the first linguistic term and the second speech sound is indicative of a pronunciation of a second linguistic term;

    clustering, based on the concatenation features, the plurality of speech sounds into one or more clusters, wherein a given cluster includes one or more speech sounds of the plurality of speech sounds that have given concatenation features that are related by a clustering metric; and

    based on a determination that the first speech sound has the given concatenation features represented in the given cluster, providing a representative speech sound of the given cluster as the first speech sound when the first speech sound and the second speech sound are concatenated.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×