×

Employing speech models in concatenative speech synthesis

  • US 6,950,798 B1
  • Filed: 03/02/2002
  • Issued: 09/27/2005
  • Est. Priority Date: 04/13/2001
  • Status: Expired due to Term
First Claim
Patent Images

1. An arrangement for creating synthesized speech from an applied sequence of desired speech unit features parameter sets, D-SUF(i), i=2,3, . . . , comprising:

  • a database that contains a plurality of sets, E(k), k=1,2, . . . ,K, where K is an integer, each set E(k) includinga plurality of associated frames in sequence, each of said frames being represented bya collection of model feature parameters, andT-D data representing a time-domain speech signalcorresponding to said frame, anda collection of unit selection parameters which characterize the model feature parameters of the speech frames in the set E(k);

    a database search engine that, for each applied D-SUF(i), selects from said database a set E(i) having a collection of unit selection parameters that match best said D-SUF(i), and said plurality of frames that are associated with said E(i), thus creating a sequence of frames;

    an evaluator that determines, based on assessment of information obtained from said database and pertaining to said E(i), whether modifications are needed to frames of said E(i);

    a modification and synthesis module that, when said evaluator concludes that modifications to frames are needed, modifies the collection of model parameters of those frames that need modification, and generates, for each frame having a modified collection of model parameters, T-D data corresponding to said frame; and

    a combiner that concatenates T-D data of successive frames in said sequence of frames, by employing, for each concatenated frame, the T-D data generated for said concatenated frame by said modification and synthesis module, if such T-D data was generated, or T-D data retrieved for said concatenated frame from said database.

View all claims
  • 10 Assignments
Timeline View
Assignment View
    ×
    ×