×

Phonetic posteriorgrams for many-to-one voice conversion

  • US 10,176,819 B2
  • Filed: 06/09/2017
  • Issued: 01/08/2019
  • Est. Priority Date: 07/11/2016
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented method comprising:

  • obtaining a target speech;

    obtaining a source speech;

    generating a target phonetic posteriorgram (PPG) of the target speech by driving a first model with acoustic features of the target speech, the target PPG including a set of values corresponding to a range of times and a range of phonetic classes;

    extracting target mel-cepstral coefficients (MCEP) features from the target speech;

    training a second model using the target MCEP features and the target PPG to obtain a mapping between the target MCEP features and the target PPG;

    generating a source PPG of the source speech by driving the first model with acoustic features of the source speech; and

    converting the source speech into a converted speech using the source PPG and the trained second model.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×