Automatic donor ranking and selection system and method for voice conversion
First Claim
Patent Images
1. A donor ranking system comprising:
- an acoustical feature extractor which extracts one or more acoustical features from a donor speech sample and a target speaker speech sample; and
an adaptive system which generates a prediction for a voice conversion quality value based on the acoustical features.
0 Assignments
0 Petitions
Accused Products
Abstract
An automatic donor selection algorithm estimates the subjective voice conversion output quality from a set of objective distance measures between the source and target speaker'"'"'s acoustical features. The algorithm learns the relationship of the subjective scores and the objective distance measures through nonlinear regression with an MLP. Once the MLP is trained, the algorithm can be used in the selection or ranking of a set of source speakers in terms of the expected output quality for transformations to a specific target voice.
38 Citations
22 Claims
-
1. A donor ranking system comprising:
-
an acoustical feature extractor which extracts one or more acoustical features from a donor speech sample and a target speaker speech sample; and
an adaptive system which generates a prediction for a voice conversion quality value based on the acoustical features. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A method for ranking donors comprising:
-
extracting one or more acoustical features from features from a donor speech sample and a target speaker speech sample; and
predicting for a voice conversion quality value based on the acoustical features using a trained adaptive system - View Dependent Claims (10, 11, 12, 13, 14, 15)
-
-
16. A method for training a donor ranking system comprising:
-
selecting a donor and a target speaker, having vocal characteristics, from a training database of speech samples;
deriving an actual subjective quality value;
extracting one or more acoustical features from a donor voice speech sample and a target speaker voice speech sample;
supplying the one or more acoustical features to an adaptive system;
predicting a predicted subjective quality value using the adaptive system;
calculating an error value between the predicted subjective quality value and the actual subjective quality value; and
adjusting the adaptive system based on the error value. - View Dependent Claims (17, 18, 19, 20, 21, 22)
-
Specification