Optimization of an objective measure for estimating mean opinion score of synthesized speech
First Claim
1. A method for optimizing an objective measure, from which naturalness of synthesized speech can be estimated, wherein naturalness is a subjective quality of synthesized speech, the method comprising:
- generating a set of synthesized utterances;
subjectively rating each of the synthesized utterances;
calculating a score for each of the synthesized utterances using an objective measure, the objective measure being a function of textual information derived from the utterances;
ascertaining a relationship between the scores of the objective measure and subjective ratings of the synthesized utterances; and
altering the objective measure in a manner beyond only changing one or more weighting factors in the objective measure to provide a different function of textual information derived from the utterances so as to improve the relationship between the scores of the objective measure and subjective ratings of the synthesized utterances.
2 Assignments
0 Petitions
Accused Products
Abstract
A method is provided for optimizing an objective measure used to estimate mean opinion score or naturalness of synthesized speech from a speech synthesizer. The method includes using an objective measure that has components derived directly from textual information used to form synthesized utterances. The objective measure has a high correlation with mean opinion score such that a relationship can be formed between the objective measure and corresponding mean opinion score. The objective measure is altered to provide a different function of textual information derived from the utterances so as to improve the relationship between the scores of the objective measure and subjective ratings of the synthesized utterances.
17 Citations
25 Claims
-
1. A method for optimizing an objective measure, from which naturalness of synthesized speech can be estimated, wherein naturalness is a subjective quality of synthesized speech, the method comprising:
-
generating a set of synthesized utterances; subjectively rating each of the synthesized utterances; calculating a score for each of the synthesized utterances using an objective measure, the objective measure being a function of textual information derived from the utterances; ascertaining a relationship between the scores of the objective measure and subjective ratings of the synthesized utterances; and altering the objective measure in a manner beyond only changing one or more weighting factors in the objective measure to provide a different function of textual information derived from the utterances so as to improve the relationship between the scores of the objective measure and subjective ratings of the synthesized utterances. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. A method for optimizing an objective measure, from which naturalness of synthesized speech can be estimated, wherein naturalness is a subjective quality of synthesized speech, the method comprising:
-
generating a set of synthesized utterances; subjectively rating each of the synthesized utterances; calculating a score for each of the synthesized utterances using an objective measure, the objective measure being a function of textual information derived from speech units used in the utterances and the objective measure comprising components being based on single-order textual features or a combination of at least two single-order textual features, the components having categorical values, wherein a distance between categories are empirically defined as values in distance tables, the components each further having a weighting value; ascertaining a relationship between the scores of the objective measure and subjective ratings of the synthesized utterances; and altering the objective measure in a manner beyond only changing one or more weighting factors in the objective measure to provide a different function of textual information derived from the utterances so as to improve the relationship between the scores of the objective measure and subjective ratings of the synthesized utterances, wherein altering comprises altering the values in the distance tables followed by altering the weighting values. - View Dependent Claims (22, 23, 24, 25)
-
Specification