QUANTITATIVE F0 CONTOUR GENERATING DEVICE AND METHOD, AND MODEL LEARNING DEVICE AND METHOD FOR F0 CONTOUR GENERATION
First Claim
1. A quantitative F0 contour generating device, comprising:
- means for generating, for an accent phrase of an utterance obtained by text analysis, accent components of an F0 contour using a given number of target points;
means for generating phrase components of the F0 contour using a limited number of target points, by dividing the utterance to groups each including one or more accent phrases, in accordance with linguistic information including an utterance structure; and
means for generating an F0 contour based on said accent components and said phrase components.
1 Assignment
0 Petitions
Accused Products
Abstract
[Object] An object is to provide an F0 contour synthesizing device based on statistic model, to clarify correspondence between linguistic information and F0 contour while maintaining accuracy.
[Solution] An HMM learning device includes: a parameter estimating unit representing an F0 contour 133 fitting a continuous F0 contour 132 as a sum of phrase components and accent components and estimating target points of these; and an HMM learning means conducting learning of HMM 139 using the fitted F0 contour as training data. The continuous F0 contour may be decomposed to accent components 134, phrase components 136 and micro-prosody components 138, and separate HMMs 140, 142 and 144 may be trained. Using results of text analysis, accent components, phrase components and micro-prosody components are separately synthesized from HMMs 140, 142 and 144 and the results are synthesized to obtain an F0 contour.
57 Citations
8 Claims
-
1. A quantitative F0 contour generating device, comprising:
-
means for generating, for an accent phrase of an utterance obtained by text analysis, accent components of an F0 contour using a given number of target points; means for generating phrase components of the F0 contour using a limited number of target points, by dividing the utterance to groups each including one or more accent phrases, in accordance with linguistic information including an utterance structure; and means for generating an F0 contour based on said accent components and said phrase components.
-
-
2. A quantitative F0 contour generating method, comprising the steps of:
-
generating, for an accent phrase of an utterance obtained by text analysis, accent components of an F0 contour using a given number of target points; generating phrase components of the F0 contour using a limited number of target points, by dividing the utterance to groups each including one or more accent phrases, in accordance with linguistic information including an utterance structure; and generating an F0 contour based on said accent components and said phrase components.
-
-
3-4. -4. (canceled)
-
5. A model learning device for F0 contour generation, comprising:
-
F0 contour extracting means for extracting an F0 contour from a speech data signal; parameter estimating means for estimating target parameters representing phrase components and target parameters representing accent components, for representing an F0 contour fitting the extracted F0 contour by superposition of phrase components and accent components; and model learning means, performing F0 generation model learning, using a continuous F0 contour represented by the target parameters of phrase components and the target parameters of accent components estimated by said parameter estimating means as training data. - View Dependent Claims (6)
-
-
7. A model learning method for F0 contour generation, comprising the steps of:
-
F0 contour extracting step of extracting an F0 contour from a speech data signal; parameter estimating step of estimating target parameters representing phrase components and target parameters representing accent components, for representing an F0 contour fitting the extracted F0 contour by superposition of phrase components and accent components; and model learning step of performing F0 generation model learning, using a continuous F0 contour represented by the target parameters of phrase components and the target parameters of accent components estimated by said parameter estimating means as training data. - View Dependent Claims (8)
-
Specification