Prosody generating device, prosody generating method, and program
First Claim
1. A prosody generation apparatus that receives phonological information and linguistic information so as to generate prosody, the prosody generation apparatus being operable to refer to (a) a representative prosodic pattern storage unit for accumulating beforehand representative prosodic patterns of portions of speech data, the portions including prosody changing points;
(b) a selection rule storage unit that stores a selection rule predetermined according to attributes concerning phonology or attributes concerning linguistic information of the portions of the speech data including the prosody changing points; and
(c) a transformation rule storage unit that stores a transformation rule predetermined according to attributes concerning the phonology or the linguistic information of the portions of the speech data including the prosody changing points;
comprising:
a prosody changing point setting unit that sets a prosody changing point according to at least any one of the received phonological information and the linguistic information;
a pattern selection unit that selects a representative prosodic pattern from the representative prosodic pattern storage unit according to the selection rule, based on the received phonological information and the linguistic information; and
a prosody generation unit that transforms the representative prosodic pattern selected by the pattern selection unit according to the transformation rule and interpolates a portion that does not include a prosody changing point and is located between the thus selected and transformed representative patterns each corresponding to a portion including a prosody changing point.
Abstract
A prosody generation apparatus is provided that suppresses the distortion that occurs when generating prosodic patterns and can therefore generate natural prosody. A prosody changing point extraction unit in this apparatus extracts prosody changing points located at the beginning and end of a sentence, the beginning and end of a breath group, accent positions, and the like. A selection rule and a transformation rule for prosodic patterns that include a prosody changing point are generated by a statistical or learning technique, and the rules thus generated are stored beforehand in a representative prosodic pattern selection rule table and a transformation rule table. A pattern selection unit selects a representative prosodic pattern from the representative prosodic pattern selection rule table according to the selection rule. A prosody generation unit transforms the selected pattern according to the transformation rule and interpolates the portions other than the prosody changing points so as to generate the prosody as a whole.
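The interpolation step described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the abstract does not say which interpolation is used, so linear interpolation is an assumption here, as are the function name `interpolate_contour`, the frame-indexed F0 representation, and holding the nearest value flat at the edges.

```python
from typing import List, Optional, Tuple

# Hypothetical representation: a transformed pattern is (start_frame, F0 values in Hz).
Pattern = Tuple[int, List[float]]


def interpolate_contour(patterns: List[Pattern], total_frames: int) -> List[float]:
    """Place the transformed patterns on a frame grid and fill the gaps.

    Frames covered by a pattern (portions containing a prosody changing
    point) keep their values. Frames between two patterns are filled by
    linear interpolation (an assumption; the source names no method).
    """
    contour: List[Optional[float]] = [None] * total_frames
    for start, values in patterns:
        for offset, value in enumerate(values):
            contour[start + offset] = value

    known = [i for i, v in enumerate(contour) if v is not None]
    if not known:
        raise ValueError("at least one pattern is required")

    # Assumption: hold the nearest known value before the first and after the last pattern.
    for i in range(known[0]):
        contour[i] = contour[known[0]]
    for i in range(known[-1] + 1, total_frames):
        contour[i] = contour[known[-1]]

    # Linear interpolation across every gap between neighbouring known frames.
    for left, right in zip(known, known[1:]):
        gap = right - left
        for i in range(left + 1, right):
            t = (i - left) / gap
            contour[i] = contour[left] + t * (contour[right] - contour[left])

    return [float(v) for v in contour]


# Two patterns with a three-frame gap between them.
print(interpolate_contour([(0, [100.0, 120.0]), (5, [90.0, 80.0])], 7))
# -> [100.0, 120.0, 112.5, 105.0, 97.5, 90.0, 80.0]
```

Only the gap frames are computed; the frames that carry a selected and transformed pattern pass through unchanged.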
68 Claims
1. A prosody generation apparatus that receives phonological information and linguistic information so as to generate prosody, the prosody generation apparatus being operable to refer to (a) a representative prosodic pattern storage unit for accumulating beforehand representative prosodic patterns of portions of speech data, the portions including prosody changing points;
(b) a selection rule storage unit that stores a selection rule predetermined according to attributes concerning phonology or attributes concerning linguistic information of the portions of the speech data including the prosody changing points; and
(c) a transformation rule storage unit that stores a transformation rule predetermined according to attributes concerning the phonology or the linguistic information of the portions of the speech data including the prosody changing points;
comprising:
a prosody changing point setting unit that sets a prosody changing point according to at least any one of the received phonological information and the linguistic information;
a pattern selection unit that selects a representative prosodic pattern from the representative prosodic pattern storage unit according to the selection rule, based on the received phonological information and the linguistic information; and
a prosody generation unit that transforms the representative prosodic pattern selected by the pattern selection unit according to the transformation rule and interpolates a portion that does not include a prosody changing point and is located between the thus selected and transformed representative patterns each corresponding to a portion including a prosody changing point.
Dependent claims: 2-4, 13-44, 49, 50, 61-68
5-12. (canceled)
45-48. (canceled)
51-56. (canceled)
57. A prosody generation method by which phonological information and linguistic information are inputted so as to generate prosody, comprising the steps of:
setting a prosody changing point according to at least any one of the inputted phonological information and linguistic information;
selecting a prosodic pattern from representative prosodic patterns for portions including prosody changing points of speech data according to a selection rule predetermined beforehand based on attributes concerning phonology or attributes concerning linguistic information of the portions including the prosody changing points; and
transforming the selected prosodic pattern according to a transformation rule predetermined beforehand based on attributes concerning the phonology or attributes concerning the linguistic information of the portions including the prosody changing points, and interpolating a portion that does not include a prosody changing point and is located between the thus selected and transformed representative patterns each corresponding to a portion including a prosody changing point.
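The first steps of the method in claim 57 (setting prosody changing points, selecting a representative pattern by rule, transforming it by rule) can be sketched as follows. This is a toy illustration under stated assumptions, not the claimed implementation: the attribute names (`position`, `mora_count`), the pattern values, and both rules are hypothetical, whereas the source says the rules are produced by a statistical or learning technique.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass(frozen=True)
class Portion:
    position: str    # hypothetical linguistic attribute, e.g. "accent"
    mora_count: int  # hypothetical phonological attribute


# Hypothetical representative patterns (F0 in Hz); the values are made up.
REPRESENTATIVE_PATTERNS: Dict[str, List[float]] = {
    "rise": [100.0, 130.0],
    "peak": [140.0, 150.0, 120.0],
    "fall": [110.0, 80.0],
}

# Toy selection rule: attribute value -> pattern id (a learned rule in the source).
SELECTION_RULE: Dict[str, str] = {
    "sentence_start": "rise",
    "accent": "peak",
    "sentence_end": "fall",
}


def set_changing_points(labels: List[Tuple[str, int]]) -> List[Portion]:
    """Step 1: keep only the portions whose label marks a prosody changing point."""
    return [Portion(pos, morae) for pos, morae in labels if pos in SELECTION_RULE]


def select_pattern(portion: Portion) -> List[float]:
    """Step 2: choose a representative pattern according to the selection rule."""
    return REPRESENTATIVE_PATTERNS[SELECTION_RULE[portion.position]]


def transform_pattern(values: List[float], portion: Portion) -> List[float]:
    """Step 3: toy transformation rule, a constant F0 shift that grows with mora count."""
    offset = 5.0 * (portion.mora_count - 2)
    return [v + offset for v in values]


def generate_anchor_patterns(labels: List[Tuple[str, int]]) -> List[List[float]]:
    """Run steps 1 to 3; the gaps between results are left for interpolation."""
    return [transform_pattern(select_pattern(p), p) for p in set_changing_points(labels)]


labels = [("sentence_start", 2), ("plain", 3), ("accent", 4), ("sentence_end", 2)]
print(generate_anchor_patterns(labels))
# -> [[100.0, 130.0], [150.0, 160.0, 130.0], [110.0, 80.0]]
```

The "plain" portion yields no pattern of its own; in the claimed method its prosody comes from interpolating between the neighbouring transformed patterns.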
58. (canceled)
59. A program that has a computer conduct a procedure of receiving phonological information and linguistic information so as to generate prosody, the computer being operable to refer to (a) a representative prosodic pattern storage unit for accumulating beforehand representative prosodic patterns of portions of speech data, the portions including prosody changing points;
(b) a selection rule storage unit that stores a selection rule predetermined according to attributes concerning phonology or attributes concerning linguistic information of the portions of the speech data including the prosody changing points; and
(c) a transformation rule storage unit that stores a transformation rule predetermined according to attributes concerning the phonology or the linguistic information of the portions of the speech data including the prosody changing points;
the program having the computer conduct the steps of:
setting a prosody changing point according to at least any one of the received phonological information and the linguistic information;
selecting a representative prosodic pattern from the representative prosodic pattern storage unit according to the selection rule, based on the received phonological information and the linguistic information; and
transforming the representative prosodic pattern selected by the pattern selection unit according to the transformation rule and interpolating a portion that does not include a prosody changing point and is located between the thus selected and transformed representative patterns each corresponding to a portion including a prosody changing point.
60. (canceled)
Specification