PROSODY MODIFICATION DEVICE, PROSODY MODIFICATION METHOD, AND RECORDING MEDIUM STORING PROSODY MODIFICATION PROGRAM
First Claim
1. A prosody modification device comprising:
- a real voice prosody input part that receives real voice prosody information extracted from an utterance of a human;
a regular prosody generating part that generates regular prosody information having a regular phoneme boundary that determines a boundary between phonemes and a regular phoneme length of a phoneme by using data representing a regular or statistical phoneme length in an utterance of a human with respect to a section including at least a phoneme or a phoneme string to be modified in the real voice prosody information; and
a real voice prosody modification part that resets a real voice phoneme boundary of the phoneme or the phoneme string to be modified in the real voice prosody information by using the regular prosody information generated by the regular prosody generating part so that the real voice phoneme boundary and a real voice phoneme length of the phoneme or the phoneme string to be modified in the real voice prosody information are approximate to an actual phoneme boundary and an actual phoneme length of the utterance of the human, thereby modifying the real voice prosody information.
1 Assignment
0 Petitions
Accused Products
Abstract
A prosody modification device includes: a real voice prosody input part that receives real voice prosody information extracted from an utterance of a human; a regular prosody generating part that generates regular prosody information having a regular phoneme boundary that determines a boundary between phonemes and a regular phoneme length of a phoneme by using data representing a regular or statistical phoneme length in an utterance of a human with respect to a section including at least a phoneme or a phoneme string to be modified in the real voice prosody information; and a real voice prosody modification part that resets a real voice phoneme boundary by using the generated regular prosody information so that the real voice phoneme boundary and a real voice phoneme length of the phoneme or the phoneme string to be modified in the real voice prosody information are approximate to an actual phoneme boundary and an actual phoneme length of the utterance of the human, thereby modifying the real voice prosody information.
44 Citations
12 Claims
-
1. A prosody modification device comprising:
-
a real voice prosody input part that receives real voice prosody information extracted from an utterance of a human; a regular prosody generating part that generates regular prosody information having a regular phoneme boundary that determines a boundary between phonemes and a regular phoneme length of a phoneme by using data representing a regular or statistical phoneme length in an utterance of a human with respect to a section including at least a phoneme or a phoneme string to be modified in the real voice prosody information; and a real voice prosody modification part that resets a real voice phoneme boundary of the phoneme or the phoneme string to be modified in the real voice prosody information by using the regular prosody information generated by the regular prosody generating part so that the real voice phoneme boundary and a real voice phoneme length of the phoneme or the phoneme string to be modified in the real voice prosody information are approximate to an actual phoneme boundary and an actual phoneme length of the utterance of the human, thereby modifying the real voice prosody information. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A prosody modification method comprising:
-
a real voice prosody input operation in which a real voice prosody input part provided in a computer receives real voice prosody information extracted from an utterance of a human; a regular prosody generating operation in which a regular prosody generating part provided in the computer generates regular prosody information having a regular phoneme boundary that determines a boundary between phonemes and a regular phoneme length of a phoneme by using data representing a regular or statistical phoneme length in an utterance of a human with respect to a section including at least a phoneme or a phoneme string to be modified in the real voice prosody information; and a real voice prosody modifying operation in which a real voice prosody modification part provided in the computer resets a real voice phoneme boundary of the phoneme or the phoneme string to be modified in the real voice prosody information by using the regular prosody information generated in the regular prosody generating operation so that the real voice phoneme boundary and a real voice phoneme length of the phoneme or the phoneme string to be modified in the real voice prosody information are approximate to an actual phoneme boundary and an actual phoneme length of the utterance of the human, thereby modifying the real voice prosody information.
-
-
12. A recording medium storing a prosody modification program that allows a computer to execute:
-
a real voice prosody input process of receiving real voice prosody information extracted from an utterance of a human; a regular prosody generation process of generating regular prosody information having a regular phoneme boundary that determines a boundary between phonemes and a regular phoneme length of a phoneme by using data representing a regular or statistical phoneme length in an utterance of a human with respect to a section including at least a phoneme or a phoneme string to be modified in the real voice prosody information; and a real voice prosody modification process of resetting a real voice phoneme boundary of the phoneme or the phoneme string to be modified in the real voice prosody information by using the regular prosody information generated in the regular prosody generation process so that the real voice phoneme boundary and a real voice phoneme length of the phoneme or the phoneme string to be modified in the real voice prosody information are approximate to an actual phoneme boundary and an actual phoneme length of the utterance of the human, thereby modifying the real voice prosody information.
-
Specification