SPEECH SYNTHESIS SYSTEM
First Claim
1. A speech synthesis system comprising:
- speech element information storage unit for storing speech element information representing a speech element capable of synthesizing speech having a degree of naturalness indicating a degree of similarity to speech uttered by a human higher than a predetermined reference value when the speech element is used for synthesizing speech having reference prosody which is prosody serving as a reference;
requested prosody information accepting unit for accepting requested prosody information representing requested prosody which is prosody requested by a user;
intermediate prosody information generating unit for generating intermediate prosody information representing intermediate prosody which is prosody between the reference prosody and the requested prosody; and
speech synthesizing unit for executing a speech synthesis process to synthesize speech based on the generated intermediate prosody information and the stored speech element information.
1 Assignment
0 Petitions
Accused Products
Abstract
When a system (100) is used for synthesizing speech having prosody serving as a reference, the system stores speech element information representing a speech element capable of synthesizing speech having a degree of naturalness indicating a degree of similarity to speech uttered by a human higher than a predetermined reference value (speech element information storage (115)). The system accepts requested prosody information representing prosody requested by the user (requested prosody information accepting part (113)). The system generates intermediate prosody information representing intermediate prosody between the reference prosody and the requested prosody (intermediate prosody information generator (114)). The system executes a speech synthesis process to synthesize speech based on the generated intermediate prosody information and the stored speech element information (speech synthesizer (116)).
-
Citations
14 Claims
-
1. A speech synthesis system comprising:
-
speech element information storage unit for storing speech element information representing a speech element capable of synthesizing speech having a degree of naturalness indicating a degree of similarity to speech uttered by a human higher than a predetermined reference value when the speech element is used for synthesizing speech having reference prosody which is prosody serving as a reference; requested prosody information accepting unit for accepting requested prosody information representing requested prosody which is prosody requested by a user; intermediate prosody information generating unit for generating intermediate prosody information representing intermediate prosody which is prosody between the reference prosody and the requested prosody; and speech synthesizing unit for executing a speech synthesis process to synthesize speech based on the generated intermediate prosody information and the stored speech element information. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A speech synthesis method comprising:
-
in the case that speech element information representing a speech element capable of synthesizing speech having a degree of naturalness indicating a degree of similarity to speech uttered by a human higher than a predetermined reference value, when the speech element is used for synthesizing speech having reference prosody which is prosody serving as a reference, is stored in a storage device, accepting requested prosody information representing requested prosody which is prosody requested by a user; generating intermediate prosody information representing intermediate prosody which is prosody between the reference prosody and the requested prosody; and executing a speech synthesis process to synthesize speech based on the generated intermediate prosody information and the stored speech element information. - View Dependent Claims (9, 10)
-
-
11. A computer-readable medium storing a speech synthesis program comprising instructions for causing an information processing device to realize:
- speech element information storing process unit for storing in a storage device speech element information representing a speech element capable of synthesizing speech having a degree of naturalness indicating a degree of similarity to speech uttered by a human higher than a predetermined reference value when the speech element is used for synthesizing speech having reference prosody which is prosody serving as a reference;
requested prosody information accepting unit for accepting requested prosody information representing requested prosody which is prosody requested by a user; intermediate prosody information generating unit for generating intermediate prosody information representing intermediate prosody which is prosody between the reference prosody and the requested prosody; and speech synthesizing unit for executing a speech synthesis process to synthesize speech based on the generated intermediate prosody information and the stored speech element information. - View Dependent Claims (12, 13)
- speech element information storing process unit for storing in a storage device speech element information representing a speech element capable of synthesizing speech having a degree of naturalness indicating a degree of similarity to speech uttered by a human higher than a predetermined reference value when the speech element is used for synthesizing speech having reference prosody which is prosody serving as a reference;
-
14. A speech synthesis system comprising:
-
speech element information storage means for storing speech element information representing a speech element capable of synthesizing speech having a degree of naturalness indicating a degree of similarity to speech uttered by a human higher than a predetermined reference value when the speech element is used for synthesizing speech having reference prosody which is prosody serving as a reference; requested prosody information accepting means for accepting requested prosody information representing requested prosody which is prosody requested by a user; intermediate prosody information generating means for generating intermediate prosody information representing intermediate prosody which is prosody between the reference prosody and the requested prosody; and speech synthesizing means for executing a speech synthesis process to synthesize speech based on the generated intermediate prosody information and the stored speech element information.
-
Specification