Speech synthesis method and apparatus for electronic system
First Claim
Patent Images
1. A speech synthesis method for an electronic system, the speech synthesis method comprising:
- performing a text tagging process, comprising;
receiving a speech signal file, wherein the speech signal file comprises text content and prosodic information, wherein the speech signal file is a recorded file of human voice from a user to recite a text content and received by a voice input unit;
analyzing the speech signal file to obtain the prosodic information and the text content of the speech signal file, respectively; and
automatically tagging the text content and the corresponding prosodic information to obtain a text tag file; and
performing a prosody mimicking process, comprising;
combining a human voice profile and the text tag file to obtain a speech synthesis file, wherein a speech synthesis sound is produced when the speech synthesis file is broadcasted.
1 Assignment
0 Petitions
Accused Products
Abstract
A speech synthesis method for an electronic system and a speech synthesis apparatus are provided. In the speech synthesis method, a speech signal file including text content is received. The speech signal file is analyzed to obtain prosodic information of the speech signal file. The text content and the corresponding prosodic information are automatically tagged to obtain a text tag file. A speech synthesis file is obtained by synthesizing a human voice profile and the text tag file.
-
Citations
10 Claims
-
1. A speech synthesis method for an electronic system, the speech synthesis method comprising:
-
performing a text tagging process, comprising; receiving a speech signal file, wherein the speech signal file comprises text content and prosodic information, wherein the speech signal file is a recorded file of human voice from a user to recite a text content and received by a voice input unit; analyzing the speech signal file to obtain the prosodic information and the text content of the speech signal file, respectively; and automatically tagging the text content and the corresponding prosodic information to obtain a text tag file; and performing a prosody mimicking process, comprising; combining a human voice profile and the text tag file to obtain a speech synthesis file, wherein a speech synthesis sound is produced when the speech synthesis file is broadcasted. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A speech synthesis apparatus comprising:
-
a text tagging apparatus receiving a speech signal file, wherein the speech signal file comprises text content and prosodic information, and the text tagging apparatus comprises; a text recognizer analyzing the speech signal file to obtain the text content of the speech signal file, wherein the speech signal file is a recorded file of human voice from a user to recite a text content and received by a voice input unit; a prosody analyzer analyzing the speech signal file to obtain the prosodic information of the speech signal file; and a tagging device automatically tagging the text content and the corresponding prosodic information to obtain a text tag file; and a prosody mimicking apparatus receiving the text tag file and comprising; an analyzer analyzing the text tag file to obtain the text content and the prosodic information; and a speech synthesizer combining a human voice profile, the text content, and the prosodic information to obtain the speech synthesis file, wherein a speech synthesis sound is produced when the speech synthesis file is broadcasted by the speech synthesizer. - View Dependent Claims (9, 10)
-
Specification