Synthesis of speech from text
First Claim
1. An apparatus for synthesizing speech from text, comprising:
a language processing section determining an accent environment of each mora of each phrase of the text, said accent environment including a height of an accent of each mora;
a basic accent pattern table in which a basic accent pattern has been classified according to an accent environment of the mora, the basic accent pattern including pitch data which has been edited from real voice data according to the accent environment;
a basic accent pattern processing section selecting the basic accent pattern of each mora from said basic accent pattern table according to the accent environment and processing the basic accent pattern in a pitch according to the accent environment;
a correcting section receiving the basic accent pattern in the pitch in said basic accent pattern processing section and correcting the pitch according to the number of moras in each phrase and the position of the moras in the phrase so as to correct the data in the corrected accent component;
a phrase pattern processing section determining a phrase component according to the number of moras in each phrase of the accent environment; and
a speech synthesizing section synthesizing speech according to an accent control pattern of the text which is obtained by adding the basic accent pattern and the basic phrase pattern.
Abstract
An apparatus for synthesizing speech from text includes a language processing section which determines an accent environment of each mora of the text. In a basic accent pattern table, a basic accent pattern is classified according to the accent environment of the mora. The basic accent pattern includes pitch data which is edited from real voice data according to the accent environment. A basic accent pattern processing section selects the basic accent pattern of each mora from the basic accent pattern table according to the accent environment and processes the basic accent pattern in pitch according to the accent environment. A correcting section receives the pitch data processed in the basic accent pattern processing section and corrects it according to the number of moras in each phrase and the position of each mora in the phrase, yielding a corrected accent component. A phrase pattern processing section determines a phrase component according to the number of moras in each phrase, which is part of the accent environment. A speech synthesizing section synthesizes speech according to an accent control pattern of the text which is obtained by adding the accent pattern and the phrase pattern.
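The data flow in the abstract can be pictured with a small Python sketch: look up a per-mora pattern keyed by accent environment, scale it, and add a phrase component to obtain the accent control pattern. This is only an illustration under stated assumptions. The `AccentEnv` fields, the table values, the `gain` scaling, and the linearly falling phrase contour are all hypothetical placeholders; the abstract does not specify any of them.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass(frozen=True)
class AccentEnv:
    """Hypothetical accent environment of one mora."""
    height: str       # accent height of this mora: "H" or "L"
    next_height: str  # height of the following mora, "-" at phrase end


# Hypothetical basic accent pattern table. In the described apparatus the
# pitch data would be edited from real voice data; these values are made up.
BASIC_ACCENT_TABLE: Dict[AccentEnv, Tuple[float, float]] = {
    AccentEnv("H", "H"): (1.0, 1.0),
    AccentEnv("H", "L"): (1.0, 0.5),
    AccentEnv("L", "H"): (0.0, 0.5),
    AccentEnv("L", "L"): (0.0, 0.0),
    AccentEnv("H", "-"): (1.0, 0.8),
    AccentEnv("L", "-"): (0.0, 0.0),
}


def accent_environments(heights: List[str]) -> List[AccentEnv]:
    """Pair each mora's height with the height of the mora that follows it."""
    following = heights[1:] + ["-"]
    return [AccentEnv(h, n) for h, n in zip(heights, following)]


def accent_component(heights: List[str], gain: float = 1.0) -> List[float]:
    """Select one table pattern per mora and scale it (assumed processing)."""
    out: List[float] = []
    for env in accent_environments(heights):
        out.extend(gain * p for p in BASIC_ACCENT_TABLE[env])
    return out


def phrase_component(n_moras: int, start: float = 2.0, end: float = 1.0) -> List[float]:
    """Assumed phrase contour: a straight line, two points per mora."""
    n = 2 * n_moras
    if n == 0:
        return []
    step = (end - start) / (n - 1)
    return [start + i * step for i in range(n)]


def accent_control_pattern(heights: List[str]) -> List[float]:
    """Add the accent component and the phrase component point by point."""
    acc = accent_component(heights)
    phr = phrase_component(len(heights))
    return [a + p for a, p in zip(acc, phr)]
```

Calling `accent_control_pattern(["L", "H", "L"])` returns six values, two per mora, in which the accent rise and fall sit on top of the falling phrase line.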
10 Claims
1. An apparatus for synthesizing speech from text, comprising:
a language processing section determining an accent environment of each mora of each phrase of the text, said accent environment including a height of an accent of each mora;
a basic accent pattern table in which a basic accent pattern has been classified according to an accent environment of the mora, the basic accent pattern including pitch data which has been edited from real voice data according to the accent environment;
a basic accent pattern processing section selecting the basic accent pattern of each mora from said basic accent pattern table according to the accent environment and processing the basic accent pattern in a pitch according to the accent environment;
a correcting section receiving the basic accent pattern in the pitch in said basic accent pattern processing section and correcting the pitch according to the number of moras in each phrase and the position of the moras in the phrase so as to correct the data in the corrected accent component;
a phrase pattern processing section determining a phrase component according to the number of moras in each phrase of the accent environment; and
a speech synthesizing section synthesizing speech according to an accent control pattern of the text which is obtained by adding the basic accent pattern and the basic phrase pattern.
Dependent claims: 2, 3, 4, 5, 6, 7, 8
9. An accent pattern calculating section in an accent control section of a speech synthesizer, the speech synthesizer having a text input section for inputting text data, the text input section being connected to a language processing section for analyzing the content of the text with morpheme analysis, an accent pattern component obtained from said accent pattern calculating section being combined with a phrase component formed in a phrase pattern calculating section, said accent pattern calculating section comprising:
a basic accent pattern table having a basic accent pattern classified according to an accent environment of the mora which includes a height of an accent of each mora, the basic accent pattern including pitch data which has been edited from a real voice data according to the accent environment;
a basic accent pattern processing section selecting the basic accent pattern of each mora from said basic accent pattern table according to the accent environment and processing the basic accent pattern in a pitch according to the accent environment; and
a correcting section receiving the basic accent pattern with the pitch from said basic accent pattern processing section and correcting the pitch according to the number of moras in each phrase and the position of the moras in the phrase, so as to correct the data in a corrected accent component.
10. A method for synthesizing speech from text, comprising the steps of:
a) inputting text data into a text input section;
b) analyzing the contents of the text in a language processing section with morpheme analysis;
c) obtaining an accent pattern component from an accent pattern calculating section;
d) obtaining a phrase component from a phrase pattern calculating section; and
e) combining said accent pattern component with said phrase component,
wherein said step c) further comprises the steps of
classifying a basic accent pattern in a basic accent pattern table according to an accent environment of each mora of the text data, the basic accent pattern including pitch data which has been edited from real voice data according to the accent environment;
selecting the basic accent pattern of each mora from said basic accent pattern table in a basic accent pattern processing section according to the accent environment and processing the basic accent pattern in a pitch according to the accent environment; and
receiving the basic accent pattern with the pitch from said basic accent pattern processing section in a correcting section and correcting the pitch according to the number of moras in each phrase and the position of the moras in the phrase so as to correct the data in a corrected accent component.
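The correcting step that closes step c) of claim 10 depends on two quantities only: the number of moras in the phrase and the position of each mora within it. The Python sketch below shows one way such a correction could look. The attenuation rule, the function name `correct_accent`, and the parameters `max_drop` and `ref_len` are assumptions made for illustration; the claim does not state the correction formula.

```python
from typing import List


def correct_accent(pitches: List[float], max_drop: float = 0.3, ref_len: int = 10) -> List[float]:
    """Attenuate per-mora pitch values by phrase length and mora position.

    pitches holds one pitch value per mora of a single phrase.
    Assumed rule (not from the claim): the total attenuation grows with
    the number of moras up to ref_len, and each mora receives a share of
    it proportional to its relative position in the phrase.
    """
    n = len(pitches)
    if n == 0:
        return []
    # Longer phrases get a larger total drop, capped at max_drop.
    total_drop = max_drop * min(n, ref_len) / ref_len
    corrected: List[float] = []
    for pos, pitch in enumerate(pitches):
        # Relative position runs from 0.0 (first mora) to 1.0 (last mora).
        rel = pos / (n - 1) if n > 1 else 0.0
        corrected.append(pitch * (1.0 - total_drop * rel))
    return corrected
```

With these assumed defaults, the first mora of a phrase is left unchanged and the last mora of a five-mora phrase is scaled by 0.85, while the last mora of a phrase of ten or more moras is scaled by 0.7.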
Specification