Text-to-speech synthesis system
First Claim
1. A text-to-speech synthesis apparatus comprising:
- storage means for storing phoneme data of a plurality of speaker voices;
selecting means for selecting at least two speaker voices from said plurality of speaker voices;
searching means for searching said storage means for phoneme data of the speaker voices selected by said selecting means; and
text-to-speech synthesis processing means for linking said phoneme data of said speaker voices retrieved by said searching means to convert input data into a synthetic speech;
wherein said text-to-speech synthesis processing means can convert said input data into a synthetic speech including at least two speaker voices.
0 Assignments
0 Petitions
Accused Products
Abstract
The present invention is intended to provide a text-to-speech synthesis apparatus, including a storage for storing phoneme data of a plurality of speakers; a selector for selecting one of the plurality of speakers in accordance with an operation performed by a user; a searcher for searching the storage for phoneme data of the speaker selected by the selector; a text-to-speech synthesis processor for linking the phoneme data of the speaker retrieved by the searcher to convert input data into a synthetic speech; and a fee-charge controller for controlling a fee-charge operation for the user in accordance with the phoneme data selected by the selector. Consequently, the user can perform text-to-speech synthesis on the desired input data such as drama data by use of the obtained phoneme data.
-
Citations
50 Claims
-
1. A text-to-speech synthesis apparatus comprising:
-
storage means for storing phoneme data of a plurality of speaker voices; selecting means for selecting at least two speaker voices from said plurality of speaker voices; searching means for searching said storage means for phoneme data of the speaker voices selected by said selecting means; and text-to-speech synthesis processing means for linking said phoneme data of said speaker voices retrieved by said searching means to convert input data into a synthetic speech; wherein said text-to-speech synthesis processing means can convert said input data into a synthetic speech including at least two speaker voices. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A text-to-speech synthesis apparatus comprising:
-
selecting means for selecting at least two speaker voices; transmitting means for transmitting speaker voice identification data for identifying said speaker voices selected by said selecting means to another apparatus; receiving means for receiving phoneme data of said speaker voices corresponding to said speaker voice identification data transmitted from said transmitting means; and text-to-speech synthesis processing means for linking said phoneme data of said speaker voices received by said receiving means to convert input data into a synthetic speech, wherein said text-to-speech synthesis processing means can convert said input data into a synthetic speech including at least said two speaker voices including said speaker voice corresponding to said speaker voice identification data. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20, 21, 22, 23)
-
-
24. A text-to-speech synthesis apparatus comprising:
-
a memory for storing phoneme data of a plurality of speaker voices; a selecting section for selecting any one of said plurality of speaker voices; a search section for searching said memory for the phoneme data of the speaker voices selected by said selecting section; a text-to-speech synthesis processing section for linking said phoneme data of said speaker voices retrieved by said search section to convert script data into a synthetic speech; a storage section for accumulating said synthetic speech converted from said script data on the basis of the phoneme data of said plurality of speaker voices; and a reproducing section for retrieving said synthetic speech of said speaker voices selected by said selecting section and reproducing said synthetic speech, wherein said text-to-speech synthesis processing means can convert said script into a synthetic speech including at least said two speaker voices including said speaker voice corresponding to said speaker voice identification data. - View Dependent Claims (25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35)
-
-
36. A text-to-speech synthesis apparatus comprising:
-
a selecting section for selecting at least two speaker voices; a transmitting section for transmitting, to another apparatus, speaker voice identification data for identifying said speaker voices selected by said selecting section; a receiving section for receiving phoneme data of the speaker voice corresponding to said speaker voice identification data transmitted by said transmitting section and a synthetic speech of said speaker voice; a text-to-speech synthesis processing section for linking said phoneme data of said speaker voice received by said receiving section to convert script data into a synthetic speech; and a reproducing section for reproducing said synthetic speech received by said receiving means; wherein said text-to-speech synthesis processing means can convert said script into a synthetic speech including at least said two speaker voices including said speaker voice corresponding to aid speaker voice identification data. - View Dependent Claims (37, 38, 39, 40, 41, 42, 43, 44, 45, 46)
-
-
47. A text-to-speech synthesis method comprising the steps of:
-
selecting at least two speaker voices; searching phoneme data of the speaker voices selected at selecting step; and text-to-speech synthesis processing for linking said phoneme data of said speaker voices retrieved in said searching step to convert input data into a synthetic speech, wherein said text-to-speech synthesis processing can convert said input data into a synthetic speech including at least said two speaker voices.
-
-
48. A computer readable recording medium device on which is stored a text-to-speech synthesis program which, when implemented by a computer, comprises acts of:
-
selecting at least two speaker voices; searching phoneme data of the speaker voices selected at selecting step; and text-to-speech synthesis processing for linking said phoneme data of said speaker voices retrieved in said searching step to convert input data into a synthetic speech; wherein said text-to-speech synthesis processing can convert said input data into a synthetic speech including at least said two speaker voices.
-
-
49. A text-to-speech synthesis method comprising the steps of:
-
selecting at least two speaker voices; transmitting speaker voice identification data for identifying said speaker voices selected in said selecting step to another apparatus; receiving phoneme data of said speaker voices corresponding to said speaker voice identification data transmitted in said transmitting step; and text-to-speech synthesis processing linking said phoneme data of said speaker voices received in said receiving step to convert input data into a synthetic speech; wherein said text-to-speech synthesis processing can convert said input data into a synthetic speech including at least said two speaker voices.
-
-
50. A computer readable recording medium device on which is stored a text-to-speech synthesis program which, when implemented by a computer, comprises acts of:
-
selecting at least two speaker voices; transmitting speaker voice identification data for identifying said speaker voices selected in said selecting step to another apparatus; receiving phoneme data of said speaker voices corresponding to said speaker voice identification data transmitted in said transmitting step; and text-to-speech synthesis processing linking said phoneme data of said speaker voice received in said receiving step to convert input data into a synthetic speech; wherein said text-to-speech synthesis processing can convert said input data into a synthetic speech including at least said two speaker voices.
-
Specification