Information processing apparatus, information processing method, recording medium, and program
Abstract
Two types of voice can be set for reading aloud the text data of an electronic mail. A user selects the detailed setting button associated with one of the voice types to display a voice setting window, in which settings for that voice can be made individually. A drop-down list box includes preset voice types such as woman, man, child, robot, and alien, as well as the names of voice types corresponding to phonemes created by the user, allowing any of them to be selected. For the voice selected from the drop-down list box, reading speed, voice pitch, and strength of stress are set according to the positions of setting levers.
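The settings described in the abstract can be pictured as a small record per voice type. The following is a minimal sketch only, not the patented implementation: the preset names are taken from the abstract, while the `VoiceSetting` class, the `available_voices` helper, and the 0 to 10 lever range are assumptions introduced for illustration.

```python
from dataclasses import dataclass

# Preset names come from the abstract; everything else here is hypothetical.
PRESET_VOICES = ("woman", "man", "child", "robot", "alien")


@dataclass(frozen=True)
class VoiceSetting:
    """One voice type with its three lever positions (assumed range 0..10)."""
    voice: str        # preset name or name of a user-created voice
    speed: int = 5    # reading speed
    pitch: int = 5    # voice pitch
    stress: int = 5   # strength of stress

    def __post_init__(self):
        # Reject lever positions outside the assumed range.
        for name in ("speed", "pitch", "stress"):
            value = getattr(self, name)
            if not 0 <= value <= 10:
                raise ValueError(f"{name} lever out of range: {value}")


def available_voices(user_voices):
    """Return drop-down entries: presets first, then user-created voices."""
    return list(PRESET_VOICES) + [v for v in user_voices if v not in PRESET_VOICES]


# Two voice types, each configured individually.
voice_1 = VoiceSetting("woman", speed=6)
voice_2 = VoiceSetting("robot", pitch=3, stress=8)
```

Keeping each voice type as an immutable record mirrors the idea that the two voices are configured separately in their own setting windows.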
14 Claims
1. An information processing apparatus comprising:
text input means for receiving input of text data;
first display control means for controlling display of a first display screen that aids a user to enter setting for speech synthesis;
first setting input means for receiving input of information representing the setting for speech synthesis, entered by the user with reference to the first display screen, display of which is controlled by said first display control means;
phoneme data holding means for holding at least one kind of phoneme data used for speech synthesis;
generation means for dividing the text data input via said text input means according to a predetermined rule to generate a plurality of text groups; and
speech synthesis means for executing speech synthesis using the phoneme data held in said phoneme data holding means based on the setting for speech synthesis, input via said first setting input means, to generate speech data corresponding to the text data;
wherein said first setting input means receives input of a plurality of settings for speech synthesis, and said speech synthesis means executes speech synthesis to generate speech data of different speech properties for adjacent ones of the plurality of text groups based on the plurality of settings for speech synthesis, input via said first setting input means.
Dependent claims: 2 to 11.
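The generation and synthesis elements of claim 1 can be sketched as a two-stage plan: divide the text into groups, then pair adjacent groups with different settings. This is an illustrative sketch under stated assumptions. The claim only says "a predetermined rule", so the blank-line paragraph rule used here is hypothetical, as are the function names and the plain-string setting labels. No actual speech synthesis is performed; the output is only the plan a synthesizer would consume.

```python
from itertools import cycle


def split_into_groups(text):
    """Hypothetical rule: one text group per blank-line-separated paragraph."""
    groups = [" ".join(block.split()) for block in text.split("\n\n")]
    return [g for g in groups if g]  # drop empty groups


def assign_settings(groups, settings):
    """Pair each group with a setting, cycling so adjacent groups differ."""
    # Distinct settings guarantee neighbours differ, including at wrap-around.
    if len(settings) < 2 or len(set(settings)) != len(settings):
        raise ValueError("need at least two distinct settings")
    return list(zip(groups, cycle(settings)))


text = "Hello there.\n\nHow are you?\n\nSee you soon."
plan = assign_settings(split_into_groups(text), ["voice 1", "voice 2"])
# plan pairs the three paragraphs with "voice 1", "voice 2", "voice 1" in turn
```

Cycling through the settings is one simple way to satisfy the "different speech properties for adjacent ones" condition; other assignment schemes would also fit the claim wording.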
12. An information processing method comprising:
a text input step of receiving input of text data;
a display control step of controlling display of a display screen that aids a user to enter setting for speech synthesis;
a setting input step of receiving input of information representing the setting for speech synthesis, entered by the user with reference to the display screen, display of which is controlled in said display control step;
a phoneme data holding step of holding at least one kind of phoneme data used for speech synthesis;
a generation step of dividing the text data input in said text input step according to a predetermined rule to generate a plurality of text groups; and
a speech synthesis step of executing speech synthesis using the phoneme data held in said phoneme data holding step based on the setting for speech synthesis, input in said setting input step, to generate speech data corresponding to the text data;
wherein input of a plurality of settings for speech synthesis is received in said setting input step, and speech synthesis is executed in said speech synthesis step to generate speech data of different speech properties for adjacent ones of the plurality of text groups based on the plurality of settings for speech synthesis, input in said setting input step.
13. A recording medium having recorded thereon a computer-readable program comprising:
a text input step of receiving input of text data;
a display control step of controlling display of a display screen that aids a user to enter setting for speech synthesis;
a setting input step of receiving input of information representing the setting for speech synthesis, entered by the user with reference to the display screen, display of which is controlled in said display control step;
a phoneme data holding step of holding at least one kind of phoneme data used for speech synthesis;
a generation step of dividing the text data input in said text input step according to a predetermined rule to generate a plurality of text groups; and
a speech synthesis step of executing speech synthesis using the phoneme data held in said phoneme data holding step based on the setting for speech synthesis, input in said setting input step, to generate speech data corresponding to the text data;
wherein input of a plurality of settings for speech synthesis is received in said setting input step, and speech synthesis is executed in said speech synthesis step to generate speech data of different speech properties for adjacent ones of the plurality of text groups based on the plurality of settings for speech synthesis, input in said setting input step.
14. A program for having a computer execute a process comprising:
a text input step of receiving input of text data;
a display control step of controlling display of a display screen that aids a user to enter setting for speech synthesis;
a setting input step of receiving input of information representing the setting for speech synthesis, entered by the user with reference to the display screen, display of which is controlled in said display control step;
a phoneme data holding step of holding at least one kind of phoneme data used for speech synthesis;
a generation step of dividing the text data input in said text input step according to a predetermined rule to generate a plurality of text groups; and
a speech synthesis step of executing speech synthesis using the phoneme data held in said phoneme data holding step based on the setting for speech synthesis, input in said setting input step, to generate speech data corresponding to the text data;
wherein input of a plurality of settings for speech synthesis is received in said setting input step, and speech synthesis is executed in said speech synthesis step to generate speech data of different speech properties for adjacent ones of the plurality of text groups based on the plurality of settings for speech synthesis, input in said setting input step.
Specification