Information processing apparatus, information processing method, recording medium, and program
First Claim
Patent Images
1. An information processing apparatus comprising:
- a text input mechanism configured to input text data;
a first display control configured to control display of a first display screen that aids a user to enter setting for speech synthesis;
a first setting input mechanism configured to control input of information representing the setting for speech synthesis, entered by the user with reference to the first display screen, display of which is controlled by said first display control;
a phoneme data holder configured to hold at least one kind of phoneme data used for speech synthesis;
a generator configured to divide the text data input via said text input means according to a predetermined rule to generate a plurality of text groups, the plurality of text groups including at least one phrase having more than one word; and
a speech synthesizer configured to execute speech synthesis using the phoneme data held in said phoneme data holder based on the setting for speech synthesis, input via said first setting input, to generate speech data corresponding to the text data;
wherein said first setting input means receives input of a plurality of settings for speech synthesis, and said speech synthesizer executes speech synthesis to generate speech data of different speech properties for adjacent ones of the plurality of text groups based on the plurality of settings for speech synthesis, input via said first setting input.
1 Assignment
0 Petitions
Accused Products
Abstract
Two types of voice can be set for reading text data of an electronic mail. A user selects a detailed setting button associated with one of the voice types to display a voice setting window, in which setting for the voice can be made individually. A drop-down list box include preset voice types such as woman, man, child, robot, and alien, and also names of voice types corresponding to phonemes created by the user, allowing selection thereof. In relation to a voice selected from the drop-down list box, reading speed, voice pitch, and strength of stress are set according to positions of setting levers.
25 Citations
13 Claims
-
1. An information processing apparatus comprising:
-
a text input mechanism configured to input text data; a first display control configured to control display of a first display screen that aids a user to enter setting for speech synthesis; a first setting input mechanism configured to control input of information representing the setting for speech synthesis, entered by the user with reference to the first display screen, display of which is controlled by said first display control; a phoneme data holder configured to hold at least one kind of phoneme data used for speech synthesis; a generator configured to divide the text data input via said text input means according to a predetermined rule to generate a plurality of text groups, the plurality of text groups including at least one phrase having more than one word; and a speech synthesizer configured to execute speech synthesis using the phoneme data held in said phoneme data holder based on the setting for speech synthesis, input via said first setting input, to generate speech data corresponding to the text data; wherein said first setting input means receives input of a plurality of settings for speech synthesis, and said speech synthesizer executes speech synthesis to generate speech data of different speech properties for adjacent ones of the plurality of text groups based on the plurality of settings for speech synthesis, input via said first setting input. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. An information processing method comprising:
-
receiving input of text data; controlling display of a display screen that aids a user to enter setting for speech synthesis; receiving input of information representing the setting for speech synthesis, entered by the user with reference to the display screen; holding step of holding at least one kind of phoneme data used for speech synthesis; dividing the received text data input according to a predetermined rule to generate a plurality of text groups, the plurality of text groups including at least one phrase having more than one word; and executing speech synthesis using the held phoneme data based on the setting for speech synthesis, to generate speech data corresponding to the text data; wherein input of a plurality of settings for speech synthesis is received in receiving input of information representing the setting for speech synthesis, and speech synthesis is executed to generate speech data of different speech properties for adjacent ones of the plurality of text groups based on the plurality of settings for speech synthesis.
-
-
13. A recording medium having recorded thereon a computer-readable program comprising instructions to:
-
receive input of text data; control display of a display screen that aids a user to enter a setting for speech synthesis; receive input of information representing the setting for speech synthesis, entered by the user with reference to the display screen; hold at least one kind of phoneme data used for speech synthesis; divide the text data input according to a predetermined rule to generate a plurality of text groups, the plurality of text groups including at least one phrase having more than one word; and execute speech synthesis using the held phoneme data based on the setting for speech synthesis, to generate speech data corresponding to the text data; wherein input of a plurality of settings for speech synthesis is received in receiving input of information representing the setting for speech synthesis and speech synthesis is executed to generate speech data of different speech properties for adjacent ones of the plurality of text groups based on the plurality of settings for speech synthesis, input in said setting input step.
-
Specification