System and method for blending synthetic voices
First Claim
Patent Images
1. A tangible computer-readable medium storing instructions for controlling a computing device to generate a synthetic voice, the instructions comprising:
- receiving a user selection of a first text-to-speech voice and a selected voice characteristic for modifying the first text-to-speech voice;
selecting the first text-to-speech voice from a plurality of text-to-speech voices;
selecting a second text-to-speech voice exhibiting the selected voice characteristic; and
presenting the user with a new text-to-speech voice comprising the first text-to-speech voice modified with at least the selected voice characteristic from the second text-to-speech voice.
17 Assignments
0 Petitions
Accused Products
Abstract
A system and method for generating a synthetic text-to-speech TTS voice are disclosed. A user is presented with at least one TTS voice and at least one voice characteristic. A new synthetic TTS voice is generated by blending a plurality of existing TTS voices according to the selected voice characteristics. The blending of voices involves interpolating segmented parameters of each TTS voice. Segmented parameters may be, for example, prosodic characteristics of the speech such as pitch, volume, phone durations, accents, stress, mis-pronunciations and emotion.
29 Citations
21 Claims
-
1. A tangible computer-readable medium storing instructions for controlling a computing device to generate a synthetic voice, the instructions comprising:
-
receiving a user selection of a first text-to-speech voice and a selected voice characteristic for modifying the first text-to-speech voice; selecting the first text-to-speech voice from a plurality of text-to-speech voices; selecting a second text-to-speech voice exhibiting the selected voice characteristic; and presenting the user with a new text-to-speech voice comprising the first text-to-speech voice modified with at least the selected voice characteristic from the second text-to-speech voice. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A method of generating a synthetic voice, the method comprising:
-
receiving a user selection of a first text-to-speech voice and a selected voice characteristic for modifying the first text-to-speech voice; selecting the first text-to-speech voice from a plurality of text-to-speech voices; selecting a second text-to-speech voice exhibiting the selected voice characteristic; and presenting the user with a new text-to-speech voice comprising the first text-to-speech voice modified with at least the selected voice characteristic from the second text-to-speech voice. - View Dependent Claims (12, 13, 14, 15, 16)
-
-
17. A system for generating a synthetic voice, the system comprising:
-
a first module configured to control a processor to receive a user selection of a first text-to-speech voice and a selected voice characteristic for modifying the first text-to-speech voice; a second module configured to control the processor to select the first text-to-speech voice from a plurality of text-to-speech voices; a third module for configured to control the processor to select a second text-to-speech voice exhibiting the selected voice characteristic; a fourth module configured to control the processor to present the user with a new text-to-speech comprising the first text-to-speech voice modified with the selected voice characteristic from the second text-to-speech voice. - View Dependent Claims (18, 19, 20, 21)
-
Specification