System and method for converting text-to-voice
First Claim
1. A method for converting text to concatenated voice by utilizing a digital voice library and a set of playback rules, the digital voice library including a plurality of speech items and a corresponding plurality of voice recordings wherein each speech item corresponds to at least one available voice recording wherein multiple voice recordings that correspond to a single speech item represent various inflections of that single speech item, the method including receiving text data, converting the text data into a sequence of speech items in accordance with the digital voice library, the method further comprising:
- establishing multiple voice recordings in the digital voice library that correspond to a single inflection of a single speech item, for a plurality of inflections of a plurality of speech items, that represent various ligatures for the single inflection of the single speech item with adjacent speech items.
4 Assignments
0 Petitions
Accused Products
Abstract
A method for converting text to concatenated voice by utilizing a digital voice library and a set of playback rules is provided. The digital voice library includes a plurality of speech items and a corresponding plurality of voice recordings. Each speech item corresponds to at least one available voice recording. Multiple voice recordings that correspond to a single speech item represent various inflections of that single speech item. The method includes receiving text data, converting the text data into a sequence of speech items in accordance with the digital voice library. The method further includes establishing multiple voice recordings in the digital voice library that correspond to a single inflection of a single speech item, for a plurality of inflections of a plurality of speech items, that represent various ligatures for the single inflection of the single speech item with adjacent speech items.
-
Citations
16 Claims
-
1. A method for converting text to concatenated voice by utilizing a digital voice library and a set of playback rules, the digital voice library including a plurality of speech items and a corresponding plurality of voice recordings wherein each speech item corresponds to at least one available voice recording wherein multiple voice recordings that correspond to a single speech item represent various inflections of that single speech item, the method including receiving text data, converting the text data into a sequence of speech items in accordance with the digital voice library, the method further comprising:
establishing multiple voice recordings in the digital voice library that correspond to a single inflection of a single speech item, for a plurality of inflections of a plurality of speech items, that represent various ligatures for the single inflection of the single speech item with adjacent speech items. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
9. A method for converting text to concatenated voice by utilizing a digital voice library and a set of playback rules, the digital voice library including a plurality of speech items and a corresponding plurality of voice recordings wherein each speech item corresponds to at least one available voice recording wherein multiple voice recordings that correspond to a single speech item represent various inflections of that single speech item, the method including receiving text data, converting the text data into a sequence of speech items in accordance with the digital voice library, the method further comprising:
-
establishing multiple voice recordings in the digital voice library that correspond to a single inflection of a single speech item, for a plurality of inflections of a plurality of speech items, that represent various ligatures for the single inflection of the single speech item with adjacent speech items;
determining a desired inflection for each speech item in the sequence of speech items based on the set of playback rules;
determining a sequence of voice recordings by determining a voice recording for each speech item based on the desired inflection for the particular speech item, the available voice recordings that correspond to the particular speech item, and the ligatures for the particular speech item with adjacent speech items; and
generating voice data based on the sequence of voice recordings by concatenating adjacent recordings in the sequence of voice recordings. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
Specification