Method for synthesizing speech from text and for spelling all or portions of the text by analogy
First Claim
Patent Images
1. A method of synthesizing human audible speech from a multi-word string of text, the method comprising the steps of:
- treating the multi-word string as a single prosodic paragraph by performing the steps of;
assigning a pitch to the beginning of the multi-word string that is higher than at the end of the multi-word string; and
assigning a pitch to a final end point of the string that is lower than the pitch at any point within the string;
including, in the multi-word string, following at least one of the individual words in the multi-word string, the corresponding spelling of the individual word;
treating each individual word in the multi-word string as a single word declarative sentence;
treating the spelling of each individual word included in the multi-word string as a single word declarative sentence;
grouping each individual word and the corresponding spelling of the individual word into a prosodic group within the single prosodic paragraph, the prosodic group having a higher pitch at the beginning of the prosodic group than at the end of said prosodic group; and
generating speech from the multi-word string as a function of the prosodic groupings and assigned pitch.
7 Assignments
0 Petitions
Accused Products
Abstract
Improved automated synthesis of human audible speech from text is disclosed. Performance enhancement of the underlying text comprehensibility is obtained through prosodic treatment of the synthesized material, improved speaking rate treatment, and improved methods of spelling words or terms for the sysstem user. Prosodic shaping of text sequences appropriate for the discourse in large groupings of text segments, with prosodic boundaries developed to indicate conceptual units within the text groupings, is implemented in a preferred embodiment.
-
Citations
22 Claims
-
1. A method of synthesizing human audible speech from a multi-word string of text, the method comprising the steps of:
-
treating the multi-word string as a single prosodic paragraph by performing the steps of; assigning a pitch to the beginning of the multi-word string that is higher than at the end of the multi-word string; and assigning a pitch to a final end point of the string that is lower than the pitch at any point within the string; including, in the multi-word string, following at least one of the individual words in the multi-word string, the corresponding spelling of the individual word; treating each individual word in the multi-word string as a single word declarative sentence; treating the spelling of each individual word included in the multi-word string as a single word declarative sentence; grouping each individual word and the corresponding spelling of the individual word into a prosodic group within the single prosodic paragraph, the prosodic group having a higher pitch at the beginning of the prosodic group than at the end of said prosodic group; and generating speech from the multi-word string as a function of the prosodic groupings and assigned pitch. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A method of synthesizing speech from a segment of text including a first word, comprising the step of:
-
inserting after the first word, the spelling of the first word; and generating speech corresponding to the first word and the spelling of the first word. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20, 21, 22)
-
Specification