Prosodic speech text codes and their use in computerized speech systems
First Claim
1. A computer-implemented method of synthesizing speech from text, the method comprising:
- inputting text to be synthesized to a computerized system;
electronically marking the text with electronic versions of one or more prosodic graphical symbols, the electronically marked text being displayable or printable as human-readable text marked with the prosodic graphical symbols, to indicate to a speaker desired speech characteristics to be employed in speaking the text, wherein the prosodic graphical symbols indicate a desired prosody and include intelligibility pronunciation notations in sequence with the text and pitch change notations in sequence with the text; and
generating a synthesized speech output comprising phonetic data corresponding with the marked up text and having the prosody indicated by the prosodic graphical symbols.
1 Assignment
0 Petitions
Accused Products
Abstract
A method of, and system for, acoustically coding text for use in the synthesis of speech from the text. The method includes marking the text to be spoken with one or more graphic symbols to indicate to a speaker a desired prosody to impart to the spoken text. The markups can include grapheme-phoneme pairs each wherein a visible prosodic-indicating grapheme is employed with written text and a corresponding digital phoneme is functional in the digital domain. The invention is useful in the generation of appealing, humanized machine speech for a wide range of applications, including voice mail systems, electronically enabled appliances, automobiles, computers, robotic assistants, games and the like, in spoken books and magazines, drama and other entertainment.
19 Citations
20 Claims
-
1. A computer-implemented method of synthesizing speech from text, the method comprising:
-
inputting text to be synthesized to a computerized system; electronically marking the text with electronic versions of one or more prosodic graphical symbols, the electronically marked text being displayable or printable as human-readable text marked with the prosodic graphical symbols, to indicate to a speaker desired speech characteristics to be employed in speaking the text, wherein the prosodic graphical symbols indicate a desired prosody and include intelligibility pronunciation notations in sequence with the text and pitch change notations in sequence with the text; and generating a synthesized speech output comprising phonetic data corresponding with the marked up text and having the prosody indicated by the prosodic graphical symbols. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. A computerized speech synthesizer for synthesizing speech from text comprising a data processing unit, data memory, and a text input for inputting text to be synthesized to the speech synthesizer, and a speech synthesis program for:
-
electronically marking text input to the speech synthesizer with electronic versions of one or more prosodic graphical symbols, the electronically marked text being displayable or printable as human-readable text marked with the prosodic graphical symbols, to indicate to a speaker a desired speech characteristics to be employed in speaking the text, wherein the prosodic graphical symbols indicate a desired prosody and include intelligibility pronunciation notations in sequence with the text and pitch change notations in sequence with the text; and generating a synthesized speech output comprising phonetic data corresponding with the marked up text and having the prosody indicated by the prosodic graphical symbols.
-
Specification