PHONETIC DECODING AND CONCATENTIVE SPEECH SYNTHESIS
First Claim
1. A speech processing system for receiving speech data from a speaker during a conversation turn in a conversation session, said speech processing system comprising:
- a phoneme recognition engine for converting the received speech data to an input string of acoustic data;
a phoneme modification engine for changing at least one item of acoustic data in said input string according to one or more rules to form at least one output string of acoustic data; and
a phoneme speech engine for converting each formed output string of acoustic data to output speech data for output to at least one listener.
8 Assignments
0 Petitions
Accused Products
Abstract
A speech processing system includes a multiplexer that receives speech data input as part of a conversation turn in a conversation session between two or more users where one user is a speaker and each of the other users is a listener in each conversation turn. A speech recognizing engine converts the speech data to an input string of acoustic data while a speech modifier forms an output string based on the input string by changing an item of acoustic data according to a rule. The system also includes a phoneme speech engine for converting the first output string of acoustic data including modified and unmodified data to speech data for output via the multiplexer to listeners during the conversation turn.
223 Citations
17 Claims
-
1. A speech processing system for receiving speech data from a speaker during a conversation turn in a conversation session, said speech processing system comprising:
-
a phoneme recognition engine for converting the received speech data to an input string of acoustic data; a phoneme modification engine for changing at least one item of acoustic data in said input string according to one or more rules to form at least one output string of acoustic data; and a phoneme speech engine for converting each formed output string of acoustic data to output speech data for output to at least one listener. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A method of processing speech comprising:
-
receiving speech data from a speaker during a conversation turn in a conversation session; converting the received speech data to an input string of acoustic data; changing at least one item of acoustic data in said input string according to one or more rules to form at least one output string of acoustic data; and converting each formed output string of acoustic data to output speech data for output to at least one listener. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. A computer program product for processing speech comprising a computer usable medium having computer usable program code embodied therewith, said computer usable program code comprising:
-
computer usable program code configured to receive the speech data from a speaker during a conversation turn in a conversation session; computer usable program code configured to convert the received speech data to an input string of acoustic data; computer usable program code configured to change at least one item of acoustic data in said input string according to one or more rules to form at least one output string of acoustic data; and computer usable program code configured to convert each formed output string of acoustic data to output speech data for output to at least one listener. - View Dependent Claims (14, 15, 16, 17)
-
Specification