Speech processing for telephony API
First Claim
1. A computing device to enhance media processing of a text stream, comprising:
- a terminal data structure to instantiate terminal objects;
a speech generation terminal data structure that extends the terminal data structure;
a module to instantiate, based on the terminal data structure and the speech generation terminal data structure, terminal objects including a speech terminal object to generate speech; and
a telephony API provided by a TAPI application component to form a connection with a client and to process the text stream by selecting the speech terminal object from among a group of registered terminal objects adhering to a uniform interface and by providing the text stream to the selected speech terminal object, wherein various programmers can provide registerable telephony services using terminal objects that rely on the telephony API, the TAPI application component for registering terminal objects selectable for instantiation with a terminal manager by providing a terminal class name, a media type, and a procedure of signaling events and for providing a list of registered terminal objects including the media type of a registered terminal object.
1 Assignment
0 Petitions
Accused Products
Abstract
Systems, methods, and structures are discussed that enhance media processing. One aspect of the present invention includes a data structure to enhance media processing. The data structure includes a terminal data structure to instantiate terminal objects and a speech recognition terminal data structure that extends the terminal data structure. Another aspect of the present invention includes a data structure to enhance media processing. This data structure includes a terminal data structure to instantiate terminal objects and a speech generation terminal data structure that extends the terminal data structure. These data structures may be used to implement an internet protocol interactive voice response system, an internet protocol unified message system, and speech-enabled Web applications.
60 Citations
22 Claims
-
1. A computing device to enhance media processing of a text stream, comprising:
-
a terminal data structure to instantiate terminal objects; a speech generation terminal data structure that extends the terminal data structure; a module to instantiate, based on the terminal data structure and the speech generation terminal data structure, terminal objects including a speech terminal object to generate speech; and a telephony API provided by a TAPI application component to form a connection with a client and to process the text stream by selecting the speech terminal object from among a group of registered terminal objects adhering to a uniform interface and by providing the text stream to the selected speech terminal object, wherein various programmers can provide registerable telephony services using terminal objects that rely on the telephony API, the TAPI application component for registering terminal objects selectable for instantiation with a terminal manager by providing a terminal class name, a media type, and a procedure of signaling events and for providing a list of registered terminal objects including the media type of a registered terminal object. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A computer-readable storage medium to enhance media processing, comprising:
-
a terminal data structure to instantiate terminal objects; a speech generation terminal data structure that extends the terminal data structure, wherein the speech generation terminal data structure includes a method member speak for synthesizing text to audio; and a telephony API provided by a TAPI application component to form a connection with a client in response to receiving a call from the client, and to process text by selecting a speech terminal object from among a group of registered terminal objects adhering to a uniform interface and by providing the text to the selected speech terminal object for converting text to speech that is provided to the client via the received call, the TAPI application component for registering terminal objects selectable for instantiation with a terminal manager by providing a terminal class name, a media type, and a procedure of signaling events and for providing a list of registered terminal objects including the media type of a registered terminal object. - View Dependent Claims (13, 14, 15, 16)
-
-
17. A method for enhancing media processing, comprising:
-
invoking a telephony API to listen for incoming calls, wherein the telephony API is provided by a TAPI application component, the TAPI application component for registering terminal objects that are selectable for instantiation with a terminal manager by providing a terminal class name, a media type, and a method of signaling events and for providing a list of registered terminal objects including the media types of the registered terminal objects; receiving a call from a client; upon receiving the call, invoking the telephony API to form a connection with the client and to select a speech generation terminal object from among a group of registered terminal objects adhering to a uniform interface, the selected speech generation terminal object for generating speech from a text stream; invoking a method of the terminal object to generate speech; and invoking the telephony API to communicate the generated speech to the client. - View Dependent Claims (18, 19, 20, 21)
-
-
22. A system to enhance processing of a text stream, comprising:
-
a terminal object to render the text stream into speech; and a telephony API provided by a TAPI application component to form a connection with a client and to process the text stream by selecting the speech terminal object from among a group of registered terminal objects adhering to a uniform interface and by providing the text stream to the selected speech terminal object, the TAPI application component for registering terminal objects that are selectable for instantiation with a terminal manager by providing a terminal class name, a media type, and a procedure of signaling events and for providing a list of registered terminal objects including the media type of a registered terminal object.
-
Specification