System for speech-enabled web applications
Abstract
Systems, methods, and structures are discussed that enhance media processing. One aspect of the present invention includes a data structure to enhance media processing. The data structure includes a terminal data structure to instantiate terminal objects and a speech recognition terminal data structure that extends the terminal data structure. Another aspect of the present invention includes a data structure to enhance media processing. This data structure includes a terminal data structure to instantiate terminal objects and a speech generation terminal data structure that extends the terminal data structure. These data structures may be used to implement an internet protocol interactive voice response system, an internet protocol unified message system, and speech-enabled Web applications.
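As a rough illustration of the extension relationship described in the abstract, the sketch below models a base terminal type and two speech terminals that extend it. All class and method names (`Terminal`, `SpeechRecognitionTerminal`, `SpeechGenerationTerminal`, `process`) and the media type strings are hypothetical; they are not taken from the patent or from any real telephony API, and the recognition and generation bodies are placeholders.

```python
from abc import ABC, abstractmethod


class Terminal(ABC):
    """Hypothetical base terminal: the interface every terminal shares."""

    media_type = "unknown"

    @abstractmethod
    def process(self, data):
        """Consume or produce media; concrete terminals define the behaviour."""


class SpeechRecognitionTerminal(Terminal):
    """Extends the base terminal; stands in for a speech recognizer."""

    media_type = "audio-in"

    def process(self, data):
        # Placeholder: a real terminal would run a recognizer on audio input.
        return f"recognized:{data}"


class SpeechGenerationTerminal(Terminal):
    """Extends the base terminal; stands in for a speech synthesizer."""

    media_type = "audio-out"

    def process(self, data):
        # Placeholder: a real terminal would synthesize audio from text.
        return f"spoken:{data}"
```

Because both speech terminals derive from the same base type, code that works with a generic terminal could use either one without knowing which speech engine sits behind it.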
78 Citations
19 Claims
1. A system to enhance speech-enabled Web applications, comprising:
a Web page that includes voice tags;
a voice browser;
a terminal object to interpret the voice tags; and
a telephony API used by the voice browser to form a connection with a client and process the voice tags by selecting the terminal object from among a group of registered terminal objects adhering to a uniform interface and providing the voice tags to the selected terminal object, the selected terminal object for interpreting the voice tags, wherein various programmers can provide registerable telephony services using terminal objects that rely on the telephony API, wherein the telephony API is provided by a TAPI application component, the TAPI application component for registering pluggable terminal objects with a terminal manager by providing a terminal class name, a media type, and a method of signaling events and for providing a list of registered terminal objects including the registered media type.
Dependent claims: 2, 3, 4, 5
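The registration and selection behaviour recited in claim 1 can be pictured with a minimal sketch. It assumes a terminal manager that stores a class name, a media type, and an event callback for each pluggable terminal, lists what is registered, and picks a terminal by media type. Every name here (`TerminalManager`, `Registration`, `VoiceTagTerminal`, `interpret`, the `"voice-tags"` media type) is an assumption made for illustration, not the patent's implementation or the actual TAPI interface.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple


@dataclass
class Registration:
    """What a pluggable terminal supplies when it registers (hypothetical)."""
    class_name: str
    media_type: str
    on_event: Callable[[str], None]   # assumed event-signaling method
    factory: Callable[[], object]     # creates the terminal object on demand


class TerminalManager:
    """Hypothetical manager holding the registered terminal classes."""

    def __init__(self) -> None:
        self._registry: Dict[str, Registration] = {}

    def register(self, class_name, media_type, on_event, factory) -> None:
        self._registry[class_name] = Registration(
            class_name, media_type, on_event, factory
        )

    def list_registered(self) -> List[Tuple[str, str]]:
        # Each entry includes the media type given at registration.
        return [(r.class_name, r.media_type) for r in self._registry.values()]

    def select(self, media_type: str):
        # Pick the first registered terminal that handles this media type.
        for reg in self._registry.values():
            if reg.media_type == media_type:
                reg.on_event(f"selected {reg.class_name}")
                return reg.factory()
        raise LookupError(f"no terminal registered for {media_type!r}")


class VoiceTagTerminal:
    """Stand-in terminal that 'interprets' voice tags from a Web page."""

    def interpret(self, tags):
        return [f"handled <{t}>" for t in tags]


events = []
manager = TerminalManager()
manager.register("VoiceTagTerminal", "voice-tags", events.append, VoiceTagTerminal)
terminal = manager.select("voice-tags")
result = terminal.interpret(["prompt", "menu"])
```

In this sketch the caller depends only on the manager and on the `interpret` method, which is one way to read the claim's point that different programmers can plug in their own terminals behind a uniform interface.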
6. A method for web-based navigation using voice commands, comprising:
invoking a telephony API to listen for incoming calls, wherein the telephony API is provided by a TAPI application component, the TAPI application component for registering pluggable terminal objects with a terminal manager by providing a terminal class name, a media type, and a method of signaling events and for providing a list of registered terminal objects including the registered media type;
receiving a call from a client;
receiving a voice command from the client requesting navigation to a web page hosted by a web server;
invoking the telephony API to instantiate a terminal object suitable for interpreting the voice command and converting the voice command to a web-based navigation command; and
sending the web-based navigation command to the web server hosting the web page, wherein the telephony API provides functions to form a connection with a client and process voice commands by selecting a terminal object from among a group of registered terminal objects adhering to a uniform interface, the selected terminal object for interpreting the voice commands, wherein various programmers can provide registerable telephony services using terminal objects that rely on the telephony API.
Dependent claims: 7, 8, 9, 10, 11, 12, 13, 14
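The conversion step in claim 6 (voice command in, web-based navigation command out) might look roughly like the sketch below. It assumes the utterance has already been recognized as text, that the server exposes a small table of named pages, and that a navigation command is a simple `GET` line; the function names, the page table, and the command format are all hypothetical and only illustrate the flow.

```python
def to_navigation_command(voice_command, known_pages):
    """Map a recognized utterance such as 'go to news' to a request line.

    known_pages is an assumed mapping of spoken page names to paths.
    Returns None when no known page name appears in the utterance.
    """
    words = voice_command.lower().split()
    for name, path in known_pages.items():
        if name in words:
            return f"GET {path}"
    return None


def handle_call(voice_command, known_pages, send):
    """Convert the caller's command and pass it to the web server via send."""
    command = to_navigation_command(voice_command, known_pages)
    if command is not None:
        send(command)
    return command


sent = []  # stands in for the connection to the web server
pages = {"news": "/news.html", "weather": "/weather.html"}
handle_call("Go to weather", pages, sent.append)
handle_call("Sing a song", pages, sent.append)  # no matching page, nothing sent
```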
15. A computer-readable medium encoded with computer-executable instructions for describing a web page to a client accessing the web page by phone, by a method comprising:
receiving a web page from a web server;
invoking a telephony API to instantiate a terminal object suitable for interpreting the web page and rendering the web page into speech, wherein the telephony API is provided by a TAPI application component, the TAPI application component for registering pluggable terminal objects with a terminal manager by providing a terminal class name, a media type, and a method of signaling events and for providing a list of registered terminal objects including the registered media type; and
sending the speech to the client connected through the telephony API, wherein the telephony API provides functions to form a connection with the client and render web pages into speech by selecting a terminal object from among a group of registered terminal objects adhering to a uniform interface, the selected terminal object for rendering web pages into speech, wherein various programmers can provide registerable telephony services using terminal objects that rely on the telephony API.
Dependent claims: 16, 17, 18, 19
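For claim 15, one plausible first stage of rendering a web page into speech is extracting the text a caller should hear. The sketch below does only that, using Python's standard `html.parser` module; the class and function names are hypothetical, and the actual speech synthesis (which the claim assigns to a terminal object) is deliberately left out.

```python
from html.parser import HTMLParser


class SpeakableText(HTMLParser):
    """Collects visible text from a page, skipping script and style content."""

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip = 0  # depth inside <script> or <style>

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip:
            self._skip -= 1

    def handle_data(self, data):
        text = data.strip()
        if text and not self._skip:
            self.parts.append(text)


def page_to_utterances(html):
    """Return the text fragments a speech terminal would be asked to speak."""
    parser = SpeakableText()
    parser.feed(html)
    parser.close()
    return parser.parts


utterances = page_to_utterances(
    "<html><head><title>News</title><style>p{}</style></head>"
    "<body><h1>Top stories</h1><p>Rain today.</p></body></html>"
)
```

Each returned fragment could then be handed to whichever speech generation terminal the telephony layer selects, keeping page parsing separate from synthesis.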
Specification