Multi-modal voice-enabled content access and delivery system
First Claim
1. A multi-modal content access and delivery system comprising:
- a data session server connected to a network and to user devices requesting content from at least one back-end application via the network, the data session server being configured to maintain only one back-end session with the back-end application, and to maintain multiple sessions for respective ones of the user devices and different modes of communication employed by the user devices which are all accessing the same back-end session to interact with the requested content simultaneously, the user devices accessing the back-end application via speech employing a speech interface to the data session server;
wherein the data session server is configured to receive a mark-up language page from the back-end application, the mark-up language page comprising directives for different modes of communication specified by the back-end application to format and provide the requested content to the respective ones of the user devices and the different modes of communication employed by the user devices via their corresponding multiple sessions, the data session server also being configured to fill in templates in response to corresponding ones of the directives specified in the mark-up language page for the different modes of communication to create the requested content in accordance with the protocol needed for the different modes of communication of the user devices, the templates comprising at least one of preset templates stored at the data session server and templates obtained by the data session server from the network.
8 Assignments
0 Petitions
Accused Products
Abstract
A voice-enabled system for online content access and delivery provides a voice and telephony interface, as well a text and graphic interface, for browsing and accessing requested content or shopping over the Internet using a browser or a telephone. The system allows customers to access an online data application, search for desired content items, select content items, and finally pay for selected items using a credit card, over a phone line or the Internet. A telephony-Internet interface converts spoken queries into electronic commands for transmission to an online data application. Markup language-type pages transmitted to callers from the online data application are parsed to extract selected information. The selected information is then reported to the callers via audio messaging. A voice-enabled technology for mobile multi-modal interaction is also provided.
153 Citations
14 Claims
-
1. A multi-modal content access and delivery system comprising:
-
a data session server connected to a network and to user devices requesting content from at least one back-end application via the network, the data session server being configured to maintain only one back-end session with the back-end application, and to maintain multiple sessions for respective ones of the user devices and different modes of communication employed by the user devices which are all accessing the same back-end session to interact with the requested content simultaneously, the user devices accessing the back-end application via speech employing a speech interface to the data session server; wherein the data session server is configured to receive a mark-up language page from the back-end application, the mark-up language page comprising directives for different modes of communication specified by the back-end application to format and provide the requested content to the respective ones of the user devices and the different modes of communication employed by the user devices via their corresponding multiple sessions, the data session server also being configured to fill in templates in response to corresponding ones of the directives specified in the mark-up language page for the different modes of communication to create the requested content in accordance with the protocol needed for the different modes of communication of the user devices, the templates comprising at least one of preset templates stored at the data session server and templates obtained by the data session server from the network. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A method of providing multi-modal voice-enabled access to and delivery of content comprising the steps of:
-
creating multiple sessions to receive requests for content from different user interfaces and different modes of communication employed by a user interface; creating a back-end session with a back-end application that can provide the requested content, the different user interfaces and the different modes of communication employed by a user interface all accessing the back-end session to interact with the requested content simultaneously, the user interfaces accessing the back-end application via speech employing a speech interface to access the back-end session; receiving a mark-up language page comprising directives from the back-end application via the back-end session for interacting with the different user interfaces and the different modes of communication employed by a user interface, the directives comprising templates identified by the back-end application using a Universal Resource Locator (URL) address; retrieving templates comprising at least one of obtaining the templates from a network using the URL address and retrieving from a memory a preset template; and completing corresponding ones of the templates to provide the requested data in accordance with the protocol needed for at least two or more of the different user interfaces and the different modes of communication employed by a user interface. - View Dependent Claims (12, 13, 14)
-
Specification