Apparatus and methods for providing an audibly controlled user interface for audio-based communication devices
First Claim
1. A method in a browser for providing an audibly controlled user interface for a limited communication device, the steps comprising:
- receiving speech input information over an interface connection capable of two-way communication with the limited communication device;
generating at least one key chunk of information based on the speech input information;
generating an audio output developed from a response document based on the at least one key chunk of information; and
providing the audio output over the interface connection to the limited communication device in response to generating the audio output, wherein the audio output comprises;
providing the at least one key chunk of information to a web application; and
receiving the response document from the web application, the response document developed from an application-defining document accessed in response to the at least one key chunk of information provided to the web application;
wherein the step of generating the at least one key chunk of information comprises generating the at least one key chunk of information by an automatic speech recognition module deriving the at least one key chunk of information from the speech input information and further comprising;
sending and receiving web pages to and from an application server according to a hypertext transfer protocol;
parsing XML tags appearing within web pages received by the browser;
Performing basic telephony functions including detecting an on-hook condition and an off-hook condition of a user device and sending and receiving audio signals to and from the user device; and
selectively implementing hypertext markup language (HTML) and XML tags appearing within the web pages received by the browser based on capabilities of the user device, the capabilities being stored in a device capabilities table which includes for the user device a unique device identifier, a network address selected from a telephone number and an IP address and a specification of capabilities of the user device, the specification of capabilities including whether the user device accepts only text data whether the user device is able to respond to multiple prompts, whether the user device accepts digital audio data or only analog audio data, whether the user device has a microphone for generating analog audio signals, and whether the user device has an analog to digital converter for converting the analog audio signals to digital audio data.
1 Assignment
0 Petitions
Accused Products
Abstract
The invention is directed to techniques for providing an audibly controlled interface for a user of a limited audio-based communication device, for example, a telephony device such as a desktop telephone or a cellular telephone. The communication device has an interface connection with a proxy browser. The user initially accesses the device, such as by picking up the handset, and the proxy browser provides a communication path over a network to a call services application on an application server. The application server provides a response to the initial access signal. The proxy browser receives the response from the application server and plays back an audio output based on the response to the communication device for the user. The user can then respond with a request to the call services application to place an outbound call or to initiate another service provided by the application server via the proxy browser.
67 Citations
15 Claims
-
1. A method in a browser for providing an audibly controlled user interface for a limited communication device, the steps comprising:
-
receiving speech input information over an interface connection capable of two-way communication with the limited communication device; generating at least one key chunk of information based on the speech input information; generating an audio output developed from a response document based on the at least one key chunk of information; and providing the audio output over the interface connection to the limited communication device in response to generating the audio output, wherein the audio output comprises;
providing the at least one key chunk of information to a web application; andreceiving the response document from the web application, the response document developed from an application-defining document accessed in response to the at least one key chunk of information provided to the web application; wherein the step of generating the at least one key chunk of information comprises generating the at least one key chunk of information by an automatic speech recognition module deriving the at least one key chunk of information from the speech input information and further comprising; sending and receiving web pages to and from an application server according to a hypertext transfer protocol; parsing XML tags appearing within web pages received by the browser; Performing basic telephony functions including detecting an on-hook condition and an off-hook condition of a user device and sending and receiving audio signals to and from the user device; and selectively implementing hypertext markup language (HTML) and XML tags appearing within the web pages received by the browser based on capabilities of the user device, the capabilities being stored in a device capabilities table which includes for the user device a unique device identifier, a network address selected from a telephone number and an IP address and a specification of capabilities of the user device, the specification of capabilities including whether the user device accepts only text data whether the user device is able to respond to multiple prompts, whether the user device accepts digital audio data or only analog audio data, whether the user device has a microphone for generating analog audio signals, and whether the user device has an analog to digital converter for converting the analog audio signals to digital audio data. - View Dependent Claims (2, 3, 4, 5, 6, 12)
-
-
7. A processor-based system for providing an audibly controlled interface for a limited communication device, the processor-based system comprising:
-
an interface connection capable of two-way communication with the limited communication device; and a proxy browser in communication with the interface connection;
wherein the interface connection receives speech input information and provides the speech input information to the proxy browser;
the proxy browser generates at least one key chunk of information based on the speech input information;the proxy browser generates an audio output developed from a response document based on the at least one key chunk of information and provides the audio output to the interface connection; and the interface connection provides the audio output to the limited communication device, wherein the audio output comprises;
providing the at least one key chunk of information to a web application; and
receiving the response document from the web application, the response document developed from an application-defining document accessed in response to the at least one key chunk of information provided to the web application;an automatic speech recognition module, wherein the automatic speech recognition module derives the at least one key chunk of information from the speech input information received over the interface connection; wherein the proxy browser comprises; a web browser configured for sending and receiving web pages to and from an application server according to a hypertext transfer protocol; an XML parser operative to parse XML tags appearing within web pages received by the web browser a device interface operative to perform basic telephony functions including detecting an on-hook condition and an off-hook condition of a user device, and sending and receiving audio signals to and from the user device; and a voice resource control configured for selectively implementing hypertext markup language (HTML) and XML tags appearing within the web pages received by the browser based on capabilities of the user device, the capabilities being stored in a device capabilities table which includes for the user device a unique device identifier, a network address selected from a telephone number and an IP address, and a specification of capabilities of the user device, the specification of capabilities including whether the user device accepts only text data, whether the user device is able to respond to multiple prompts, whether the user device accepts digital audio data or only analog audio data, whether the user device has a microphone for generating analog audio signals, and whether the user device has an analog to digital converter for converting the analog audio signals to digital audio data. - View Dependent Claims (8, 9, 10, 11)
-
-
13. A processor-based system for providing an audibly controlled interface for a limited communication device, the processor-based system comprising:
-
an interface connection capable of two-way communication with the limited communication device; and means for generating an audio output, the generating means in communication with the interface connection, wherein the interface connection receives speech input information and provides the speech input information to the generating means;
the generating means generates at least one key chunk of information based on the speech input information;the generating means generates an audio output developed from a response document based on the at least one key chunk of information and provides the audio output to the interface connection, wherein the audio output comprises;
providing the at least one key chunk of information to a web application; and
receiving the response document from the web application, the response document developed from an application-defining document accessed in response to the at least one key chunk of information provided to the web application;the interface connection provides the audio output to the limited communication device; and an automatic speech recognition module, wherein the automatic speech recognition module derives the at least one key chunk of information from the speech input information received over the interface connection wherein the generating means comprises; means for sending and receiving web pages to and from an application server according to a hypertext transfer protocol; means for parsing XML tags appearing within web pages received by the web browser; means for performing basic telephony functions including detecting an on-hook condition and an off-hook condition of a user device, and sending and receiving audio signals to and from the user device; and means for selectively implementing hypertext markup language (HTML) and XML tags appearing within the web pages received by the browser based on capabilities of the user device, the capabilities being stored in a device capabilities table which includes for the user device a unique device identifier, a network address selected from a telephone number and an IP address, and a specification of capabilities of the user device the specification of capabilities including whether the user device accepts only text data, whether the user device is able to respond to multiple prompts, whether the user device accepts digital audio data or only analog audio data, whether the user device has a microphone for generating analog audio signals, and whether the user device has an analog to digital converter for converting the analog audio signals to digital audio data.
-
-
14. A computer program product that stored on a computer readable storage medium having instructions stored thereon for providing an audibly controlled interface for a limited communication device, such that the instructions, when carried out by a computer, cause the computer to perform the steps of:
-
receiving speech input information over an interface connection capable of two-way communication with the limited communication device; generating at least one key chunk of information based on the speech input information; generating an audio-output developed from a response document based on the at least one key chunk of information; and providing the audio output over the interface connection to the limited communication device in response to generating the audio output, wherein the audio output comprises;
providing the at least one key chunk of information to a web application; andreceiving the response document from the web application, the response document developed from an application-defining document accessed in response to the at least one key chunk of information provided to the web application; wherein the step of generating the at least one key chunk of information comprises generating the at least one key chunk of information by an automatic speech recognition module deriving the at least one key chunk of information from the speech input information and further cause the computer to perform the steps of; sending and receiving web pages to and from an application server according to a hypertext transfer protocol; Parsing XML tags appearing within web pages received by the browser; performing basic telephony functions including detecting an on-hook condition and an off-hook condition of a user device, and sending and receiving audio signals to and from the user device; and selectively implementing hypertext markup language (HTML) and XML tags appearing within the web pages received by the browser based on capabilities of the user device, the capabilities being stored in a device capabilities table which includes for the user device a unique device identifier, a network address selected from a telephone number and an IP address, and a specification of capabilities of the user device, the specification of capabilities including whether the user device accepts only text data, whether the user device is able to respond to multiple prompts, whether the user device accepts digital audio data or only analog audio data, whether the user device has a microphone for generating analog audio signals, and whether the user device has an analog to digital converter for converting the analog audio signals to digital audio data.
-
-
15. A method in a browser for providing an audibly controlled user interface for a limited communication device, the steps comprising:
- receiving speech input information including at least one of an input indicating an initial access to the
limited communication device over a telephony connection, command for storing data, a command for retrieving data, and a command for placing an outbound telephony call; generating the at least one key chunk of information by an automatic speech recognition module deriving the at least one key chunk of information from the speech input information;
generating an audio output developed from a response document based on the at least one key chunk of information to a web application and receiving the response document form the web application, the response document developed from an application-defining document access in response to the at least one key chunk of information provided to the web application; and
providing the audio over the telephony connection to the limited communication device in response to generating the audio output;
sending and receiving web pages to and from an application server according to a hypertext transfer protocol;
parsing XML tags appearing within web pages received by the browser;
performing basis telephony functions including detecting an on-hook condition and an off-hook condition of a user device, and sending and receiving audio signals to and from the user device; and
selectively implementing hypertext markup language (HTML) and XML tags appearing within the web pages received by the browser based on capabilities of the user device, the capabilities being stored in a device capabilities table which includes for the user device a unique device identifier, a network address selected from a telephone number and an IP address, and aspecification of capabilities of the user device, the specification of capabilities including whether the user device accepts only text data, whether the user device is able to respond to multiple prompts, whether the user device accepts digital audio data or only analog audio data, whether the user device has a microphone for generating analog audio signals, and whether the user device has an analog to digital converter for converting the analog audio signals to digital audio data.
- receiving speech input information including at least one of an input indicating an initial access to the
Specification