Apparatus and methods for providing an audibly controlled user interface for audio-based communication devices

US 7,308,484 B1
Filed: 06/30/2000
Issued: 12/11/2007
Est. Priority Date: 06/30/2000
Status: Expired due to Fees

First Claim

Patent Images

1. A method in a browser for providing an audibly controlled user interface for a limited communication device, the steps comprising:

receiving speech input information over an interface connection capable of two-way communication with the limited communication device;

generating at least one key chunk of information based on the speech input information;

generating an audio output developed from a response document based on the at least one key chunk of information; and

providing the audio output over the interface connection to the limited communication device in response to generating the audio output, wherein the audio output comprises;

providing the at least one key chunk of information to a web application; and

receiving the response document from the web application, the response document developed from an application-defining document accessed in response to the at least one key chunk of information provided to the web application;

wherein the step of generating the at least one key chunk of information comprises generating the at least one key chunk of information by an automatic speech recognition module deriving the at least one key chunk of information from the speech input information and further comprising;

sending and receiving web pages to and from an application server according to a hypertext transfer protocol;

parsing XML tags appearing within web pages received by the browser;

Performing basic telephony functions including detecting an on-hook condition and an off-hook condition of a user device and sending and receiving audio signals to and from the user device; and

selectively implementing hypertext markup language (HTML) and XML tags appearing within the web pages received by the browser based on capabilities of the user device, the capabilities being stored in a device capabilities table which includes for the user device a unique device identifier, a network address selected from a telephone number and an IP address and a specification of capabilities of the user device, the specification of capabilities including whether the user device accepts only text data whether the user device is able to respond to multiple prompts, whether the user device accepts digital audio data or only analog audio data, whether the user device has a microphone for generating analog audio signals, and whether the user device has an analog to digital converter for converting the analog audio signals to digital audio data.

View all claims

1 Assignment

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

The invention is directed to techniques for providing an audibly controlled interface for a user of a limited audio-based communication device, for example, a telephony device such as a desktop telephone or a cellular telephone. The communication device has an interface connection with a proxy browser. The user initially accesses the device, such as by picking up the handset, and the proxy browser provides a communication path over a network to a call services application on an application server. The application server provides a response to the initial access signal. The proxy browser receives the response from the application server and plays back an audio output based on the response to the communication device for the user. The user can then respond with a request to the call services application to place an outbound call or to initiate another service provided by the application server via the proxy browser.

67 Citations

View as Search Results

15 Claims

1. A method in a browser for providing an audibly controlled user interface for a limited communication device, the steps comprising:
- receiving speech input information over an interface connection capable of two-way communication with the limited communication device;
  
  generating at least one key chunk of information based on the speech input information;
  
  generating an audio output developed from a response document based on the at least one key chunk of information; and
  
  providing the audio output over the interface connection to the limited communication device in response to generating the audio output, wherein the audio output comprises;
  
  providing the at least one key chunk of information to a web application; and
  
  receiving the response document from the web application, the response document developed from an application-defining document accessed in response to the at least one key chunk of information provided to the web application;
  
  wherein the step of generating the at least one key chunk of information comprises generating the at least one key chunk of information by an automatic speech recognition module deriving the at least one key chunk of information from the speech input information and further comprising;
  
  sending and receiving web pages to and from an application server according to a hypertext transfer protocol;
  
  parsing XML tags appearing within web pages received by the browser;
  
  Performing basic telephony functions including detecting an on-hook condition and an off-hook condition of a user device and sending and receiving audio signals to and from the user device; and
  
  selectively implementing hypertext markup language (HTML) and XML tags appearing within the web pages received by the browser based on capabilities of the user device, the capabilities being stored in a device capabilities table which includes for the user device a unique device identifier, a network address selected from a telephone number and an IP address and a specification of capabilities of the user device, the specification of capabilities including whether the user device accepts only text data whether the user device is able to respond to multiple prompts, whether the user device accepts digital audio data or only analog audio data, whether the user device has a microphone for generating analog audio signals, and whether the user device has an analog to digital converter for converting the analog audio signals to digital audio data.
- View Dependent Claims (2, 3, 4, 5, 6, 12)
- - 2. The method of claim 1, wherein the step of receiving the speech input information comprises receiving the speech input information over a telephony connection to the limited communication device;
    - andthe step of providing the audio output over the interface connection comprises providing the audio output over the telephony connection.
  - 3. The method of claim 1, wherein the step of receiving the speech input information comprises receiving an input indicating an initial access to the limited communication device.
  - 4. The method of claim 1, wherein the step of receiving the speech input information comprises receiving at least one of a command for storing data, a command for retrieving data, and a command for placing an outbound telephony call.
  - 5. The method of claim 1 wherein the browser is configured to retrieved web-based documentation containing markup language on behalf of multiple user devices;
    - wherein the response document is a web page containing a markup language reference to a sound file; and
      
      wherein the step of generating the audio output includes the step of;
      
      playing the sound file referenced by the markup language reference contained in the web page in order to generate the audio output.
  - 6. The method of claim 5 wherein the step of receiving the speech input includes the step of obtaining an incoming wireless signal from a wireless user device, the wireless signal carrying the speech input;
    - wherein the step of generating the at least one key chunk of information includes the step of parsing the web page to identify the markup language reference to the sound file; and
      
      wherein the step of providing the audio output includes the step of transmitting an outgoing wireless signal to the wireless user device, the outgoing wireless signal carrying the audio output.
  - 12. The method of claim 5 wherein the step of receiving the speech input includes the step of obtaining an incoming wireless signal from a wireless user device, the wireless signal carrying the speech input;
    - wherein the step of generating the at least one key chunk of information includes the step of parsing the web page to identify the markup language reference to the sound file; and
      
      wherein the step of providing the audio output includes the step of transmitting an outgoing wireless signal to the wireless user device, the outgoing wireless signal carrying the audio output.

7. A processor-based system for providing an audibly controlled interface for a limited communication device, the processor-based system comprising:
- an interface connection capable of two-way communication with the limited communication device; and
  
  a proxy browser in communication with the interface connection;
  
  wherein the interface connection receives speech input information and provides the speech input information to the proxy browser;
  
  the proxy browser generates at least one key chunk of information based on the speech input information;
  
  the proxy browser generates an audio output developed from a response document based on the at least one key chunk of information and provides the audio output to the interface connection; and
  
  the interface connection provides the audio output to the limited communication device, wherein the audio output comprises;
  
  providing the at least one key chunk of information to a web application; and
  
  receiving the response document from the web application, the response document developed from an application-defining document accessed in response to the at least one key chunk of information provided to the web application;
  
  an automatic speech recognition module, wherein the automatic speech recognition module derives the at least one key chunk of information from the speech input information received over the interface connection;
  
  wherein the proxy browser comprises;
  
  a web browser configured for sending and receiving web pages to and from an application server according to a hypertext transfer protocol;
  
  an XML parser operative to parse XML tags appearing within web pages received by the web browser a device interface operative to perform basic telephony functions including detecting an on-hook condition and an off-hook condition of a user device, and sending and receiving audio signals to and from the user device; and
  
  a voice resource control configured for selectively implementing hypertext markup language (HTML) and XML tags appearing within the web pages received by the browser based on capabilities of the user device, the capabilities being stored in a device capabilities table which includes for the user device a unique device identifier, a network address selected from a telephone number and an IP address, and aspecification of capabilities of the user device, the specification of capabilities including whether the user device accepts only text data, whether the user device is able to respond to multiple prompts, whether the user device accepts digital audio data or only analog audio data, whether the user device has a microphone for generating analog audio signals, and whether the user device has an analog to digital converter for converting the analog audio signals to digital audio data.
- View Dependent Claims (8, 9, 10, 11)
- - 8. The system of claim 7, wherein the interface connection is a telephony connection.
  - 9. The system of claim 7, wherein the speech input information comprises an input indicating an initial access to the limited communication device.
  - 10. The system of claim 7, wherein the speech input information comprises at least one of a command for storing data, a command for retrieving data, and a command for placing an outbound telephony call.
  - 11. The method of claim 7 wherein the browser is configured to retrieved web-based documentation containing markup language on behalf of multiple user devices;
    - wherein the response document is a web page containing a markup language reference to a sound file; and
      
      wherein the step of generating the audio output includes the step of;
      
      playing the sound file referenced by the markup language reference contained in the web page in order to generate the audio output.

13. A processor-based system for providing an audibly controlled interface for a limited communication device, the processor-based system comprising:
- an interface connection capable of two-way communication with the limited communication device; and
  
  means for generating an audio output, the generating means in communication with the interface connection, wherein the interface connection receives speech input information and provides the speech input information to the generating means;
  
  the generating means generates at least one key chunk of information based on the speech input information;
  
  the generating means generates an audio output developed from a response document based on the at least one key chunk of information and provides the audio output to the interface connection, wherein the audio output comprises;
  
  providing the at least one key chunk of information to a web application; and
  
  receiving the response document from the web application, the response document developed from an application-defining document accessed in response to the at least one key chunk of information provided to the web application;
  
  the interface connection provides the audio output to the limited communication device;
  
  and an automatic speech recognition module, wherein the automatic speech recognition module derives the at least one key chunk of information from the speech input information received over the interface connection wherein the generating means comprises;
  
  means for sending and receiving web pages to and from an application server according to a hypertext transfer protocol;
  
  means for parsing XML tags appearing within web pages received by the web browser;
  
  means for performing basic telephony functions including detecting an on-hook condition and an off-hook condition of a user device, and sending and receiving audio signals to and from the user device; and
  
  means for selectively implementing hypertext markup language (HTML) and XML tags appearing within the web pages received by the browser based on capabilities of the user device, the capabilities being stored in a device capabilities table which includes for the user device a unique device identifier, a network address selected from a telephone number and an IP address, and a specification of capabilities of the user device the specification of capabilities including whether the user device accepts only text data, whether the user device is able to respond to multiple prompts, whether the user device accepts digital audio data or only analog audio data, whether the user device has a microphone for generating analog audio signals, and whether the user device has an analog to digital converter for converting the analog audio signals to digital audio data.

14. A computer program product that stored on a computer readable storage medium having instructions stored thereon for providing an audibly controlled interface for a limited communication device, such that the instructions, when carried out by a computer, cause the computer to perform the steps of:
- receiving speech input information over an interface connection capable of two-way communication with the limited communication device;
  
  generating at least one key chunk of information based on the speech input information;
  
  generating an audio-output developed from a response document based on the at least one key chunk of information; and
  
  providing the audio output over the interface connection to the limited communication device in response to generating the audio output, wherein the audio output comprises;
  
  providing the at least one key chunk of information to a web application; and
  
  receiving the response document from the web application, the response document developed from an application-defining document accessed in response to the at least one key chunk of information provided to the web application;
  
  wherein the step of generating the at least one key chunk of information comprises generating the at least one key chunk of information by an automatic speech recognition module deriving the at least one key chunk of information from the speech input information and further cause the computer to perform the steps of;
  
  sending and receiving web pages to and from an application server according to a hypertext transfer protocol;
  
  Parsing XML tags appearing within web pages received by the browser;
  
  performing basic telephony functions including detecting an on-hook condition and an off-hook condition of a user device, and sending and receiving audio signals to and from the user device; and
  
  selectively implementing hypertext markup language (HTML) and XML tags appearing within the web pages received by the browser based on capabilities of the user device, the capabilities being stored in a device capabilities table which includes for the user device a unique device identifier, a network address selected from a telephone number and an IP address, and a specification of capabilities of the user device, the specification of capabilities including whether the user device accepts only text data, whether the user device is able to respond to multiple prompts, whether the user device accepts digital audio data or only analog audio data, whether the user device has a microphone for generating analog audio signals, and whether the user device has an analog to digital converter for converting the analog audio signals to digital audio data.

15. A method in a browser for providing an audibly controlled user interface for a limited communication device, the steps comprising:
- receiving speech input information including at least one of an input indicating an initial access to thelimited communication device over a telephony connection, command for storing data, a command for retrieving data, and a command for placing an outbound telephony call;
  
  generating the at least one key chunk of information by an automatic speech recognition module deriving the at least one key chunk of information from the speech input information;
  
  generating an audio output developed from a response document based on the at least one key chunk of information to a web application and receiving the response document form the web application, the response document developed from an application-defining document access in response to the at least one key chunk of information provided to the web application; and
  
  providing the audio over the telephony connection to the limited communication device in response to generating the audio output;
  
  sending and receiving web pages to and from an application server according to a hypertext transfer protocol;
  
  parsing XML tags appearing within web pages received by the browser;
  
  performing basis telephony functions including detecting an on-hook condition and an off-hook condition of a user device, and sending and receiving audio signals to and from the user device; and
  
  selectively implementing hypertext markup language (HTML) and XML tags appearing within the web pages received by the browser based on capabilities of the user device, the capabilities being stored in a device capabilities table which includes for the user device a unique device identifier, a network address selected from a telephone number and an IP address, and aspecification of capabilities of the user device, the specification of capabilities including whether the user device accepts only text data, whether the user device is able to respond to multiple prompts, whether the user device accepts digital audio data or only analog audio data, whether the user device has a microphone for generating analog audiosignals, and whether the user device has an analog to digital converter for converting the analog audio signals to digital audio data.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Cisco Technology, Inc. (Cisco Systems, Inc.)
Original Assignee
Cisco Technology, Inc. (Cisco Systems, Inc.)
Inventors
Martin, Steven J., Danner, Ryan A., Dodrill, Lewis D.
Primary Examiner(s)
Vaughn; William
Assistant Examiner(s)
Nguyen; Thanh T.

Application Number

US09/608,232
Time in Patent Office

2,720 Days
Field of Search

709/218, 709/217, 709/226, 709/223, 704/223, 704/270, 704/275, 704/217, 704/233, 279/88.22
US Class Current

709/218
CPC Class Codes

G06F 16/252   between a Database Manageme...

G06F 16/9574   of access to content, e.g. ...

H04M 1/72445   for supporting Internet bro...

H04M 2201/40   using speech recognition sp...

H04M 2250/74   with voice recognition mean...

H04M 3/4938   comprising a voice browser ...

Apparatus and methods for providing an audibly controlled user interface for audio-based communication devices

First Claim

1 Assignment

0 Petitions

Accused Products

Abstract

67 Citations

15 Claims

Specification

Solutions

Use Cases

Quick Links

Apparatus and methods for providing an audibly controlled user interface for audio-based communication devices

First Claim

1 Assignment

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

67 Citations

15 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links