Apparatus and methods for providing network-based information suitable for audio output
First Claim
1. In a server, a method for providing information suitable for audio output, the method comprising:
- receiving a web page including a first set of information over a network based on a request for the first set of information, receiving the first set of information further comprising;
receiving speech information specifying the first set of information;
generating a text request for the first set of information based on an acoustic speech recognition (ASR) technique applied to the speech information, generating including interpreting at least one primitive construct based on the speech information and generating at least one additional primitive construct based on a request for a user-defined command, andsubmitting the text request over the network;
accessing a tagged document in response to receiving the first set of information, the tagged document defined as an XML filtering document, accessing the tagged document further including;
determining an identity of the request for the first set of information; and
accessing the tagged document based on the identity of the request, wherein the identity of the request is based on at least one of an identifier for an originator of the request and an identifier for a destination of the request; and
generating a second set of information including subsets of the web page suitable for audio output based on the first set of information and the tagged document, generating the second set of information suitable for audio output further comprising;
selecting, based on predetermined expected patterns in the filtering document, at least one portion of the first set of information that is suitable for audio output; and
generating the second set of information based on selecting the at least one portion of the first set of information.
1 Assignment
0 Petitions
Accused Products
Abstract
The invention is directed to techniques for navigating a network based on audio input to retrieve information from a network. A user enters audio commands into a two-way communication device to access information located on a network, such as the Internet. For example, a user enters a voice request for a web page into a telephone, which sends the request to a proxy browser for the World Wide Web which in turn provides the request to a web navigation application executing on an application server. The web navigation application generates a text-based request based on the voice request and retrieves a web page from the World Wide Web based on the text-based request. The web navigation application uses a script or an XML page to generate a file suitable for audio output from the retrieved web page. The web navigation application then produces an audio output file from the generated file, which it sends to the proxy browser to provide audio output signals to the user over the telephony connection to the user'"'"'s telephone.
-
Citations
35 Claims
-
1. In a server, a method for providing information suitable for audio output, the method comprising:
-
receiving a web page including a first set of information over a network based on a request for the first set of information, receiving the first set of information further comprising; receiving speech information specifying the first set of information; generating a text request for the first set of information based on an acoustic speech recognition (ASR) technique applied to the speech information, generating including interpreting at least one primitive construct based on the speech information and generating at least one additional primitive construct based on a request for a user-defined command, and submitting the text request over the network; accessing a tagged document in response to receiving the first set of information, the tagged document defined as an XML filtering document, accessing the tagged document further including; determining an identity of the request for the first set of information; and accessing the tagged document based on the identity of the request, wherein the identity of the request is based on at least one of an identifier for an originator of the request and an identifier for a destination of the request; and generating a second set of information including subsets of the web page suitable for audio output based on the first set of information and the tagged document, generating the second set of information suitable for audio output further comprising; selecting, based on predetermined expected patterns in the filtering document, at least one portion of the first set of information that is suitable for audio output; and generating the second set of information based on selecting the at least one portion of the first set of information. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A system for providing information suitable for audio output, the system comprising:
-
a document database configured for storing a plurality of tagged documents; and a server comprising an executable resource, wherein the executable resource; receives a web page including a first set of information over a network based on a request for the first set of information, the executable resource further operable to generate a text request for the first set of information based on an acoustic speech recognition (ASR) technique applied to the speech information, and submits the text request over the network, generating the text request further including; receiving speech information specifying the first set of information Interpreting at least one primitive construct based on the speech information; and generating at least one additional primitive construct based on a request for a user-defined command; accesses a tagged document defined as an XML filtering document from the document database based on receiving the first set of information by determining an identity of the request for the first set of information; and accessing the tagged document based on the identity of the request, wherein the identity of the request is based on at least one of an identifier for an originator of the request and an identifier for a destination of the request; and generates the second set of information including subsets of the web page suitable for audio output based on the first set of information and the tagged document, such that the executable resource selects, based on predetermined expected patterns in the filtering document, at least one portion of the first set of information that is suitable for audio output, and generates the second set of information based on selecting the at least one portion of the first set of information. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. A computer program product embodied on a computer readable storage medium having instructions stored thereon operable for execution by a processor for providing information suitable for audio output, such that the instructions when carried out by a computer, cause the computer to perform the steps of:
-
receiving a web page including a first set of information over a network based on a request for the first set of information, receiving the first set of information further comprising; receiving speech information specifying the first set of information; generating a text request for the first set of information based on an acoustic speech recognition (ASR) technique applied to the speech information, generating including interpreting at least one primitive construct based on the speech information and generating at least one additional primitive construct based on a request for a user-defined command, and submitting the text request over the network; accessing a tagged document defined as an XML filtering document in response to receiving the first set of information, accessing the tagged document further including; determining an identity of the request for the first set of information; and accessing the tagged document based on the identity of the request, wherein the identity of the request is based on at least one of an identifier for an originator of the request and an identifier for a destination of the request; and generating a second set of information including subsets of the web page suitable for audio output based on the first set of information and the tagged document, generating the second set of information suitable for audio output further comprising; selecting, based on predetermined expected patterns in the filtering document, at least one portion of the first set of information that is suitable for audio output; and generating the second set of information based on selecting the at least one portion of the first set of information. - View Dependent Claims (22)
-
-
23. A computer program product embodied on a computer readable storage medium having an encoded set of processor based instructions operable for execution by a processor for performing a method of providing information suitable for audio output, such that the instructions, when carried out by a computer, cause the computer to perform the steps of:
-
receiving a web page including a first set of information over a network based on a request for the first set of information, receiving the first set of information further comprising; receiving speech information specifying the first set of information; generating a text request for the first set of information based on an acoustic speech recognition (ASR) technique applied to the speech information, generating including interpreting at least one primitive construct based on the speech information and generating at least one additional primitive construct based on a request for a user-defined command, and submitting the text request over the network; accessing a tagged document defined as an XML filtering document in response to receiving the first set of information, accessing the tagged document further including; determining an identity of the request for the first set of information; and accessing the tagged document based on the identity of the request, wherein the identity of the request is based on at least one of an identifier for an originator of the request and an identifier for a destination of the request; and generating a second set of information including subsets of the web page suitable for audio output based on the first set of information and the tagged document, generating the second set of information suitable for audio output further comprising; selecting, based on predetermined expected patterns in the filtering document, at least one portion of the first set of information that is suitable for audio output; and generating the second set of information based on selecting the at least one portion of the first set of information. - View Dependent Claims (24)
-
-
25. A system for providing information suitable for audio output, the system comprising:
-
a document database configured for storing a plurality of tagged document pages; means for producing a second set of information suitable for audio output, wherein the producing means receives a web page including a first set of information over a network based on a request for the first set of information, receiving the first set of information further comprising; receiving speech information specifying the first set of information; generating a text request for the first set of information based on an acoustic speech recognition (ASR) technique applied to the speech information, generating including interpreting at least one primitive construct based on the speech information and generating at least one additional primitive construct based on a request for a user-defined command, and submitting the text request over the network; accesses a tagged document defined as an XML filtering document from the document database based on receiving the first set of information by; determining an identity of the request for the first set of information; and accessing the tagged document based on the identity of the request, wherein the identity of the request is based on at least one of an identifier for an originator of the request and an identifier for a destination of the request; and generating the second set of information including subsets of the web page suitable for audio output based on the first set of information and the tagged document, generating the second set of information suitable for audio output comprises; selecting, based on predetermined expected patterns in the filtering document, at least one portion of the first set of information that is suitable for audio output; and generating the second set of information based on selecting the at least one portion of the first set of information. - View Dependent Claims (26)
-
-
27. A method for navigating a web by voice in a server configured for executing voice web applications, the method comprising:
-
requesting a web page including a first set of information based on a voice web navigation request, requesting the web page further comprising; receiving speech information specifying the first set of information; generating a text request for the first set of information based on an acoustic speech recognition (ASR) technique applied to the speech information, generating including interpreting at least one primitive construct based on the speech information and generating at least one additional primitive construct based on a request for a user-defined command, and submitting the text request over the network; receiving a retrieved web page based on the voice web navigation request; accessing a tagged document defined as an XML filtering document page in response to receiving the retrieved web page, accessing the tagged document further including; determining an identity of the request for the first set of information; and accessing the tagged document based on the identity of the request, wherein the identity of the request is based on at least one of an identifier for an originator of the request and an identifier for a destination of the request; generating filtered web content including subsets of the web page suitable for audio output based on the retrieved web page and the extensible markup language page; and generating the at least one audio output file based on the filtered web content, generating audio output file further comprising; selecting, based on predetermined expected patterns in the filtering document, at least one portion of the retrieved web page that is suitable for audio output; and generating the audio output file based on selecting the at least one portion of the first set of information. - View Dependent Claims (28, 29, 30, 31)
-
-
32. A method for voice-based navigation in a server configured for executing voice web applications comprising:
-
receiving a voice-based request to navigate the web from an audio communication device operable to provide the voice-based request in response to a menu generated based on a specific application-defining document operable to provide parameters and options; associating the voice-based request with the specific application-defining document; searching for primitive constructs in the voice-based request; constructing a text-based request based on the primitive constructs identified from the voice-based request; generating the text-based request to navigate the web based on the primitive constructs in the voice-based request from at least one of a database and a proxy server; requesting the web page using the text-based web navigation request by posting a generated URL to a web server to execute the request for the web page; receiving the requested web page from the web server; accessing a tagged document defined as an XML filtering document page from an application document database using the application-defining document associated with the voice-based request, the filtering document page employing a markup language and operable to filter the retrieved web page to provide generated content suitable for audio output, the requesting the web page and accessing the filtering document occurring in a substantially concurrent time frame; generating the filtered web content including subsets of the web page from the retrieved web page and the filtering document page indicated by the application-defining document associated with the voice-based request; generating at least one audio output file based on the filtered web content via a text-to-speech (TTS) technique operable to convert the text in the filtered web content to audio output files, generating the audio output file further comprising; selecting, based on predetermined expected patterns in the filtering document, at least one portion of the retrieved web page that is suitable for audio output; and generating the audio output file based on selecting the at least one portion of the first set of information; and sending the signals via a network connection to the user audio communication device. - View Dependent Claims (33, 34, 35)
-
Specification