Multi-modal web interaction over wireless network
First Claim
Patent Images
1. A method comprising:
- receiving, at a server, a session message from a client device via a network, the session message requesting establishment of a session with the server and comprising (i) a voice type requested by the client device and (ii) a user language requested by the client device;
determining, at the server, if the requested voice type and the requested user language are supported by the server;
sending, by the server, a ready message to the client device via the network in response to determining that the requested voice type and the requested user language are supported by the server;
receiving, at the server, a client request from the client device via the network subsequent to sending the ready message to the client device, the client request including a focused group of hyperlinks and speech data;
interpreting the client request to identify a selection of at least one of a plurality of web interaction modes, at least one web interaction mode being a speech interaction mode; and
building a correct grammar for speech recognition based on the speech data and the focused group of hyperlinks, performing speech recognition, and performing specific tasks according to the result of the speech recognition.
0 Assignments
0 Petitions
Accused Products
Abstract
A system, apparatus, and method is disclosed for receiving user input at a client device, interpreting the user input to identify a selection of at least one of a plurality of web interaction modes, producing a corresponding client request based in part on the user input and the web interaction mode; and sending the client request to a server via a network.
52 Citations
20 Claims
-
1. A method comprising:
-
receiving, at a server, a session message from a client device via a network, the session message requesting establishment of a session with the server and comprising (i) a voice type requested by the client device and (ii) a user language requested by the client device; determining, at the server, if the requested voice type and the requested user language are supported by the server; sending, by the server, a ready message to the client device via the network in response to determining that the requested voice type and the requested user language are supported by the server; receiving, at the server, a client request from the client device via the network subsequent to sending the ready message to the client device, the client request including a focused group of hyperlinks and speech data; interpreting the client request to identify a selection of at least one of a plurality of web interaction modes, at least one web interaction mode being a speech interaction mode; and building a correct grammar for speech recognition based on the speech data and the focused group of hyperlinks, performing speech recognition, and performing specific tasks according to the result of the speech recognition. - View Dependent Claims (3, 4, 5, 6, 7, 8, 16, 17, 18, 19, 20)
-
-
2. A non-transitory machine-readable medium having stored thereon data representing instructions which, when executed by a machine, cause the machine to perform operations, comprising:
-
receiving a session message from a client device via a network, the session message requesting establishment of a session and comprising (i) a voice type requested by the client device and (ii) a user language requested by the client device; determining if the requested voice type and the requested user language are supported; sending a ready message to the client device via the network in response to determining that the requested voice type and the requested user language are supported; receiving a client request from the client device via the network subsequent to sending the ready message to the client device, the client request including a focused group of hyperlinks and speech data; interpreting the client request to identify a selection of at least one of a plurality of web interaction modes, at least one web interaction mode being a speech interaction mode; and building a correct grammar for speech recognition based on the speech data and the focused group of hyperlinks, performing speech recognition, and performing specific tasks according to the result of the speech recognition. - View Dependent Claims (9, 10, 11, 12, 13, 14, 15)
-
Specification