Task assistant utilizing context for improved interaction
First Claim
1. A method comprising:
- receiving input, at a first device, from a user through multi modal input including a plurality of speech input, typing input, and touch input;
determining a meaning of the input;
determining, based on prior interactions with the user, that there is context related to the input, wherein the context comprises information that can be used to interpret a request in the input;
generating, based on a combination of the input and the context, an interpreted input in a natural language format, wherein the interpreted input comprises a query;
translating the interpreted input from the natural language format to a format compatible with an application, to generate a formatted query;
providing the formatted query to the application;
receiving data from the application in response to the formatted query;
providing a response to the user through multi modal output including a plurality of;
speech output, text output, non-speech audio output, haptic output, and visual non-text output;
updating the context based on the interpreted input;
verifying, based on a connection type between the first device and a second device, that the first device is proximal to the second device; and
in response to the verifying that the first device is proximal to the second device, transmitting the context from the first device to the second device.
2 Assignments
0 Petitions
Accused Products
Abstract
A method of providing a task assistant is described. The task assistant is designed to receive input from a user through multimodal input including a plurality of speech input, typing input, and touch input, determine the meaning of the input, and determining whether there is a context based on prior interactions with the user. The method further to generate an interpreted input based on a combination of the input and the context, and providing a formatted query to an application. The method further to receive data from the application in response to the formatted query, and provide a response to the user through multimodal output including a plurality of: speech output, text output, non-speech audio output, haptic output, and visual non-text output. The method further to update the context based on the interpreted input.
11 Citations
20 Claims
-
1. A method comprising:
-
receiving input, at a first device, from a user through multi modal input including a plurality of speech input, typing input, and touch input; determining a meaning of the input; determining, based on prior interactions with the user, that there is context related to the input, wherein the context comprises information that can be used to interpret a request in the input; generating, based on a combination of the input and the context, an interpreted input in a natural language format, wherein the interpreted input comprises a query; translating the interpreted input from the natural language format to a format compatible with an application, to generate a formatted query; providing the formatted query to the application; receiving data from the application in response to the formatted query; providing a response to the user through multi modal output including a plurality of; speech output, text output, non-speech audio output, haptic output, and visual non-text output; updating the context based on the interpreted input; verifying, based on a connection type between the first device and a second device, that the first device is proximal to the second device; and in response to the verifying that the first device is proximal to the second device, transmitting the context from the first device to the second device. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A method comprising:
-
receiving input, from a user at a first device, through multi modal input including a plurality of speech input, typing input, and touch input; determining, based on prior interactions with the user, that there is context corresponding to the input, wherein the context comprises information that can be used to interpret a request in the input; generating, based on a combination of the input and the context, an interpreted input in a natural language format, wherein the interpreted input comprises a query; translating the interpreted input from the natural language format to a format compatible with an application, to generate a formatted query; recording the formatted query in a query log; passing the formatted query to the application; receiving data from the application in response to the formatted query; providing, based on the data, a response to the user through multi modal output including a plurality of;
speech output, text output, non-speech audio output, haptic output, and visual non-text output;updating the context based on the interpreted input; determining, based on a network connection type between the first device and a second device, that the first device is proximal to the second device; and transferring, in response to the determination that the first device is proximal to the second device, the query log from the first device to the second device. - View Dependent Claims (13, 14, 15, 16)
-
-
17. A method comprising:
-
receiving input from a user, at a first device, through multi modal input including a plurality of speech input, typing input, and touch input; determining, based on prior interactions with the user, that there is a first context related to the input, wherein the first context comprises information that can be used to interpret a request in the input; generating, based on the input and the first context, an interpreted input in a natural language format, wherein the interpreted input comprises a query; translating the interpreted input from the natural language format to a format compatible with an application, to generate a formatted query; providing the formatted query to the application; recording the formatted query in a query log; receiving data from the application in response to the formatted query; providing a response to the user through multi modal output including a plurality of;
speech output, text output, non-speech audio output, haptic output, and visual non-text output;updating the first context based on the interpreted input; determining that the user has logged in to an application at a second device; verifying, by communicating over a local network, that the first device is proximal to the second device; in response to the determining that the user has logged in to the application at the second device and the verifying that the first device is proximal to the second device, transmitting the query log from the first device to the second device; and generating, at the second device and based on the query log, second context for a session of a task assistant. - View Dependent Claims (18, 19, 20)
-
Specification