System and method for processing multi-modal device interactions in a natural language voice services environment
First Claim
1. A computer-implemented method of facilitating natural language utterance processing via multiple input modes, the method being implemented on a computer that includes one or more physical processors executing one or more computer program instructions which, when executed, perform the method, the method comprising:
- receiving, from a user by the one or more physical processors via a first input mode, a first input;
receiving, from the user by the one or more physical processors via a second input mode that is different from the first input mode, a second input that relates to the first input;
determining, by the one or more physical processors, a request type from a plurality of request types based on the first input or the second input;
determining, by the one or more physical processors, a request associated with the request type;
determining, by the one or more physical processors based on the first input, first context information for the request;
determining, by the one or more physical processors based on the second input, second context information for the request; and
processing, by the one or more physical processors, the request based on the first context information and the second context information.
10 Assignments
0 Petitions
Accused Products
Abstract
A system and method for processing multi-modal device interactions in a natural language voice services environment may be provided. In particular, one or more multi-modal device interactions may be received in a natural language voice services environment that includes one or more electronic devices. The multi-modal device interactions may include a non-voice interaction with at least one of the electronic devices or an application associated therewith, and may further include a natural language utterance relating to the non-voice interaction. Context relating to the non-voice interaction and the natural language utterance may be extracted and combined to determine an intent of the multi-modal device interaction, and a request may then be routed to one or more of the electronic devices based on the determined intent of the multi-modal device interaction.
663 Citations
32 Claims
-
1. A computer-implemented method of facilitating natural language utterance processing via multiple input modes, the method being implemented on a computer that includes one or more physical processors executing one or more computer program instructions which, when executed, perform the method, the method comprising:
-
receiving, from a user by the one or more physical processors via a first input mode, a first input; receiving, from the user by the one or more physical processors via a second input mode that is different from the first input mode, a second input that relates to the first input; determining, by the one or more physical processors, a request type from a plurality of request types based on the first input or the second input; determining, by the one or more physical processors, a request associated with the request type; determining, by the one or more physical processors based on the first input, first context information for the request; determining, by the one or more physical processors based on the second input, second context information for the request; and processing, by the one or more physical processors, the request based on the first context information and the second context information. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16)
-
-
17. A system for facilitating natural language utterance processing via multiple input modes, the system comprising:
one or more physical processors programmed to execute one or more computer program instructions which, when executed, cause the system to; receive, from a user via a first input mode, a first input; receive, from the user via a second input mode that is different from the first input mode, a second input that relates to the first input; determine a request type from a plurality of request types based on the first input or the second input; determine a request associated with the request type; determine, based on the first input, first context information for the request; determine, based on the second input, second context information for the request; and process the request based on the first context information and the second context information. - View Dependent Claims (18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32)
Specification