System and method for hybrid processing in a natural language voice services environment
First Claim
1. A method of natural language utterance processing, the method being implemented in a computer system that includes one or more physical processors executing one or more computer program instructions which, when executed, perform the method, the method comprising:
- receiving, at the computer system, from a first user device, a natural language utterance;
performing, at the computer system, speech recognition to determine one or more words of the natural language utterance;
receiving, at the computer system, from a second user device, a non-voice user input that relates to the natural language utterance, wherein the first user device and the second user device are associated with a user and are independently operable of one another and of the computer system;
processing, at the computer system, the non-voice user input to determine context information for the one or more words;
determining, at the computer system, one or more interpretations of the one or more words based on the context information;
generating, at the computer system, a user request related to the natural language utterance based on the one or more interpretations;
selecting, at the computer system, at least one user processing device to process the user request such that the selection of the user processing device is based on a determination that content related to the user request resides at the user processing device, wherein the user processing device comprises the first user device, the second user device, or a third user device independently operable of the computer system; and
transmitting, by the computer system, the user request to the user processing device to invoke the user processing device to process the user request.
8 Assignments
0 Petitions
Accused Products
Abstract
A system and method for hybrid processing in a natural language voice services environment that includes a plurality of multi-modal devices may be provided. In particular, the hybrid processing may generally include the plurality of multi-modal devices cooperatively interpreting and processing one or more natural language utterances included in one or more multi-modal requests. For example, a virtual router may receive various messages that include encoded audio corresponding to a natural language utterance contained in a multi-modal interaction provided to one or more of the devices. The virtual router may then analyze the encoded audio to select a cleanest sample of the natural language utterance and communicate with one or more other devices in the environment to determine an intent of the multi-modal interaction. The virtual router may then coordinate resolving the multi-modal interaction based on the intent of the multi-modal interaction.
-
Citations
31 Claims
-
1. A method of natural language utterance processing, the method being implemented in a computer system that includes one or more physical processors executing one or more computer program instructions which, when executed, perform the method, the method comprising:
-
receiving, at the computer system, from a first user device, a natural language utterance; performing, at the computer system, speech recognition to determine one or more words of the natural language utterance; receiving, at the computer system, from a second user device, a non-voice user input that relates to the natural language utterance, wherein the first user device and the second user device are associated with a user and are independently operable of one another and of the computer system; processing, at the computer system, the non-voice user input to determine context information for the one or more words; determining, at the computer system, one or more interpretations of the one or more words based on the context information; generating, at the computer system, a user request related to the natural language utterance based on the one or more interpretations; selecting, at the computer system, at least one user processing device to process the user request such that the selection of the user processing device is based on a determination that content related to the user request resides at the user processing device, wherein the user processing device comprises the first user device, the second user device, or a third user device independently operable of the computer system; and transmitting, by the computer system, the user request to the user processing device to invoke the user processing device to process the user request. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15)
-
-
16. A system for natural language utterance processing, the system comprising:
one or more physical processors programmed to execute one or more computer program instructions which, when executed, cause the one or more physical processors to; receive, from a first user device, a natural language utterance; perform speech recognition to determine one or more words of the natural language utterance; receive, from a second user device, a non-voice user input that relates to the natural language utterance, wherein the first user device and the second user device are associated with a user and are independently operable of one another and of the computer system; process the non-voice user input to determine context information for the one or more words; determine one or more interpretations of the one or more words based on the context information; generate a user request related to the natural language utterance based on the one or more interpretations; select at least one user processing device to process the user request such that the selection of the user processing device is based on a determination that content related to the user request resides at the user processing device, wherein the user processing device comprises the first user device, the second user device, or a third user device independently operable of the computer system; and transmit the user request to the user processing device to invoke the user processing device to process the user request. - View Dependent Claims (17, 18, 19, 20, 21, 22, 23, 24)
-
25. A method of natural language utterance processing, the method being implemented in a computer system that includes one or more physical processors executing one or more computer program instructions which, when executed, perform the method, the method comprising:
-
receiving, at the computer system, from a first user device, a natural language utterance; processing, at the computer system, the natural language utterance to determine one or more words of the natural language utterance; receiving, at the computer system, from a second user device, a non-voice user input that relates to the natural language utterance, wherein the first user device and the second user device are associated with a user and are independently operable of one another and of the computer system; processing, at the computer system, the non-voice user input to determine context information for the one or more words; and transmit, by the computer system, the one or more words and the context information to a user processing device to invoke the user processing device to determine one or more interpretations of the natural language utterance based on the one or more words and the context information, wherein the user processing device comprises the first user device, the second user device, or a third user device independently operable of the computer system. - View Dependent Claims (26, 27, 28, 29, 30, 31)
-
Specification