SYSTEM AND METHOD FOR AN INTEGRATED, MULTI-MODAL, MULTI-DEVICE NATURAL LANGUAGE VOICE SERVICES ENVIRONMENT
10 Assignments
0 Petitions
Accused Products
Abstract
A system and method for an integrated, multi-modal, multi-device natural language voice services environment may be provided. In particular, the environment may include a plurality of voice-enabled devices each having intent determination capabilities for processing multi-modal natural language inputs in addition to knowledge of the intent determination capabilities of other devices in the environment. Further, the environment may be arranged in a centralized manner, a distributed peer-to-peer manner, or various combinations thereof. As such, the various devices may cooperate to determine intent of multi-modal natural language inputs, and commands, queries, or other requests may be routed to one or more of the devices best suited to take action in response thereto.
286 Citations
54 Claims
-
1-27. -27. (canceled)
-
28. A method of processing natural language utterances, the method being implemented by a first device that comprises one or more physical processors executing one or more computer program instructions which, when executed, perform the method, the method comprising:
-
receiving, by the first device, a natural language utterance spoken by a user; performing, by the first device, speech recognition to determine one or more words of the natural language utterance; determining, by the first device, based on the one or more words, a first prediction of an intent of the user; transmitting, by the first device, the natural language utterance to a second device; receiving, from the second device by the first device, a second prediction of the intent of the user; and determining, by the first device, the intent of the user based on the first prediction and the second prediction. - View Dependent Claims (29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41)
-
-
42. A system for processing natural language utterances, the system comprising:
a first device having one or more physical processors programmed to execute one or more computer program instructions which, when executed, cause the one or more physical processors to; receive a natural language utterance spoken by a user; perform speech recognition to determine one or more words of the natural language utterance; determine, based on the one or more words, a first prediction of an intent of the user; transmit the natural language utterance to a second device; receive, from the second device, a second prediction of the intent of the user; and determine the intent of the user based on the first prediction and the second prediction. - View Dependent Claims (43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54)
Specification