CONTEXT-BASED DEVICE ARBITRATION
First Claim
1. A system comprising:
one or more processors;
computer-readable media storing computer-executable instructions that, when executed by the one or more processors, cause the one or more processors to perform operations comprising:
receiving, from a first voice-enabled device, first audio data representing a speech utterance;
receiving, from the first voice-enabled device, a first audio signal metric value indicating a first signal-to-noise ratio associated with the first audio data;
receiving, from a second voice-enabled device, second audio data representing the speech utterance;
receiving, from the second voice-enabled device, a second audio signal metric value indicating a second signal-to-noise ratio associated with the second audio data;
determining that the first signal-to-noise ratio is greater than the second signal-to-noise ratio;
identifying device state data associated with the first voice-enabled device;
generating, using automatic speech recognition (ASR) on at least one of the first audio data or the second audio data, text data corresponding to the speech utterance;
determining, using natural language understanding (NLU) on the text data, intent data associated with the speech utterance, the intent data representing a request for a client device to perform an action;
determining, based at least in part on the device state data, that the first voice-enabled device is capable of performing the action responsive to the speech utterance;
determining a command to cause the first voice-enabled device to perform the action; and
sending, to the first voice-enabled device, data indicating the command.
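The selection logic recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation: the `DeviceReport` type, the `arbitrate` function, the device names, and the representation of device state data as a set of currently supported actions are all assumptions made for illustration. The ASR and NLU steps are out of scope here, so the intent's action is passed in directly.

```python
from dataclasses import dataclass
from typing import Dict, Optional, Set


@dataclass
class DeviceReport:
    """Hypothetical record of what one voice-enabled device sent."""
    device_id: str
    snr_db: float  # audio signal metric value (signal-to-noise ratio)


def arbitrate(
    first: DeviceReport,
    second: DeviceReport,
    action: str,
    device_state: Dict[str, Set[str]],
) -> Optional[dict]:
    # Prefer the device whose audio data had the higher signal-to-noise ratio.
    candidate = first if first.snr_db > second.snr_db else second
    # Assumed model: device state data lists the actions the device can perform now.
    if action in device_state.get(candidate.device_id, set()):
        # Data indicating the command, to be sent to the selected device.
        return {"target": candidate.device_id, "command": action}
    return None


# Illustrative inputs only; the names and values are made up.
state = {"kitchen": {"play_music", "set_timer"}, "hall": {"set_timer"}}
result = arbitrate(
    DeviceReport("kitchen", 18.5), DeviceReport("hall", 9.0), "play_music", state
)
```

In this sketch the function returns `None` when the higher-SNR device cannot perform the action; claim 1 itself only covers the case in which the first device is found capable.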
Abstract
This disclosure describes, in part, context-based device arbitration techniques to select a voice-enabled device from multiple voice-enabled devices to provide a response to a command included in a speech utterance of a user. In some examples, the context-driven arbitration techniques may include determining a ranked list of voice-enabled devices that are ranked based on audio signal metric values for audio signals generated by each voice-enabled device, and iteratively moving through the list to determine, based on device states of the voice-enabled devices, whether one of the voice-enabled devices can perform an action responsive to the command. If the voice-enabled devices that detected the speech utterance are unable to perform the action responsive to the command, all other voice-enabled devices associated with an account may be analyzed to determine whether one of the other voice-enabled devices can perform the action responsive to the command in the speech utterance.
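The ranked-list procedure described in the abstract might be sketched as follows. This is an illustrative reading only: the function name `select_device`, the dictionary keys `id` and `metric`, and the `can_perform` callback standing in for a device-state check are hypothetical and do not come from the disclosure.

```python
from typing import Callable, Dict, List, Optional


def select_device(
    detecting: List[Dict],
    account_devices: List[str],
    action: str,
    can_perform: Callable[[str, str], bool],
) -> Optional[str]:
    # Rank the devices that detected the utterance, best audio signal metric first.
    ranked = sorted(detecting, key=lambda d: d["metric"], reverse=True)
    # Move through the ranked list until a device's state allows the action.
    for device in ranked:
        if can_perform(device["id"], action):
            return device["id"]
    # Fallback: consider the other devices associated with the account.
    heard = {d["id"] for d in detecting}
    for device_id in account_devices:
        if device_id not in heard and can_perform(device_id, action):
            return device_id
    return None


# Illustrative use: only device "c", which did not hear the utterance, can act.
capable = {"c"}
chosen = select_device(
    [{"id": "a", "metric": 5.0}, {"id": "b", "metric": 12.0}],
    ["a", "b", "c"],
    "play_video",
    lambda device_id, action: device_id in capable,
)
```

Keeping the capability check behind a callback leaves open how device state is stored and queried, which the abstract does not specify.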
69 Citations
20 Claims
5. A system comprising:
one or more processors;
computer-readable media storing computer-executable instructions that, when executed by the one or more processors, cause the one or more processors to perform operations comprising:
receiving a first device identifier of a first device;
receiving first audio data associated with the first device identifier, the first audio data representing a sound;
receiving a second device identifier of a second device;
receiving second audio data associated with the second device identifier, the second audio data representing a portion of the sound, the portion of the sound being less than all the sound represented by the first audio data;
receiving intent data representing a machine response to the sound;
identifying first device state data associated with the first device;
identifying second device state data associated with the second device; and
based at least in part on the second device state data, determining the second device is to be used for the machine response.
13. A method comprising:
receiving first audio data associated with a first device, the first audio data representing sound;
receiving second audio data associated with a second device, the second audio data representing a portion of the sound that is less than all the sound represented in the first audio data;
identifying first device state data associated with the first device;
identifying second device state data associated with the second device;
receiving intent data representing a machine response to the sound; and
based at least in part on the second device state data, determining the second device is to be used for the machine response.
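Claims 5 and 13 cover the case in which the device that captured only a portion of the sound is still chosen because of its device state. One way this could look is sketched below, under stated assumptions: the `REQUIRED_STATE` rule table, the intent names, the state flags, and the `heard_fraction` input are hypothetical and are not recited in the claims.

```python
from typing import Dict, Optional

# Hypothetical rule table: intent -> state flag a device must have set to respond.
REQUIRED_STATE = {"stop_music": "playing_music", "stop_alarm": "alarm_ringing"}


def choose_responder(
    intent: str,
    states: Dict[str, Dict[str, bool]],
    heard_fraction: Dict[str, float],
) -> Optional[str]:
    flag = REQUIRED_STATE.get(intent)
    # Keep only devices whose state makes the response meaningful.
    eligible = [
        device_id
        for device_id, state in states.items()
        if flag is None or state.get(flag, False)
    ]
    if not eligible:
        return None
    # Among eligible devices, prefer the one that captured more of the sound.
    return max(eligible, key=lambda device_id: heard_fraction.get(device_id, 0.0))


# Illustrative inputs: the second device heard less of the sound but is playing music.
states = {"first": {"playing_music": False}, "second": {"playing_music": True}}
heard = {"first": 1.0, "second": 0.4}
responder = choose_responder("stop_music", states, heard)
```

Here device state acts as a filter applied before audio completeness is considered, which is how the second device can be selected despite having captured less of the sound.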
Specification