PROCESSING SPOKEN COMMANDS TO CONTROL DISTRIBUTED AUDIO OUTPUTS
First Claim
1. A computer-implemented method for controlling a speaker system using an input device, the method comprising:
- associating, by at least one device, an input device with a first wireless speaker;
receiving, from the input device by the at least one device, input audio data corresponding to an utterance;
performing, by the at least one device, speech processing on the input audio data to determine speech processing output;
determining, by the at least one device, that the speech processing output identifies a first command to output second audio;
determining, by the at least one device, that the speech processing output identifies a desired output location;
determining, from among a plurality of output devices, a first wireless speaker corresponding to the desired output location;
determining, by the at least one device, that the first wireless speaker is controllable by a network-connected device;
determining, by the at least one device, that the first wireless speaker is outputting first audio;
generating, by the at least one device, a second command instructing the network-connected device to cause a volume level of the first audio to be reduced, the second command executable at least in part by the network-connected device;
sending, by the at least one device, a first instruction to the network-connected device to execute the second command;
determining, by the at least one device, that the speech processing output identifies indicates an audio source from which to generate the second audio;
sending, by the at least one device, output audio data corresponding to the audio source using an address identifier associated with the output audio data;
sending, by the at least one device, the address identifier to the network-connected device; and
sending, by the at least one device, a second instruction to the network-connected device to execute the first command, the first command instructing the network-connected device to obtain the output audio data using the address identifier and to cause the second audio to be generated from the output audio data using the first wireless speaker.
1 Assignment
0 Petitions
Accused Products
Abstract
A system that is capable of controlling multiple entertainment systems and/or speakers using voice commands. The system receives voice commands and may determine audio sources and speakers indicated by the voice commands. The system may generate audio data from the audio sources and may send the audio data to the speakers using multiple interfaces. For example, the system may send the audio data directly to the speakers using a network address, may send the audio data to the speakers via a voice-enabled device or may send the audio data to the speakers via a speaker controller. The system may generate output zones including multiple speakers and may associate input devices with speakers within the output zones. For example, the system may receive a voice command from an input device in an output zone and may reduce output audio generated by speakers in the output zone.
421 Citations
20 Claims
-
1. A computer-implemented method for controlling a speaker system using an input device, the method comprising:
-
associating, by at least one device, an input device with a first wireless speaker; receiving, from the input device by the at least one device, input audio data corresponding to an utterance; performing, by the at least one device, speech processing on the input audio data to determine speech processing output; determining, by the at least one device, that the speech processing output identifies a first command to output second audio; determining, by the at least one device, that the speech processing output identifies a desired output location; determining, from among a plurality of output devices, a first wireless speaker corresponding to the desired output location; determining, by the at least one device, that the first wireless speaker is controllable by a network-connected device; determining, by the at least one device, that the first wireless speaker is outputting first audio; generating, by the at least one device, a second command instructing the network-connected device to cause a volume level of the first audio to be reduced, the second command executable at least in part by the network-connected device; sending, by the at least one device, a first instruction to the network-connected device to execute the second command; determining, by the at least one device, that the speech processing output identifies indicates an audio source from which to generate the second audio; sending, by the at least one device, output audio data corresponding to the audio source using an address identifier associated with the output audio data; sending, by the at least one device, the address identifier to the network-connected device; and sending, by the at least one device, a second instruction to the network-connected device to execute the first command, the first command instructing the network-connected device to obtain the output audio data using the address identifier and to cause the second audio to be generated from the output audio data using the first wireless speaker. - View Dependent Claims (2, 3, 4)
-
-
5. A computer-implemented method comprising:
-
receiving, from an input device by at least one device, input audio data corresponding to an utterance; performing, by the at least one device, speech processing on the input audio data to determine speech processing output; determining, by the at least one device, that the speech processing output identifies a first command; determining, by the at least one device, that the speech processing output identifies a desired output location; determining, from among a plurality of output devices, a first output device corresponding to the desired output location; determining, by the at least one device, that the first output device is controllable by a network-connected device; determining, by the at least one device, that the first output device is outputting first audio; generating, by the at least one device, a second command instructing the network-connected device to cause a volume level of the first audio to be reduced, the second command executable at least in part by the network-connected device; sending, by the at least one device, a first instruction to the network-connected device to execute the second command; and sending, by the at least one device, a second instruction to the network-connected device to execute the first command and send command output to the first output device. - View Dependent Claims (6, 7, 8, 9, 10, 11, 12)
-
-
13. A device, comprising:
-
at least one processor; memory including instructions operable to be executed by the at least one processor to configure the device to; receive, from an input device, input audio data corresponding to an utterance; perform speech processing on the input audio data to determine speech processing output; determine that the speech processing output identifies a first command; determine that the speech processing output identifies a desired output location; determine, from among a plurality of output devices, a first output device corresponding to the desired output location; determine that the first output device is controllable by a network-connected device; determine that the first output device is outputting first audio; generate a second command instructing the network-connected device to cause a volume level of the first audio to be reduced, the second command executable at least in part by the network-connected device; send a first instruction to the network-connected device to execute the second command; and send a second instruction to the network-connected device to execute the first command and send command output to the first output device. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20)
-
Specification