Voice control of playback device using voice assistant service(s)
First Claim
1. A playback device comprising:
one or more amplifiers configured to drive one or more speakers;
a microphone array;
a network interface;
one or more processors;
tangible, non-transitory computer-readable media having stored therein instructions executable by the one or more processors to cause the playback device to perform a method comprising:
continuously capturing, via the microphone array, audio into one or more buffers;
analyzing the captured audio using multiple wake-word detection algorithms running concurrently on the one or more processors, each wake-word detection algorithm corresponding to a respective voice assistant service among multiple voice assistant services supported by the playback device;
when a particular wake-word detection algorithm of the multiple wake-word detection algorithms detects, in the captured audio, a wake-word corresponding to a particular voice assistant service, transmitting, via the network interface, the captured audio to the particular voice assistant service, wherein the captured audio comprises a voice input, wherein the voice input comprises a command to modify at least one playback setting of a media playback system, and wherein the media playback system comprises the playback device;
after transmitting the captured audio, receiving, from one or more servers of the particular voice assistant service via the network interface, instructions to modify the at least one playback setting according to the command;
modifying the at least one playback setting based on the instructions; and
with the at least one playback setting modified, playing back at least one audio track via the one or more amplifiers configured to drive the one or more speakers.
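The capture and detection steps recited in this claim can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented implementation: audio frames are modeled as text tokens, each detector is a toy membership check rather than a trained model, and the names `RingBuffer`, `make_detector`, `detect_wake_word`, `service_a`, and `service_b` are hypothetical.

```python
from collections import deque
from concurrent.futures import ThreadPoolExecutor


class RingBuffer:
    """Fixed-size buffer that keeps only the most recent audio frames."""

    def __init__(self, capacity):
        self._frames = deque(maxlen=capacity)

    def write(self, frame):
        # Oldest frame is dropped automatically once capacity is reached.
        self._frames.append(frame)

    def snapshot(self):
        return list(self._frames)


def make_detector(wake_word):
    """Return a toy detector; a real one would run a model on PCM samples."""
    def detect(frames):
        return wake_word in frames
    return detect


# Hypothetical services and wake words: one detector per supported service.
DETECTORS = {
    "service_a": make_detector("hey_a"),
    "service_b": make_detector("ok_b"),
}


def detect_wake_word(frames, detectors=DETECTORS):
    """Run all detectors concurrently on the same frames.

    Returns the name of the first service (in dict order) whose detector
    fired, or None if no wake word was found.
    """
    with ThreadPoolExecutor(max_workers=len(detectors)) as pool:
        futures = {name: pool.submit(fn, frames) for name, fn in detectors.items()}
        for name, future in futures.items():
            if future.result():
                return name
    return None


buffer = RingBuffer(capacity=4)
for frame in ["noise", "ok_b", "turn", "volume", "up"]:
    buffer.write(frame)

print(detect_wake_word(buffer.snapshot()))
```

The thread pool stands in for "running concurrently on the one or more processors"; an embedded device would more likely use dedicated DSP tasks, which this sketch does not attempt to model.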
Abstract
Disclosed herein are example techniques to identify a voice service to process a voice input. An example implementation may involve a playback device capturing, via a microphone array, audio into one or more buffers. The playback device analyzes the captured audio using multiple wake-word detection algorithms. When a particular wake-word detection algorithm detects a wake-word corresponding to a particular voice assistant service, the playback device transmits the captured audio to the particular voice assistant service. The captured audio includes a voice input that includes a command to modify at least one playback setting of a media playback system. After transmitting the captured audio, the playback device receives, from the particular voice assistant service, instructions to modify the at least one playback setting according to the command, modifies the at least one playback setting, and, with the at least one playback setting modified, plays back at least one audio track.
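The second half of the flow described in the abstract (transmit the captured audio, receive instructions, modify a setting, then play) might look like the sketch below. Everything named here is an assumption made for illustration: `PlaybackDevice`, `fake_voice_service`, `handle_voice_input`, the instruction format (`setting` and `value` keys), and the volume numbers do not come from the patent, and the remote service is replaced by a local stand-in function.

```python
from dataclasses import dataclass, field


@dataclass
class PlaybackDevice:
    """Toy playback device holding settings and a log of played tracks."""
    settings: dict = field(default_factory=lambda: {"volume": 30})
    played: list = field(default_factory=list)

    def apply_instruction(self, instruction):
        # Modify one playback setting as the service instructed.
        self.settings[instruction["setting"]] = instruction["value"]

    def play(self, track):
        # Record the track together with the settings in force at playback.
        self.played.append((track, dict(self.settings)))


def fake_voice_service(captured_audio):
    """Local stand-in for a remote voice assistant service (assumption)."""
    if "volume" in captured_audio and "up" in captured_audio:
        return {"setting": "volume", "value": 40}
    return None  # no recognizable command in the voice input


def handle_voice_input(device, service, captured_audio, track):
    """Send audio to the service, apply any instruction, then play a track."""
    instruction = service(captured_audio)
    if instruction is not None:
        device.apply_instruction(instruction)
    device.play(track)
    return device.settings


device = PlaybackDevice()
handle_voice_input(device, fake_voice_service, ["hey_a", "volume", "up"], "track-1")
print(device.settings, device.played)
```

Keeping the service behind a plain callable means each supported voice assistant service could be swapped in without touching the device logic, which mirrors the per-service routing the abstract describes.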
20 Claims
1. A playback device comprising the elements recited in full under "First Claim" above.
Dependent claims: 2, 3, 4, 5, 6, 7
8. A tangible, non-transitory computer-readable medium having stored therein instructions executable by one or more processors to cause a playback device to perform a method comprising:
continuously capturing, via a microphone array of the playback device, audio into one or more buffers;
analyzing the captured audio using multiple wake-word detection algorithms running concurrently on one or more processors of the playback device, each wake-word detection algorithm corresponding to a respective voice assistant service among multiple voice assistant services supported by the playback device;
when a particular wake-word detection algorithm of the multiple wake-word detection algorithms detects, in the captured audio, a wake-word corresponding to a particular voice assistant service, transmitting, via a network interface of the playback device, the captured audio to the particular voice assistant service, wherein the captured audio comprises a voice input, wherein the voice input comprises a command to modify at least one playback setting of a media playback system, and wherein the media playback system comprises the playback device;
after transmitting the captured audio, receiving, from one or more servers of the particular voice assistant service via the network interface, instructions to modify the at least one playback setting according to the command;
modifying the at least one playback setting based on the instructions; and
with the at least one playback setting modified, playing back at least one audio track via one or more amplifiers configured to drive one or more speakers.
Dependent claims: 9, 10, 11, 12, 13, 14
15. A method comprising:
continuously capturing, via a microphone array of a playback device, audio into one or more buffers;
analyzing the captured audio using multiple wake-word detection algorithms running concurrently on one or more processors of the playback device, each wake-word detection algorithm corresponding to a respective voice assistant service among multiple voice assistant services supported by the playback device;
when a particular wake-word detection algorithm of the multiple wake-word detection algorithms detects, in the captured audio, a wake-word corresponding to a particular voice assistant service, transmitting, via a network interface of the playback device, the captured audio to the particular voice assistant service, wherein the captured audio comprises a voice input, wherein the voice input comprises a command to modify at least one playback setting of a media playback system, and wherein the media playback system comprises the playback device;
after transmitting the captured audio, receiving, from one or more servers of the particular voice assistant service via the network interface, instructions to modify the at least one playback setting according to the command;
modifying the at least one playback setting based on the instructions; and
with the at least one playback setting modified, playing back at least one audio track via one or more amplifiers configured to drive one or more speakers.
Dependent claims: 16, 17, 18, 19, 20
Specification