Playback device supporting concurrent voice assistant services
First Claim
1. A playback device comprising:
- one or more amplifiers configured to drive one or more speakers;
at least one microphone;
a network interface;
one or more processors; and
data storage having stored therein instructions executable by the one or more processors to cause the playback device to perform a method comprising;
continuously capturing, via the at least one microphone, audio into one or more buffers;
analyzing the captured audio using a first wake-word detection algorithm and a second wake-word detection algorithm, wherein the first wake-word detection algorithm corresponds to a first voice assistant service associated with a first wake word, and wherein the second wake-word detection algorithm corresponds to a second voice assistant service associated with a second wake word;
when one of the first wake-word detection algorithm and the second wake-word detection algorithm detects, in the captured audio, a wake word corresponding to a particular voice assistant service of (a) the first voice assistant service or (b) the second voice assistant service, transmitting the captured audio to one or more servers associated with the particular voice assistant service for processing voice input in the captured audio;
after transmitting the captured audio, receiving, via the network interface, at least one instruction, wherein the at least one instruction is based on the voice input in the captured audio; and
outputting audio based on the at least one instruction via the one or more amplifiers configured to drive the one or more speakers.
2 Assignments
0 Petitions
Accused Products
Abstract
Disclosed herein are example techniques to support multiple voice assistant services. An example implementation may involve a playback device continuously capturing, via the at least one microphone, audio into one or more buffers and analyzing the captured audio using a first wake-word detection algorithm and a second wake-word detection algorithm. When one of the first wake-word detection algorithm or the second wake-word detection algorithm detects, in the captured audio, a wake-word corresponding to a particular voice assistant service of (a) the first voice assistant service or (b) the second voice assistant service, the playback device transmits the captured audio to one or more servers associated with the particular voice assistant service. After transmitting the captured audio, the playback device receives, via the network interface, at least one instruction based on the captured audio; and performs one or more actions based on the at least one instruction.
-
Citations
20 Claims
-
1. A playback device comprising:
-
one or more amplifiers configured to drive one or more speakers; at least one microphone; a network interface; one or more processors; and data storage having stored therein instructions executable by the one or more processors to cause the playback device to perform a method comprising; continuously capturing, via the at least one microphone, audio into one or more buffers; analyzing the captured audio using a first wake-word detection algorithm and a second wake-word detection algorithm, wherein the first wake-word detection algorithm corresponds to a first voice assistant service associated with a first wake word, and wherein the second wake-word detection algorithm corresponds to a second voice assistant service associated with a second wake word; when one of the first wake-word detection algorithm and the second wake-word detection algorithm detects, in the captured audio, a wake word corresponding to a particular voice assistant service of (a) the first voice assistant service or (b) the second voice assistant service, transmitting the captured audio to one or more servers associated with the particular voice assistant service for processing voice input in the captured audio; after transmitting the captured audio, receiving, via the network interface, at least one instruction, wherein the at least one instruction is based on the voice input in the captured audio; and outputting audio based on the at least one instruction via the one or more amplifiers configured to drive the one or more speakers. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A method to be performed by a playback device comprising a network interface, at least one microphone, and one or more amplifiers configured to drive one or more speakers, the method comprising:
-
continuously capturing, via the at least one microphone, audio into one or more buffers; analyzing the captured audio using a first wake-word detection algorithm and a second wake-word detection algorithm, wherein the first wake-word detection algorithm corresponds to a first voice assistant service associated with a first wake word and wherein the second wake-word detection algorithm corresponds to a second voice assistant service associated with a second wake word; when one of the first wake-word detection algorithm and the second wake-word detection algorithm detects, in the captured audio, a wake word corresponding to a particular voice assistant service of (a) the first voice assistant service or (b) the second voice assistant service, transmitting the captured audio to one or more servers associated with the particular voice assistant service for processing voice input in the captured audio; after transmitting the captured audio, receiving, via the network interface, at least one instruction, wherein the at least one instruction is based on the voice input in the captured audio; and outputting audio based on the at least one instruction via the one or more amplifiers configured to drive the one or more speakers. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A non-transitory computer-readable medium having instructions stored thereon that are executable by one or more processors to cause a playback device to perform a method, the playback device comprising a network interface, at least one microphone, and one or more amplifiers configured to drive one or more speakers, the method comprising:
-
continuously capturing, via the at least one microphone, audio into one or more buffers; analyzing the captured audio using a first wake-word detection algorithm and a second wake-word detection algorithm, wherein the first wake-word detection algorithm corresponds to a first voice assistant service associated with a first wake word, and wherein the second wake-word detection algorithm corresponds to a second voice assistant service associated with a second wake word; when one of the first wake-word detection algorithm and the second wake-word detection algorithm detects, in the captured audio, a wake word corresponding to a particular voice assistant service of (a) the first voice assistant service or (b) the second voice assistant service, transmitting the captured audio to one or more servers associated with the particular voice assistant service for processing voice input in the captured audio; after transmitting the captured audio, receiving, via the network interface, at least one instruction, wherein the at least one instruction is based on the voice input in the captured audio; and outputting audio based on the at least one instruction via the one or more amplifiers configured to drive the one or more speakers. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification