Voice detection by multiple devices
First Claim
Patent Images
1. A first networked microphone device (NMD) comprising:
- one or more amplifiers configured to drive one or more speakers;
a microphone array;
a network interface;
one or more processors;
tangible, non-transitory computer-readable media having stored therein instructions executable by the one or more processors to cause the first NMD to perform a method comprising;
continuously recording, via the microphone array, audio into a buffer;
detecting, in the recorded audio, a wake-word;
in response to detecting the wake-word, (i) listening, via the microphone array, for a voice command following the wake-word in the recorded audio and (ii) sending, via the network interface, instructions to one or more second NMDs connected via to the first NMD via a local area network, the instructions causing the one or more second NMDs to stop recording audio via respective microphone arrays of the one or more second NMDs for a pre-defined time period;
querying, via the network interface, one or more servers of a particular voice assistant service with the voice command following the detected wake-word within the recorded audio;
receiving, from one or more servers of the particular voice assistant service via the network interface in response to the query, a playback command corresponding to the voice command; and
playing back audio content according to the playback command via the one or more amplifiers configured to drive one or more speakers.
4 Assignments
0 Petitions
Accused Products
Abstract
Disclosed herein are example techniques for voice detection by multiple NMDs. An example implementation may involve receiving a set of voice recordings from a set of NMDs, and identifying a subset of voice recordings from which to determine a given voice command. The example implementation may further involve causing the identified subset of voice recordings to be analyzed to determine the given voice command.
-
Citations
20 Claims
-
1. A first networked microphone device (NMD) comprising:
-
one or more amplifiers configured to drive one or more speakers; a microphone array; a network interface; one or more processors; tangible, non-transitory computer-readable media having stored therein instructions executable by the one or more processors to cause the first NMD to perform a method comprising; continuously recording, via the microphone array, audio into a buffer; detecting, in the recorded audio, a wake-word; in response to detecting the wake-word, (i) listening, via the microphone array, for a voice command following the wake-word in the recorded audio and (ii) sending, via the network interface, instructions to one or more second NMDs connected via to the first NMD via a local area network, the instructions causing the one or more second NMDs to stop recording audio via respective microphone arrays of the one or more second NMDs for a pre-defined time period; querying, via the network interface, one or more servers of a particular voice assistant service with the voice command following the detected wake-word within the recorded audio; receiving, from one or more servers of the particular voice assistant service via the network interface in response to the query, a playback command corresponding to the voice command; and playing back audio content according to the playback command via the one or more amplifiers configured to drive one or more speakers. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. Tangible, non-transitory, computer-readable media having instructions encoded therein, wherein the instructions, when executed by one or more processors, cause a first networked microphone device (NMD) to perform a method comprising:
-
continuously recording, via a microphone array of the first NMD, audio into a buffer; detecting, in the recorded audio, a wake-word; in response to detecting the wake-word, (i) listening, via a microphone of the first NMD, for a voice command following the wake-word in the recorded audio and (ii) sending, via a network interface, instructions to one or more second NMDs connected via to the first NMD via a local area network, the instructions causing the one or more second NMDs to stop recording audio via respective microphone arrays of the one or more second NMDs for a pre-defined time period; querying, via a network interface of the first NMD, one or more servers of a particular voice assistant service with the voice command following the detected wake-word within the recorded audio; receiving, from one or more servers of the particular voice assistant service via the network interface in response to the query, a playback command corresponding to the voice command; and playing back audio content according to the playback command via one or more amplifiers configured to drive one or more speakers. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A method comprising:
-
a first networked microphone device (NMD) continuously recording, via a microphone array of the first NMD, audio into a buffer; the first NMD detecting, in the recorded audio, a wake-word; in response to detecting the wake-word, the first NMD (i) listening, via a microphone of the first NMD, for a voice command following the wake-word in the recorded audio and (ii) sending, via a network interface, instructions to one or more second NMDs connected via to the first NMD via a local area network, the instructions causing the one or more second NMDs to stop recording audio via respective microphone arrays of the one or more second NMDs for a pre-defined time period; the first NMD querying, via a network interface of the first NMD, one or more servers of a particular voice assistant service with the voice command following the detected wake-word within the recorded audio; the first NMD receiving, from one or more servers of the particular voice assistant service via the network interface in response to the query, a playback command corresponding to the voice command; and the first NMD playing back audio content according to the playback command via one or more amplifiers configured to drive one or more speakers. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification