Devices, systems, and methods for distributed voice processing
First Claim
1. A method comprising:
detecting sound via a microphone array of a first playback device;
transmitting data associated with the detected sound from the first playback device to a second playback device over a local area network;
analyzing, via a wake word engine of the second playback device, the transmitted data associated with the detected sound for identification of a wake word;
identifying that the detected sound contains the wake word based on the analysis via the wake word engine;
based on the identification, transmitting sound data corresponding to the detected sound from the second playback device to a remote computing device over a wide area network, wherein the remote computing device is associated with a particular voice assistant service;
receiving via the second playback device a response from the remote computing device, wherein the response is based on the detected sound;
transmitting a message from the second playback device to the first playback device over the local area network, wherein the message is based on the response from the remote computing device and includes instructions to perform an action; and
performing the action via the first playback device.
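The sequence of steps in this claim can be sketched as a small in-memory simulation. This is only an illustration under stated assumptions: the class names, the text strings standing in for audio, the substring test standing in for a wake word engine, and the `FakeVoiceAssistant` are all hypothetical and do not come from the patent. Direct method calls stand in for the LAN and WAN transmissions.

```python
from dataclasses import dataclass, field


@dataclass
class FakeVoiceAssistant:
    """Hypothetical stand-in for the remote computing device reached over the WAN."""

    def handle(self, sound_data: str) -> dict:
        # Toy intent parsing: anything mentioning "pause" becomes a pause action.
        action = "pause" if "pause" in sound_data else "noop"
        return {"action": action}


@dataclass
class SecondPlaybackDevice:
    """Hosts the wake word engine and talks to the voice assistant service."""

    wake_word: str
    assistant: FakeVoiceAssistant

    def on_lan_data(self, sound_data: str):
        # Toy wake word engine: a substring test on a text transcript.
        if self.wake_word not in sound_data.lower():
            return None  # no wake word, so nothing is sent over the WAN
        response = self.assistant.handle(sound_data)  # stands in for the WAN round trip
        return {"instructions": response["action"]}   # message returned over the LAN


@dataclass
class FirstPlaybackDevice:
    """Captures sound and forwards it; any local wake word engine is omitted here."""

    peer: SecondPlaybackDevice
    performed: list = field(default_factory=list)

    def detect_sound(self, sound_data: str) -> None:
        message = self.peer.on_lan_data(sound_data)  # stands in for the LAN transmission
        if message is not None:
            self.performed.append(message["instructions"])  # perform the action


second = SecondPlaybackDevice("hey assistant", FakeVoiceAssistant())
first = FirstPlaybackDevice(second)
first.detect_sound("hey assistant pause the music")
first.detect_sound("just talking")
print(first.performed)
```

In this sketch, only the first utterance contains the wake word, so only it reaches the assistant and results in an action on the first device.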
Abstract
Systems and methods for distributed voice processing are disclosed herein. In one example, the method includes detecting sound via a microphone array of a first playback device and analyzing, via a first wake-word engine of the first playback device, the detected sound. The first playback device may transmit data associated with the detected sound to a second playback device over a local area network. A second wake-word engine of the second playback device may analyze the transmitted data associated with the detected sound. The method may further include identifying that the detected sound contains either a first wake word or a second wake word based on the analysis via the first and second wake-word engines, respectively. Based on the identification, sound data corresponding to the detected sound may be transmitted over a wide area network to a remote computing device associated with a particular voice assistant service.
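The two-engine arrangement described in the abstract might be sketched as follows. This is a minimal sketch, not the patented implementation: `WakeWordEngine`, `identify`, and the example wake words are hypothetical names, text stands in for audio, and the engines are consulted one after the other purely for simplicity (the abstract does not say whether the analyses run in sequence or in parallel).

```python
from typing import Optional


class WakeWordEngine:
    """Toy engine: matches one wake word in a text stand-in for audio."""

    def __init__(self, wake_word: str) -> None:
        self.wake_word = wake_word

    def detects(self, sound: str) -> bool:
        return self.wake_word in sound.lower()


def identify(sound: str, first: WakeWordEngine, second: WakeWordEngine) -> Optional[str]:
    """Return the wake word found by either device's engine, or None."""
    # The first device analyzes the sound it detected with its own engine.
    if first.detects(sound):
        return first.wake_word
    # The data sent over the LAN is analyzed by the second device's engine.
    if second.detects(sound):
        return second.wake_word
    return None


first_engine = WakeWordEngine("hey alpha")   # hypothetical first wake word
second_engine = WakeWordEngine("hey beta")   # hypothetical second wake word
print(identify("Hey Beta, play some jazz", first_engine, second_engine))
```

Only when `identify` returns a wake word would the sound data be forwarded over the wide area network to the associated voice assistant service.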
459 Citations
20 Claims
1. A method comprising the steps recited in full under First Claim above.
Dependent claims: 2, 3, 4, 5, 6, 7
8. A first playback device comprising:
one or more processors;
a computer-readable medium storing instructions that, when executed by the one or more processors, cause the first playback device to perform operations comprising:
receiving, from a second playback device over a local area network, data associated with sound detected via a microphone array of the second playback device;
analyzing, via a wake word engine of the first playback device, the data associated with the detected sound for identification of a wake word;
identifying that the detected sound contains the wake word based on the analysis via the wake word engine;
based on the identification, transmitting sound data corresponding to the detected sound to a remote computing device over a wide area network, wherein the remote computing device is associated with a particular voice assistant service;
receiving a response from the remote computing device, wherein the response is based on the detected sound; and
transmitting a message to the second playback device over the local area network, wherein the message is based on the response from the remote computing device and includes instructions for the second playback device to perform an action.
Dependent claims: 9, 10, 11, 12, 13, 14
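Claim 8 is written from the point of view of the device that hosts the wake word engine. A rough sketch of that device's handler, with the network transports injected as plain callables, might look like this. The class name `WakeWordHost`, the callables `send_to_assistant` and `send_to_peer`, the byte-string matching used as a wake word engine, and the message shape are all assumptions made for illustration; the claim does not specify any of them.

```python
from typing import Callable


class WakeWordHost:
    """Sketch of the claim 8 device: analyzes a peer's audio and relays the result."""

    def __init__(
        self,
        wake_word: str,
        send_to_assistant: Callable[[bytes], dict],
        send_to_peer: Callable[[dict], None],
    ) -> None:
        self.wake_word = wake_word.encode()
        self.send_to_assistant = send_to_assistant  # hypothetical WAN transport
        self.send_to_peer = send_to_peer            # hypothetical LAN transport

    def on_peer_audio(self, data: bytes) -> bool:
        """Handle data received over the LAN; return True if it was forwarded."""
        if self.wake_word not in data:  # toy wake word engine
            return False                # nothing is sent over the WAN
        response = self.send_to_assistant(data)
        # The message to the peer is based on the assistant's response.
        self.send_to_peer({"instructions": response})
        return True


wan_calls = []
lan_messages = []


def fake_assistant(data: bytes) -> dict:
    """Hypothetical voice assistant service that always asks for more volume."""
    wan_calls.append(data)
    return {"action": "volume_up"}


host = WakeWordHost("computer", fake_assistant, lan_messages.append)
host.on_peer_audio(b"background chatter")
host.on_peer_audio(b"computer turn it up")
print(lan_messages)
```

Injecting the transports keeps the wake word gating logic testable without any real network: sound data leaves the local network only after the wake word is identified.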
15. A system, comprising:
a first playback device comprising:
one or more processors;
a microphone array; and
a first computer-readable medium storing instructions that, when executed by the one or more processors, cause the first playback device to perform first operations, the first operations comprising:
detecting sound via the microphone array;
transmitting data associated with the detected sound to a second playback device over a local area network;
the second playback device comprising:
one or more processors; and
a second computer-readable medium storing instructions that, when executed by the one or more processors, cause the second playback device to perform second operations, the second operations comprising:
analyzing, via a wake word engine of the second playback device, the transmitted data associated with the detected sound from the first playback device for identification of a wake word;
identifying that the detected sound contains the wake word based on the analysis via the wake word engine;
based on the identification, transmitting sound data corresponding to the detected sound to a remote computing device over a wide area network, wherein the remote computing device is associated with a particular voice assistant service;
receiving a response from the remote computing device, wherein the response is based on the detected sound; and
transmitting a message to the first playback device over the local area network, wherein the message is based on the response from the remote computing device and includes instructions to perform an action,
wherein the first computer-readable medium of the first playback device causes the first playback device to perform the action from the instructions received from the second playback device.
Dependent claims: 16, 17, 18, 19, 20
Specification