Networked devices, systems, and methods for intelligently deactivating wake-word engines
First Claim
1. A playback device comprising:
a network interface;
one or more processors;
at least one microphone;
at least one speaker configured to output audio based on an audio stream;
a first wake-word engine configured to receive as input sound data based on sound detected by the at least one microphone, wherein the first wake-word engine is configured according to a first sensitivity level for false positives of a particular wake word;
a second wake-word engine configured to receive as input the audio stream, wherein the second wake-word engine is configured according to a second sensitivity level for false positives of the particular wake word that is more sensitive than the first sensitivity level;
a tangible, non-transitory, computer-readable medium having instructions stored thereon that are executable by the one or more processors to cause the playback device to:
identify in the audio stream, via the second wake-word engine, a false wake word for the first wake-word engine; and
based on identifying the false wake word, (i) deactivate the first wake-word engine and (ii) cause, via the network interface, at least one network microphone device to deactivate a wake-word engine of the at least one network microphone device for a particular amount of time;
while the first wake-word engine is deactivated, cause the at least one speaker to output the audio based on the audio stream; and
after a predetermined amount of time has elapsed, reactivate the first wake-word engine.
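The relationship between the two sensitivity levels in claim 1 can be illustrated with a minimal Python sketch. It assumes, purely for illustration, that each wake-word engine reduces its input to a confidence score and fires when that score meets a threshold, so that "more sensitive" corresponds to a lower threshold. The class `WakeWordEngine`, the function `is_covered`, and the threshold values are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass


@dataclass
class WakeWordEngine:
    """Toy engine: fires when a confidence score meets its threshold."""
    name: str
    threshold: float  # assumption: lower threshold = more sensitive

    def detects(self, score: float) -> bool:
        return score >= self.threshold


# Hypothetical values; the claim only requires the second engine to be more sensitive.
mic_engine = WakeWordEngine("first (microphone input)", threshold=0.80)
stream_engine = WakeWordEngine("second (audio stream input)", threshold=0.50)


def is_covered(first: WakeWordEngine, second: WakeWordEngine, score: float) -> bool:
    """True if a score that triggers the first engine also triggers the second."""
    return (not first.detects(score)) or second.detects(score)
```

Under this assumption, any sound that would trigger the first engine would also be flagged by the second engine, so the second engine can screen the audio stream for candidate false wake words before they reach the microphone.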
2 Assignments
0 Petitions
Abstract
In one aspect, a playback device is configured to identify in an audio stream, via a second wake-word engine, a false wake word for a first wake-word engine that is configured to receive as input sound data based on sound detected by a microphone. The first and second wake-word engines are configured according to different sensitivity levels for false positives of a particular wake word. Based on identifying the false wake word, the playback device is configured to (i) deactivate the first wake-word engine and (ii) cause at least one network microphone device to deactivate a wake-word engine for a particular amount of time. While the first wake-word engine is deactivated, the playback device is configured to cause at least one speaker to output audio based on the audio stream. After a predetermined amount of time has elapsed, the playback device is configured to reactivate the first wake-word engine.
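The sequence described in the abstract (identify, deactivate, notify, play, reactivate) can be sketched as follows. This is a simplified illustration under stated assumptions, not the patented implementation: `PlaybackDevice`, `Engine`, `stream_detector`, `notify_peers`, and the 2-second `suppress_seconds` default are all hypothetical names and values. Time is passed in explicitly so the behavior is easy to follow.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class Engine:
    active: bool = True


@dataclass
class PlaybackDevice:
    stream_detector: Callable[[bytes], bool]  # stands in for the second wake-word engine
    notify_peers: Callable[[float], None]     # stands in for the network interface
    suppress_seconds: float = 2.0             # hypothetical predetermined amount of time
    mic_engine: Engine = field(default_factory=Engine)  # first wake-word engine
    played: List[bytes] = field(default_factory=list)
    _reactivate_at: float = 0.0

    def process(self, chunk: bytes, now: float) -> None:
        # Reactivate the first engine once the predetermined time has elapsed.
        if not self.mic_engine.active and now >= self._reactivate_at:
            self.mic_engine.active = True
        # Screen the audio stream before it is played back.
        if self.stream_detector(chunk):
            self.mic_engine.active = False
            self._reactivate_at = now + self.suppress_seconds
            # Ask other network microphone devices to pause their engines too.
            self.notify_peers(self.suppress_seconds)
        # Audio is output whether or not the first engine is active.
        self.played.append(chunk)
```

A caller would feed each chunk of the audio stream to `process` along with the current time. Here `notify_peers` receives only the suppression duration; a real device would presumably send a network message to each network microphone device.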
783 Citations
20 Claims
1. A playback device comprising:
a network interface;
one or more processors;
at least one microphone;
at least one speaker configured to output audio based on an audio stream;
a first wake-word engine configured to receive as input sound data based on sound detected by the at least one microphone, wherein the first wake-word engine is configured according to a first sensitivity level for false positives of a particular wake word;
a second wake-word engine configured to receive as input the audio stream, wherein the second wake-word engine is configured according to a second sensitivity level for false positives of the particular wake word that is more sensitive than the first sensitivity level;
a tangible, non-transitory, computer-readable medium having instructions stored thereon that are executable by the one or more processors to cause the playback device to:
identify in the audio stream, via the second wake-word engine, a false wake word for the first wake-word engine; and
based on identifying the false wake word, (i) deactivate the first wake-word engine and (ii) cause, via the network interface, at least one network microphone device to deactivate a wake-word engine of the at least one network microphone device for a particular amount of time;
while the first wake-word engine is deactivated, cause the at least one speaker to output the audio based on the audio stream; and
after a predetermined amount of time has elapsed, reactivate the first wake-word engine.
Dependent claims: 2, 3, 4, 5, 6, 7
8. A tangible, non-transitory, computer-readable medium having instructions stored thereon that are executable by one or more processors to cause a playback device to:
identify in an audio stream, via a second wake-word engine, a false wake word for a first wake-word engine that is configured to receive as input sound data based on sound detected by at least one microphone of the playback device, wherein the first wake-word engine is configured according to a first sensitivity level for false positives of a particular wake word, and wherein the second wake-word engine is configured according to a second sensitivity level for false positives of the particular wake word that is more sensitive than the first sensitivity level;
based on identifying the false wake word, (i) deactivate the first wake-word engine and (ii) cause, via a network interface of the playback device, at least one network microphone device to deactivate a wake-word engine of the at least one network microphone device for a particular amount of time;
while the first wake-word engine is deactivated, cause at least one speaker of the playback device to output audio based on the audio stream; and
after a predetermined amount of time has elapsed, reactivate the first wake-word engine.
Dependent claims: 9, 10, 11, 12, 13, 14
15. A computer-implemented method comprising:
identifying in an audio stream, via a second wake-word engine of a playback device, a false wake word for a first wake-word engine that is configured to receive as input sound data based on sound detected by at least one microphone of the playback device, wherein the first wake-word engine is configured according to a first sensitivity level for false positives of a particular wake word, and wherein the second wake-word engine is configured according to a second sensitivity level for false positives of the particular wake word that is more sensitive than the first sensitivity level;
based on identifying the false wake word, (i) deactivating the first wake-word engine and (ii) causing, via a network interface of the playback device, at least one network microphone device to deactivate a wake-word engine of the at least one network microphone device for a particular amount of time;
while the first wake-word engine is deactivated, causing at least one speaker of the playback device to output audio based on the audio stream; and
after a predetermined amount of time has elapsed, reactivating the first wake-word engine.
Dependent claims: 16, 17, 18, 19, 20
Specification