Optimization of network microphone devices using noise classification
First Claim
Patent Images
1. A method comprising:
- detecting sound via one or more microphones of a network microphone device (NMD);
capturing sound data in a first buffer of the NMD based on the detected sound;
analyzing, via the NMD, the sound data to detect a trigger event;
transmitting, via the NMD, the sound data to a first one or more remote computing devices associated with a voice assistant service (VAS);
capturing metadata associated with the sound data in at least a second buffer of the NMD, wherein the sound data is not derivable from the metadata;
transmitting, via the NMD, the metadata absent the sound data to a second one or more remote computing devices associated with a remote evaluator, the remote evaluator being distinct from the VAS;
after detecting the trigger event, analyzing the metadata to classify noise in the sound data; and
based on the classified noise, modifying at least one performance parameter of the NMD.
2 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods for optimizing network microphone devices using noise classification are disclosed herein. In one example, individual microphones of a network microphone device (NMD) detect sound. The sound data is analyzed to detect a trigger event such as a wake word. Metadata associated with the sound data is captured in a lookback buffer of the NMD. After detecting the trigger event, the metadata is analyzed to classify noise in the sound data. Based on the classified noise, at least one performance parameter of the NMD is modified.
-
Citations
18 Claims
-
1. A method comprising:
-
detecting sound via one or more microphones of a network microphone device (NMD); capturing sound data in a first buffer of the NMD based on the detected sound; analyzing, via the NMD, the sound data to detect a trigger event; transmitting, via the NMD, the sound data to a first one or more remote computing devices associated with a voice assistant service (VAS); capturing metadata associated with the sound data in at least a second buffer of the NMD, wherein the sound data is not derivable from the metadata; transmitting, via the NMD, the metadata absent the sound data to a second one or more remote computing devices associated with a remote evaluator, the remote evaluator being distinct from the VAS; after detecting the trigger event, analyzing the metadata to classify noise in the sound data; and based on the classified noise, modifying at least one performance parameter of the NMD. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A network microphone device (NMD), comprising:
-
one or more processors; one or more microphones; and a tangible, non-transitory, computer-readable medium storing instructions executable by the one or more processors to cause the NMD to perform operations comprising; detecting sound via the one or more microphones; capturing sound data in a first buffer of the NMD based on the detected sound; analyzing, via the NMD, the sound data to detect a trigger event; transmitting, via the NMD, the sound data to a first one or more remote computing devices associated with a voice assistant service (VAS); capturing metadata associated with the sound data in at least a second buffer of the NMD, wherein the sound data is not derivable from the metadata; transmitting, via the NMD, the metadata absent the sound data to a second one or more remote computing devices associated with a remote evaluator, the remote evaluator being distinct from the VAS; after detecting the trigger event, analyzing the metadata to classify noise in the sound data; and based on the classified noise, modifying at least one performance parameter of the NMD. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. Tangible, non-transitory, computer-readable medium storing instructions executable by one or more processors to cause a network microphone device (NMD) to perform operations comprising:
-
detecting sound via one or more microphones of the NMD; capturing sound data in a first buffer of the NMD based on the detected sound; analyzing, via the NMD, the sound data to detect a trigger event; transmitting, via the NMD, the sound data to a first one or more remote computing devices associated with a voice assistant service (VAS); capturing metadata associated with the sound data in at least a second buffer of the NMD, wherein the sound data is not derivable from the metadata; transmitting, via the NMD, the metadata absent the sound data to a second one or more remote computing devices associated with a remote evaluator, the remote evaluator being distinct from the VAS; after detecting the trigger event, analyzing the metadata to classify noise in the sound data; and based on the classified noise, modifying at least one performance parameter of the NMD. - View Dependent Claims (14, 15, 16, 17, 18)
-
Specification