Noise classification for event detection
First Claim
Patent Images
1. A method comprising:
- detecting sound via one or more microphones of a network microphone device (NMD), wherein the detected sound includes a voice utterance;
capturing first sound data in a first buffer of the NMD based on the detected sound;
analyzing, via the NMD, the first sound data to detect a wake word;
based on the analyzed first sound data, detecting the wake word;
after detecting the wake word, transmitting at least the voice utterance to one or more remote computing devices associated with a voice assistant service;
detecting additional sound via the one or more microphones;
capturing second sound data in the first buffer based on the detected additional sound;
analyzing, via the NMD, the second sound data to detect the wake word, wherein the wake word is not detected based on the analyzed second sound data;
capturing metadata associated with the detected additional sound in a second buffer of the NMD;
processing the metadata to classify one or more noises in the detected additional sound; and
causing the NMD to perform an action based on the classification of the respective one or more noises.
2 Assignments
0 Petitions
Accused Products
Abstract
In one aspect, a network microphone device includes a plurality of microphones and is configured to detect sound via the one or more microphones. The network microphone device may capture sound data based on the detected sound in a first buffer, and capture metadata associated with the detected sound in a second buffer. The network microphone device may classify one or more noises in the detected sound and cause the network microphone device to perform an action based on the classification of the respective one or more noises.
617 Citations
20 Claims
-
1. A method comprising:
-
detecting sound via one or more microphones of a network microphone device (NMD), wherein the detected sound includes a voice utterance; capturing first sound data in a first buffer of the NMD based on the detected sound; analyzing, via the NMD, the first sound data to detect a wake word; based on the analyzed first sound data, detecting the wake word; after detecting the wake word, transmitting at least the voice utterance to one or more remote computing devices associated with a voice assistant service; detecting additional sound via the one or more microphones; capturing second sound data in the first buffer based on the detected additional sound; analyzing, via the NMD, the second sound data to detect the wake word, wherein the wake word is not detected based on the analyzed second sound data; capturing metadata associated with the detected additional sound in a second buffer of the NMD; processing the metadata to classify one or more noises in the detected additional sound; and causing the NMD to perform an action based on the classification of the respective one or more noises. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A network microphone device (NMD), comprising:
-
one or more processors; one or more microphones; a first buffer; a second buffer; a tangible, non-transitory, computer-readable medium storing instructions executable by the one or more processors to cause the NMD to perform operations comprising; detecting sound via one or more microphones of the NMD, wherein the detected sound includes a voice utterance; capturing first sound data in the first buffer based on the detected sound; analyzing the first sound data to detect a wake word; based on the analyzed first sound data, detecting the wake word; after detecting the wake word, transmitting at least the voice utterance to one or more remote computing devices associated with a voice assistant service; detecting additional sound via the one or more microphones; capturing second sound data in the first buffer based on the detected additional sound; analyzing the second sound data to detect the wake word, wherein the wake word is not detected based on the analyzed second sound data; capturing metadata associated with the detected additional sound in the second buffer; processing the metadata to classify one or more noises in the detected additional sound; and performing an action based on the classification of the respective one or more noises. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. Tangible, non-transitory, computer-readable medium storing instructions executable by one or more processors to cause a network microphone device (NMD) to perform operations comprising:
-
detecting sound via one or more microphones of the NMD, wherein the detected sound includes a voice utterance; capturing first sound data in a first buffer of the NMD based on the detected sound; analyzing the first sound data to detect a wake word; based on the analyzed first sound data, detecting the wake word; after detecting the wake word, transmitting at least the voice utterance to one or more remote computing devices associated with a voice assistant service; detecting additional sound via the one or more microphones; capturing second sound data in the first buffer based on the detected additional sound; analyzing the second sound data to detect the wake word, wherein the wake word is not detected based on the analyzed second sound data; capturing metadata associated with the detected additional sound in a second buffer of the NMD; processing the metadata to classify one or more noises in the detected additional sound; and performing an action based on the classification of the respective one or more noises. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification