Noise classification for event detection

US 10,871,943 B1
Filed: 07/31/2019
Issued: 12/22/2020
Est. Priority Date: 07/31/2019
Status: Active Grant

First Claim

Patent Images

1. A method comprising:

detecting sound via one or more microphones of a network microphone device (NMD), wherein the detected sound includes a voice utterance;

capturing first sound data in a first buffer of the NMD based on the detected sound;

analyzing, via the NMD, the first sound data to detect a wake word;

based on the analyzed first sound data, detecting the wake word;

after detecting the wake word, transmitting at least the voice utterance to one or more remote computing devices associated with a voice assistant service;

detecting additional sound via the one or more microphones;

capturing second sound data in the first buffer based on the detected additional sound;

analyzing, via the NMD, the second sound data to detect the wake word, wherein the wake word is not detected based on the analyzed second sound data;

capturing metadata associated with the detected additional sound in a second buffer of the NMD;

processing the metadata to classify one or more noises in the detected additional sound; and

causing the NMD to perform an action based on the classification of the respective one or more noises.

View all claims

2 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

In one aspect, a network microphone device includes a plurality of microphones and is configured to detect sound via the one or more microphones. The network microphone device may capture sound data based on the detected sound in a first buffer, and capture metadata associated with the detected sound in a second buffer. The network microphone device may classify one or more noises in the detected sound and cause the network microphone device to perform an action based on the classification of the respective one or more noises.

617 Citations

20 Claims

1. A method comprising:
- detecting sound via one or more microphones of a network microphone device (NMD), wherein the detected sound includes a voice utterance;
  
  capturing first sound data in a first buffer of the NMD based on the detected sound;
  
  analyzing, via the NMD, the first sound data to detect a wake word;
  
  based on the analyzed first sound data, detecting the wake word;
  
  after detecting the wake word, transmitting at least the voice utterance to one or more remote computing devices associated with a voice assistant service;
  
  detecting additional sound via the one or more microphones;
  
  capturing second sound data in the first buffer based on the detected additional sound;
  
  analyzing, via the NMD, the second sound data to detect the wake word, wherein the wake word is not detected based on the analyzed second sound data;
  
  capturing metadata associated with the detected additional sound in a second buffer of the NMD;
  
  processing the metadata to classify one or more noises in the detected additional sound; and
  
  causing the NMD to perform an action based on the classification of the respective one or more noises.
- View Dependent Claims (2, 3, 4, 5, 6, 7)
- - 2. The method of claim 1, wherein:
    - the second sound data transmitted to the one or more servers comprises recorded audio; and
      
      the metadata comprises spectral information that is temporally disassociated from the recorded audio.
  - 3. The method of claim 1, wherein processing the metadata comprises transmitting the metadata to one or more other remote servers for analyzing the metadata.
  - 4. The method of claim 1, wherein processing the metadata comprises locally analyzing the metadata and classifying the one or more noises via the NMD.
  - 5. The method of claim 1, wherein classifying the one or more noises comprises comparing the metadata to reference metadata associated with known noise events.
  - 6. The method of claim 1, wherein causing the NMD to perform an action comprises at least one of:
    - playing back a sound via the NMD, sending a notification to a user'"'"'s mobile computing device, or flashing a light.
  - 7. The method of claim 1, wherein causing the NMD to perform an action is based on at least one of a sound pressure level or a directionality of the sound.

8. A network microphone device (NMD), comprising:
- one or more processors;
  
  one or more microphones;
  
  a first buffer;
  
  a second buffer;
  
  a tangible, non-transitory, computer-readable medium storing instructions executable by the one or more processors to cause the NMD to perform operations comprising;
  
  detecting sound via one or more microphones of the NMD, wherein the detected sound includes a voice utterance;
  
  capturing first sound data in the first buffer based on the detected sound;
  
  analyzing the first sound data to detect a wake word;
  
  based on the analyzed first sound data, detecting the wake word;
  
  after detecting the wake word, transmitting at least the voice utterance to one or more remote computing devices associated with a voice assistant service;
  
  detecting additional sound via the one or more microphones;
  
  capturing second sound data in the first buffer based on the detected additional sound;
  
  analyzing the second sound data to detect the wake word, wherein the wake word is not detected based on the analyzed second sound data;
  
  capturing metadata associated with the detected additional sound in the second buffer;
  
  processing the metadata to classify one or more noises in the detected additional sound; and
  
  performing an action based on the classification of the respective one or more noises.
- View Dependent Claims (9, 10, 11, 12, 13, 14)
- - 9. The NMD of claim 8, wherein:
    - the second sound data transmitted to the one or more servers comprises recorded audio; and
      
      the metadata comprises spectral information that is temporally disassociated from the recorded audio.
  - 10. The NMD of claim 8, wherein processing the metadata comprises transmitting the metadata to one or more other remote servers for analyzing the metadata.
  - 11. The NMD of claim 8, wherein processing the metadata comprises locally analyzing the metadata and classifying the one or more noises via the NMD.
  - 12. The NMD of claim 8, wherein classifying the one or more noises comprises comparing the metadata to reference metadata associated with known noise events.
  - 13. The NMD of claim 8, wherein performing an action comprises at least one of:
    - playing back a sound via the NMD, sending a notification to a user'"'"'s mobile computing device, or flashing a light.
  - 14. The NMD of claim 8, wherein causing the NMD to perform an action is based on at least one of a sound pressure level or a directionality of the sound.

15. Tangible, non-transitory, computer-readable medium storing instructions executable by one or more processors to cause a network microphone device (NMD) to perform operations comprising:
- detecting sound via one or more microphones of the NMD, wherein the detected sound includes a voice utterance;
  
  capturing first sound data in a first buffer of the NMD based on the detected sound;
  
  analyzing the first sound data to detect a wake word;
  
  based on the analyzed first sound data, detecting the wake word;
  
  after detecting the wake word, transmitting at least the voice utterance to one or more remote computing devices associated with a voice assistant service;
  
  detecting additional sound via the one or more microphones;
  
  capturing second sound data in the first buffer based on the detected additional sound;
  
  analyzing the second sound data to detect the wake word, wherein the wake word is not detected based on the analyzed second sound data;
  
  capturing metadata associated with the detected additional sound in a second buffer of the NMD;
  
  processing the metadata to classify one or more noises in the detected additional sound; and
  
  performing an action based on the classification of the respective one or more noises.
- View Dependent Claims (16, 17, 18, 19, 20)
- - 16. The tangible, non-transitory, computer-readable medium of claim 15, wherein:
    - the second sound data transmitted to the one or more servers comprises recorded audio; and
      
      the metadata comprises spectral information that is temporally disassociated from the recorded audio.
  - 17. The tangible, non-transitory, computer-readable medium of claim 15, wherein processing the metadata comprises transmitting the metadata to one or more other remote servers for analyzing the metadata.
  - 18. The tangible, non-transitory, computer-readable medium of claim 15, wherein processing the metadata comprises locally analyzing the metadata and classifying the one or more noises via the NMD.
  - 19. The tangible, non-transitory, computer-readable medium of claim 15, wherein classifying the one or more noises comprises comparing the metadata to reference metadata associated with known noise events.
  - 20. The tangible, non-transitory, computer-readable medium of claim 15, wherein performing an action comprises at least one of:
    - playing back a sound via the NMD, sending a notification to a user'"'"'s mobile computing device, or flashing a light.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Sonos, Inc.
Original Assignee
Sonos, Inc.
Inventors
D'Amato, Nick, Soto, Kurt Thomas, Smith, Connor Kristopher
Primary Examiner(s)
Mei, Xu

Application Number

US16/528,016
Time in Patent Office

510 Days
Field of Search

381 56, 381 59, 381110, 700 94
US Class Current
CPC Class Codes

G06F 3/162   Interface to dedicated audi...

G06F 3/165   Management of the audio str...

G06F 3/167   Audio in a user interface, ...

G08B 13/1672   using sonic detecting means...

G10L 15/22   Procedures used during a sp...

G10L 15/30   Distributed recognition, e....

H04L 12/2809   indicating that an applianc...

H04R 2227/005   Audio distribution systems ...

H04R 3/12   for distributing signals to...

Noise classification for event detection

First Claim

2 Assignments

0 Petitions

Accused Products

Abstract

617 Citations

20 Claims

Specification

Solutions

Use Cases

Quick Links

Noise classification for event detection

First Claim

2 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

617 Citations

20 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links