×

Home graph

  • US 10,818,290 B2
  • Filed: 12/11/2018
  • Issued: 10/27/2020
  • Est. Priority Date: 12/11/2017
  • Status: Active Grant
First Claim
Patent Images

1. A system comprising one or more servers of a voice assistant service, wherein the one or more servers are configured to communicate with a network microphone device (NMD) of a media playback system comprising multiple devices connected via a local area network,wherein the NMD is configured to perform operations comprising:

  • recording, via a microphone array, audio into a buffer;

    monitoring the recorded audio for wake-words; and

    when a wake-word is detected in the recorded audio, sending, via a network interface to the voice assistant service, data representing an audio recording from the buffer of the NMD, the audio recording comprising a voice input following the detected wake-word within the buffer; and

    wherein the one or more servers are configured to perform operations comprising;

    storing a data structure comprising nodes in a hierarchy representing the media playback system, wherein the data structure comprises (i) a root node representing the media playback system as a Home of the hierarchy, (ii) one or more first nodes in a first level, the first nodes representing respective devices of the media playback system as Sets of the hierarchy, and (ii) one or more second nodes in a second level as parents to one or more respective child first nodes to represent Sets in respective Rooms of the hierarchy, wherein the nodes in the hierarchy are assigned respective names;

    receiving, via a network interface of the one or more servers, data representing the audio recording;

    processing the audio recording to determine one or more voice commands within the voice input, wherein processing the audio recording comprises;

    determining, based on the data structure representing the media playback system, that one or more first voice commands within the voice input represent respective target variables indicating one or more particular nodes of the data structure, each target variable referencing a name of a respective node of the data structure; and

    determining that one or more second voice commands within the voice input correspond to one or more playback commands; and

    causing, via the network interface of the one or more servers, one or more particular playback devices to play back audio content according to the one or more playback commands, wherein the one or more particular playback devices include (a) all playback devices represented by the one or more particular nodes of the data structure and (b) all playback devices represented by child nodes of the one or more particular nodes of the data structure.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×