×

Audio playback settings for voice interaction

  • US 9,942,678 B1
  • Filed: 09/27/2016
  • Issued: 04/10/2018
  • Est. Priority Date: 09/27/2016
  • Status: Active Grant
First Claim
Patent Images

1. A playback device comprising:

  • a network interface;

    one or more microphones;

    an audio stage comprising an amplifier;

    one or more speakers;

    one or more processors;

    a housing, the housing carrying at least the network interface, the one or more microphones, the audio stage, the one or more speakers, the one or more processors, and a computer-readable media having stored therein instructions executable by the one or more processors to cause the playback device to perform operations comprising;

    while playing back first audio in a given environment at a given loudness via the audio stage and the one or more speakers;

    (a) capturing, via the one or more microphones, a voice input;

    (b) determining that the captured voice input includes audio data representing a wake word to invoke a voice assistant service;

    (c) in response to determining that the captured voice input includes audio data representing the wake word to invoke the voice assistant service;

    (i) sending, via the network interface to one or more servers of the voice assistant service, the voice input and (ii) determining a loudness of background noise in the given environment, wherein the background noise comprises ambient noise in the given environment;

    (d) after determining the loudness of background noise, receiving, via the network interface from the one or more servers of the voice assistant service in response to the voice input, second audio data representing a spoken response to the voice input;

    in response to receiving the second audio data representing the spoken response to the voice input, ducking the first audio in proportion to a difference between the given loudness of the first audio and the determined loudness of the background noise; and

    playing back the ducked first audio concurrently with the second audio representing the spoken response to the voice input via the audio stage and the one or more speakers.

View all claims
  • 4 Assignments
Timeline View
Assignment View
    ×
    ×