Trigger word based beam selection

  • US 10,304,475 B1
  • Filed: 08/14/2017
  • Issued: 05/28/2019
  • Est. Priority Date: 08/14/2017
  • Status: Active Grant
First Claim

1. A computer-implemented method comprising:

    receiving input audio data corresponding to input audio captured by a microphone array;

    performing beamforming on the input audio data to determine first beamformed audio data corresponding to a first direction and second beamformed audio data corresponding to a second direction;

    processing the first beamformed audio data to determine a first plurality of feature vectors corresponding to a first time period;

    processing the first plurality of feature vectors using a first neural network to determine a first score, the first score corresponding to a likelihood that at least a first portion of a wakeword is represented in the first beamformed audio data corresponding to the first time period;

    processing the second beamformed audio data to determine a second plurality of feature vectors corresponding to a second time period;

    processing the second plurality of feature vectors using a second neural network to determine a second score, the second score corresponding to a likelihood that at least a second portion of the wakeword is represented in the second beamformed audio data corresponding to the second time period;

    determining, based on the first score exceeding a threshold, that the first portion of the wakeword is represented in the first beamformed audio data;

    determining, based on the second score exceeding the threshold, that the second portion of the wakeword is represented in the second beamformed audio data;

    determining that the first portion of the wakeword represented in the first beamformed audio data corresponds to the first time period;

    determining that the second portion of the wakeword represented in the second beamformed audio data corresponds to the second time period;

    selecting the first beamformed audio data in response to the first time period being prior to the second time period; and

    sending the first beamformed audio data for further processing.
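
The selection steps of claim 1 can be illustrated with a minimal sketch. It is not the patented implementation: the names `BeamDetection` and `select_beam` and the numeric values are hypothetical, and the sketch assumes that beamforming, feature extraction, and neural-network scoring have already produced one score and one time period per beam. It only shows the final logic: keep the beams whose score exceeds the threshold, then choose the one whose detected wakeword portion occurs earliest.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class BeamDetection:
    """Hypothetical per-beam result produced by an upstream wakeword detector."""
    beam_index: int    # which beamformed direction this result belongs to
    score: float       # likelihood-style score from that beam's detector
    start_time: float  # start of the time period containing the wakeword portion, in seconds


def select_beam(
    detections: Sequence[BeamDetection], threshold: float
) -> Optional[BeamDetection]:
    """Return the above-threshold detection with the earliest time period, or None."""
    # Keep only beams whose score exceeds the threshold.
    detected = [d for d in detections if d.score > threshold]
    if not detected:
        return None
    # Choose the beam in which the wakeword portion was detected first.
    return min(detected, key=lambda d: d.start_time)


# Illustrative values only: beam 2 is earliest but falls below the threshold,
# so the earliest beam that passes the threshold (beam 0) is selected.
example = [
    BeamDetection(beam_index=0, score=0.90, start_time=1.20),
    BeamDetection(beam_index=1, score=0.95, start_time=1.25),
    BeamDetection(beam_index=2, score=0.30, start_time=1.00),
]
chosen = select_beam(example, threshold=0.5)
```

In this sketch the selected beam's audio would then be passed on for further processing; the higher score on beam 1 does not matter, because the claim selects on time order rather than on score.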
