Anchored speech detection and speech recognition

  • US 10,373,612 B2
  • Filed: 06/29/2016
  • Issued: 08/06/2019
  • Est. Priority Date: 03/21/2016
  • Status: Active Grant
First Claim

1. A computer implemented method for identifying speech from a desired speaker for automatic speech recognition (ASR), the method comprising:

    receiving audio data corresponding to speech, the audio data comprising a plurality of audio frames;

    processing the plurality of audio frames to determine a first plurality of feature vectors corresponding to a first portion of the audio data and a second plurality of feature vectors corresponding to a second portion of the audio data;

    determining that the first plurality of feature vectors corresponds to a wakeword;

    processing the first plurality of feature vectors with a recurrent neural network encoder to determine a reference feature vector corresponding to speech from a desired speaker;

    processing the second plurality of feature vectors, and the reference feature vector, using a neural-network classifier to determine a first score corresponding to a first feature vector in the second plurality, the first score corresponding to a likelihood that the first feature vector corresponds to audio spoken by the desired speaker;

    determining that the score is above a threshold;

    creating an indication that the first feature vector corresponds to speech from the desired speaker;

    determining a first weight corresponding to the first feature vector based on the first feature vector corresponding to speech from the desired speaker; and

    performing ASR using the first weight and the first feature vector.
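
The steps recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation: mean pooling stands in for the recurrent neural network encoder, and a rescaled cosine similarity stands in for the neural-network classifier. The function names, the threshold of 0.8, and the reduced weight of 0.1 for other frames are all hypothetical choices made for illustration. The final ASR step is not shown; the sketch stops at the per-frame weights an ASR component might consume.

```python
import math
from typing import List, Sequence, Tuple

Vector = Sequence[float]


def encode_reference(wakeword_frames: List[Vector]) -> List[float]:
    """Stand-in for the recurrent encoder (assumption): mean-pool the
    wakeword feature vectors into one reference feature vector."""
    n = len(wakeword_frames)
    dim = len(wakeword_frames[0])
    return [sum(frame[i] for frame in wakeword_frames) / n for i in range(dim)]


def speaker_score(frame: Vector, reference: Vector) -> float:
    """Stand-in for the classifier (assumption): cosine similarity between
    the frame and the reference, rescaled from [-1, 1] to [0, 1]."""
    dot = sum(a * b for a, b in zip(frame, reference))
    norm = math.sqrt(sum(a * a for a in frame)) * math.sqrt(
        sum(b * b for b in reference)
    )
    if norm == 0.0:
        return 0.0  # a zero vector carries no speaker evidence
    return 0.5 * (dot / norm + 1.0)


def weight_frames(
    frames: List[Vector],
    reference: Vector,
    threshold: float = 0.8,   # hypothetical value
    low_weight: float = 0.1,  # hypothetical weight for other frames
) -> List[Tuple[float, bool, float]]:
    """Return (score, is_desired_speaker, weight) for each frame that
    follows the wakeword."""
    results = []
    for frame in frames:
        score = speaker_score(frame, reference)
        is_desired = score > threshold              # threshold comparison
        weight = 1.0 if is_desired else low_weight  # weight from the indication
        results.append((score, is_desired, weight))
    return results
```

Under these assumptions, a caller would pass the first plurality of feature vectors (the wakeword portion) to `encode_reference`, then pass the second plurality together with the resulting reference vector to `weight_frames`. Frames resembling the wakeword speaker keep full weight, while the others are down-weighted.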
