ANCHORED SPEECH DETECTION AND SPEECH RECOGNITION

  • US 20170270919A1
  • Filed: 06/29/2016
  • Published: 09/21/2017
  • Est. Priority Date: 03/21/2016
  • Status: Active Grant
First Claim

1. A computer implemented method for identifying speech from a desired speaker for automatic speech recognition (ASR), the method comprising:

    receiving audio data corresponding to speech, the audio data comprising a plurality of audio frames;

    processing the plurality of audio frames to determine a first plurality of audio feature vectors corresponding to a first portion of the audio data and a second plurality of audio feature vectors corresponding to a second portion of the audio data;

    determining that the first plurality of audio feature vectors corresponds to a wakeword;

    processing the first plurality of audio feature vectors with a recurrent neural network encoder to determine a reference feature vector corresponding to speech from a desired speaker;

    processing the second plurality of audio feature vectors, and the reference feature vector, using a neural-network classifier to determine a first score corresponding to a first audio feature vector in the second plurality, the first score corresponding to a likelihood that the first audio feature vector corresponds to audio spoken by the desired speaker;

    determining that the first score is above a threshold;

    creating an indication that the first feature vector corresponds to speech from the desired speaker;

    determining a first weight corresponding to the first feature vector based on the first feature vector corresponding to speech from the desired speaker; and

    performing ASR using the first weight and the first feature vector.
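
The steps recited in the claim can be sketched in code. The example below is only an illustration under stated assumptions, not the implementation described in the patent: the recurrent encoder is replaced by a toy elementwise recurrence with no trained weights, the neural-network classifier is replaced by a logistic function of cosine similarity, and the names `encode_reference`, `speaker_score`, `weight_frames`, `decay`, `sharpness`, and `low_weight` are hypothetical. The final ASR step is not shown; the sketch stops at the per-frame indication and weight that an ASR component could consume.

```python
import math
from typing import List, Tuple

Vector = List[float]


def encode_reference(wakeword_frames: List[Vector], decay: float = 0.5) -> Vector:
    """Toy stand-in for a trained RNN encoder (assumption, not the patent's model).

    Applies h_t = tanh(decay * h_{t-1} + x_t) elementwise over the wakeword
    feature vectors and returns the final hidden state as the reference vector.
    """
    hidden = [0.0] * len(wakeword_frames[0])
    for frame in wakeword_frames:
        hidden = [math.tanh(decay * h + x) for h, x in zip(hidden, frame)]
    return hidden


def speaker_score(frame: Vector, reference: Vector, sharpness: float = 5.0) -> float:
    """Toy stand-in for the classifier: logistic of scaled cosine similarity."""
    dot = sum(f * r for f, r in zip(frame, reference))
    norm = math.sqrt(sum(f * f for f in frame)) * math.sqrt(sum(r * r for r in reference))
    cosine = dot / norm if norm > 0 else 0.0
    return 1.0 / (1.0 + math.exp(-sharpness * cosine))


def weight_frames(
    frames: List[Vector],
    reference: Vector,
    threshold: float = 0.5,
    low_weight: float = 0.1,
) -> List[Tuple[bool, float]]:
    """Return (is_desired_speaker, weight) for each post-wakeword frame.

    Frames scoring above the threshold get weight 1.0; the others get
    low_weight (a hypothetical choice for down-weighting other speakers).
    """
    results = []
    for frame in frames:
        is_desired = speaker_score(frame, reference) > threshold
        results.append((is_desired, 1.0 if is_desired else low_weight))
    return results


# First portion: feature vectors already determined to contain the wakeword.
wakeword_frames = [[0.9, 0.1], [0.8, 0.2], [1.0, 0.0]]
# Second portion: feature vectors for the audio that follows the wakeword.
command_frames = [[0.7, 0.1], [-0.6, 0.9]]

reference = encode_reference(wakeword_frames)
labels = weight_frames(command_frames, reference)
```

With these made-up two-dimensional vectors, the first post-wakeword frame points in roughly the same direction as the reference vector and is marked as the desired speaker, while the second points away from it and receives the lower weight. In a real system each stage would be a trained model operating on much higher-dimensional acoustic features.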
