×

Language model speech endpointing

  • US 10,121,471 B2
  • Filed: 06/29/2015
  • Issued: 11/06/2018
  • Est. Priority Date: 06/29/2015
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented method for determining an endpoint during automatic speech recognition (ASR) processing, the method comprising:

  • receiving audio data representing speech detected using a microphone of a mobile device;

    performing ASR processing on the audio data to determine a plurality of hypotheses;

    determining, for each of the plurality of hypotheses, a respective probability that the respective hypothesis corresponds to the audio data;

    determining, for each of the plurality of hypotheses, a respective number of non-speech audio frames immediately preceding a first point in the audio data;

    determining, for each of the plurality of hypotheses, a respective score by multiplying the probability of the respective hypothesis by a factor corresponding to the number of non-speech audio frames of the respective hypothesis;

    determining a cumulative score by summing the respective scores for each of the plurality of hypotheses;

    determining that the cumulative score exceeds a first threshold; and

    designating the first point as corresponding to a likely endpoint as a result of the cumulative score exceeding the first threshold.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×