Systems and methods for speech transcription

  • US 10,540,957 B2
  • Filed: 06/09/2015
  • Issued: 01/21/2020
  • Est. Priority Date: 12/15/2014
  • Status: Active Grant
First Claim

1. A computer-implemented method for transcribing speech comprising:

  • receiving an input audio from a user;

    normalizing the input audio to make a total power of the input audio consistent with a set of training samples used to train a trained neural network;

    generating a jitter set of audio files from the normalized input audio by translating the normalized input audio by one or more time values;

    for each audio file from the jitter set of audio files, which includes the normalized input audio:

    generating a set of spectrogram frames for each audio file;

    inputting the set of spectrogram frames into a trained neural network;

    obtaining predicted character probabilities outputs from the trained neural network; and

    decoding a transcription of the input audio using the predicted character probabilities outputs from the trained neural network constrained by a language model that interprets a string of characters from the predicted character probabilities outputs as a word or words.
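
The preprocessing steps recited in the claim (power normalization, jitter-set generation, and spectrogram framing) can be illustrated with a minimal sketch. This is not the patented implementation. It assumes NumPy, uses mean squared amplitude as a stand-in for "total power", zero-pads the shifted copies, and picks a Hann window with arbitrary frame and hop sizes. All function names and parameters (`normalize_power`, `jitter_set`, `spectrogram_frames`, `target_power`, `frame_len`, `hop`) are hypothetical. The neural network and the language-model-constrained decoding are omitted.

```python
import numpy as np


def normalize_power(audio, target_power):
    """Scale audio so its mean power matches target_power.

    target_power is assumed to be measured on the training samples.
    """
    power = np.mean(audio ** 2)
    if power == 0:
        return audio.copy()  # silent input: nothing to scale
    return audio * np.sqrt(target_power / power)


def jitter_set(audio, shifts):
    """Return the unshifted audio plus one copy per non-zero shift.

    Shifts are in samples; positive values delay the signal and
    negative values advance it. Vacated samples are zero-filled.
    """
    result = [audio.copy()]
    for s in shifts:
        if s == 0:
            continue
        shifted = np.zeros_like(audio)
        if s > 0:
            shifted[s:] = audio[:-s]
        else:
            shifted[:s] = audio[-s:]
        result.append(shifted)
    return result


def spectrogram_frames(audio, frame_len=320, hop=160):
    """Split audio into overlapping windowed frames and return power spectra."""
    n_frames = 1 + (len(audio) - frame_len) // hop
    window = np.hanning(frame_len)
    frames = np.stack(
        [audio[i * hop : i * hop + frame_len] * window for i in range(n_frames)]
    )
    return np.abs(np.fft.rfft(frames, axis=1)) ** 2


# Example: a 0.1 s, 440 Hz tone sampled at 16 kHz (illustrative values only).
t = np.arange(1600) / 16000.0
audio = np.sin(2 * np.pi * 440.0 * t)
normalized = normalize_power(audio, target_power=0.01)
variants = jitter_set(normalized, shifts=[-80, 80])  # +/- 5 ms at 16 kHz
all_frames = [spectrogram_frames(v) for v in variants]
```

Each entry of `all_frames` would then be fed to the trained network, and the per-file character probabilities combined before decoding. How they are combined is not stated in this claim, so the sketch stops at the framing step.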
