×

Aligning a transcript to audio data

  • US 8,719,024 B1
  • Filed: 03/05/2012
  • Issued: 05/06/2014
  • Est. Priority Date: 09/25/2008
  • Status: Active Grant
First Claim
Patent Images

1. A method, comprising:

  • receiving audio data and a textual transcript of the audio data to be aligned with the audio data;

    generating, from the textual transcript, a language model that represents a set of particular substrings of the textual transcript, the language model comprising allowed states of the language model and one or more transitions that link the allowed states;

    receiving, from a speech recognizer, recognized language elements from the received audio data and times at which the recognized language elements occur in the audio data;

    comparing the recognized language elements from the audio data to substrings represented by the language model to identify times at which particular ones of the substrings occur in the audio data;

    aligning a portion of the textual transcript with a portion of the audio data using the identified times; and

    outputting the aligned portion of the textual transcript.

View all claims
  • 3 Assignments
Timeline View
Assignment View
    ×
    ×