Aligning a transcript to audio data

  • US 8,131,545 B1
  • Filed: 09/25/2008
  • Issued: 03/06/2012
  • Est. Priority Date: 09/25/2008
  • Status: Active Grant
First Claim

1. A computer-implemented method, comprising:

    receiving audio data and a transcript of the audio data;

    generating, from the transcript, a language model comprising a factor automaton that includes automaton states and arcs, each of the automaton arcs corresponding to a language element from the transcript;

    receiving, from a speech recognizer, language elements recognized from the received audio data and times at which the recognized language elements occur in the audio data;

    comparing the recognized language elements from the audio data to one or more of the language elements from the transcript and represented by the factor automaton to identify times at which the one or more of language elements from the transcript occur in the audio data;

    aligning a portion of the transcript with a portion of the audio data using the identified times; and

    outputting the aligned portion of the transcript.
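
The steps recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation: the function names `build_factor_automaton` and `align`, the `(word, start_time)` recognizer output format, and the choice to keep only the longest matching run are all assumptions made for illustration. The automaton here is the simplest nondeterministic form, in which every state is both initial and final, so it accepts exactly the contiguous word sequences of the transcript.

```python
def build_factor_automaton(transcript_words):
    """Index the arcs of a simple nondeterministic factor automaton.

    State i sits just before transcript word i, and the arc i -> i + 1
    is labeled with that word. Because every state is treated as both
    initial and final, any contiguous run of transcript words is accepted.
    Returns a dict mapping each word to the source states of its arcs.
    """
    arcs = {}
    for state, word in enumerate(transcript_words):
        arcs.setdefault(word, []).append(state)
    return arcs


def align(transcript_words, recognized):
    """Align the longest recognized run that the automaton accepts.

    `recognized` is a list of (word, start_time) pairs, an assumed
    format for the speech recognizer's output.
    Returns a dict mapping transcript word index -> start time.
    """
    arcs = build_factor_automaton(transcript_words)
    active = {}          # end state -> length of the path reaching it
    best = (0, -1, -1)   # (run length, recognized index, end state)

    for j, (word, _time) in enumerate(recognized):
        next_active = {}
        for src in arcs.get(word, ()):
            # Extend a path that ended at src, or start a new one there.
            length = active.get(src, 0) + 1
            next_active[src + 1] = length
            if length > best[0]:
                best = (length, j, src + 1)
        active = next_active

    length, last_rec, end_state = best
    alignment = {}
    for k in range(length):
        transcript_idx = end_state - length + k
        rec_idx = last_rec - length + 1 + k
        alignment[transcript_idx] = recognized[rec_idx][1]
    return alignment


transcript = "the quick brown fox jumps".split()
# Hypothetical recognizer output with two misrecognized words.
recognized = [("a", 0.0), ("quick", 0.4), ("brown", 0.7),
              ("fox", 1.1), ("jumped", 1.5)]
aligned = align(transcript, recognized)
for idx in sorted(aligned):
    print(transcript[idx], aligned[idx])
```

In this example the recognizer gets the first and last words wrong, so only "quick", "brown" and "fox" receive times. A fuller aligner would presumably keep several matched runs and interpolate times for the unmatched words between them.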
