Method for aligning text with audio signals

  • US 6,076,059 A
  • Filed: 08/29/1997
  • Issued: 06/13/2000
  • Est. Priority Date: 08/29/1997
  • Status: Expired due to Fees
First Claim

1. A computerized method for aligning text segments of a text file with audio segments of an audio file, comprising the steps of:

    generating a vocabulary and language model from the text file, generation of said model involving determination of relative probabilities of all one, two, and three word sequences in all unaligned text segments of the text file based upon frequencies of occurrences of said sequences in said unaligned text segments, all of said text segments being initially classified as unaligned text segments;

    recognizing a word list from the audio segments using the vocabulary and language model but without considering the text file;

    aligning the word list with the text segments based upon respective scores for all possible alignments of words in the word list with the text segments, each respective score being weighted to increase each respective score by a relatively greater amount if a respective alignment associated with the respective score involves relatively longer sequences of correctly aligned words;

    choosing corresponding anchors in the word list and text segments in accordance with the respective scores;

    partitioning the text and the audio segments into unaligned and aligned text and audio segments according to the anchors; and

    repeating the generating, recognizing, aligning, choosing, and partitioning steps with the unaligned text and audio segments until a termination condition is reached.
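
The steps of claim 1 can be illustrated with a rough Python sketch. It is not the patented implementation: the function names (`ngram_probabilities`, `choose_anchors`, `partition`) and the `min_run` threshold are hypothetical, the speech recognition step is omitted (the recognized word list is simply passed in), and Python's `difflib.SequenceMatcher` stands in for the claim's scored alignment, with only matching runs of at least `min_run` words kept as anchors to loosely mimic the preference for longer runs of correctly aligned words.

```python
from collections import Counter
from difflib import SequenceMatcher


def ngram_probabilities(words, max_n=3):
    """Relative frequencies of all 1-, 2-, and 3-word sequences in `words`."""
    probs = {}
    for n in range(1, max_n + 1):
        grams = [tuple(words[i:i + n]) for i in range(len(words) - n + 1)]
        counts = Counter(grams)
        total = sum(counts.values())
        for gram, count in counts.items():
            # Probability relative to all sequences of the same length.
            probs[gram] = count / total
    return probs


def choose_anchors(text_words, recognized_words, min_run=2):
    """Return (text_index, recognized_index, length) for each matching run.

    Assumption: SequenceMatcher replaces the claim's alignment scoring, and
    `min_run` is an illustrative cutoff that discards short matches.
    """
    matcher = SequenceMatcher(a=text_words, b=recognized_words, autojunk=False)
    return [
        (m.a, m.b, m.size)
        for m in matcher.get_matching_blocks()
        if m.size >= min_run
    ]


def partition(text_words, anchors):
    """Split the text into aligned runs (anchors) and the unaligned gaps between them."""
    aligned, unaligned, cursor = [], [], 0
    for start, _, size in anchors:
        if start > cursor:
            unaligned.append(text_words[cursor:start])
        aligned.append(text_words[start:start + size])
        cursor = start + size
    if cursor < len(text_words):
        unaligned.append(text_words[cursor:])
    return aligned, unaligned


text = "the quick brown fox jumps over the lazy dog".split()
recognized = "the quick brown box jumps over a lazy dog".split()  # pretend recognizer output

model = ngram_probabilities(text)
anchors = choose_anchors(text, recognized)
aligned, unaligned = partition(text, anchors)
```

In the claimed method, the unaligned segments would then be fed back through the generating, recognizing, aligning, choosing, and partitioning steps until a termination condition is reached; the sketch performs a single pass only.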
