×

Method and system using separate context and constituent probabilities for speech recognition in languages with compound words

  • US 5,797,122 A
  • Filed: 11/18/1996
  • Issued: 08/18/1998
  • Est. Priority Date: 03/20/1995
  • Status: Expired due to Fees
First Claim
Patent Images

1. A method for speech recognition in languages with compound words, comprising the following steps:

  • storing phonetic transcriptions of words and components of compound words in a first storage area,calculating n-gram frequencies (language model) for the probability of a compound word within a sequence of N words with use of a previously processed body of text, and storing the frequencies in a second storage area;

    recording and digitizing the acoustic speech signal and storing the digitized speech signal in a third storage area, wherein by means of signal processing based on the phonetic transcriptions, approximately determining the words and boundaries of compound words and deriving hypothetical sequences of words or candidates for compound words therefrom;

    establishing separate processing paths for sequences of candidates for words and compound words;

    statistically evaluating the processing paths by means of the n-gram frequencies, where likelihood profiles are generated from the sequence of n-gram frequencies of words or components of compound words of each processing path; and

    fully evaluating the processing paths with regard to the goodness of acoustic fit and the statistical probability of the language model.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×