×

Apparatus and method for forming a filtered inflected language model for automatic speech recognition

  • US 6,073,091 A
  • Filed: 08/06/1997
  • Issued: 06/06/2000
  • Est. Priority Date: 08/06/1997
  • Status: Expired due to Fees
First Claim
Patent Images

1. A method of forming a language model for a language having a selected vocabulary of word forms, the method comprising the steps of:

  • (a) mapping the word forms into integer vectors in accordance with frequencies of word form occurrence;

    (b) partitioning the integer vectors into subsets, the subsets respectively having ranges of frequencies of word form occurrence associated therewith, the subsets being arranged in a descending order of ranges;

    (c) respectively assigning maps to the subsets;

    (d) filtering a textual corpora using the maps assigned to the subsets in order to generate indexed integers;

    (e) determining n-gram statistics for the indexed integers;

    (f) estimating n-gram language model probabilities from the n-gram statistics to form the language model; and

    (g) determining a probability of a word sequence uttered by a speaker, using said language model.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×