×

Method and apparatus for adaptively generating field of application dependent language models for use in intelligent systems

  • US 5,444,617 A
  • Filed: 12/14/1993
  • Issued: 08/22/1995
  • Est. Priority Date: 12/17/1992
  • Status: Expired due to Fees
First Claim
Patent Images

1. An improved method for constructing a target field dependent model in the form of a decision tree for an intelligent machine, the operation of said machine is based on statistical approaches for converting input data from a source type of information into a target type of information using said decision tree, said method including:

  • storing in a data base a set of application field dependent files including words and symbols, thereby constituting a corpus;

    performing a vocabulary selection by deriving from said corpus, a list of most frequent words and symbols;

    scanning said words and symbols, and deriving therefrom a plurality of frequencies of occurrence of n-grams, which are sequences of a predefined number "n" of words and symbols, and storing said plurality of frequencies into an n-grams table;

    constructing said decision tree by;

    a) putting all selected vocabulary words and symbols into a first unique class C, said class initially constituting the only element of a set of classes;

    then,b) splitting each class of said set of classes into two subclasses C1 and C2, and assigning, through an iterative process, each word and symbol to one of said subclasses C1 and C2, based on the plurality of frequencies in said n-grams table;

    c) computing for each subclass C1 and C2 word and symbol "x", a distance d1 and a distance d2 relative to each subclass C1 and C2, respectively, wherein said distances d1 and d2 are derived as follows;

    ##EQU11## wherein V is the number of words in the vocabulary, and ##EQU12## wherein C is a counter of all n-grams among x1, . . . Xn-1, y and where the summation is taken over all contexts (x1 . . . xn-1) such that xj =x, and NTotal is the size of the class to be partitioned, ##EQU13## the summation in the numerator being taken over all contexts (x1 . . . xn-1), where xj belongs to C1 ; and

    ,the summation in the denominator being taken over all contexts where xj belongs to C1 and over all possible values of z from 0 to V-1, ##EQU14## the summation in the numerator being taken over all contexts (x1 . . . xn-1), where xj belongs to C2 ; and

    ,the summation in the denominator being taken over all contexts (x1 . . . xn-1) where xj belongs to C2 and over all possible values of z from 0 to V-1;

    
    
    space="preserve" listing-type="equation">Φ

    [p]=Log.sub.2 p if p>

    ε

    
    
    space="preserve" listing-type="equation">Φ

    [p]=(p/ε

    )-1+Log.sub.2 (ε

    ) if p<

    ε

    with ε

    =[min p(x,y)]2where the minimum is taken over all non-zero values of p(x,y), in which case,
    
    
    space="preserve" listing-type="equation">Φ

    []= 2Log [min p(x,y)]-1d) reclassifying "x" based on the shorter distance of d1 and d2 ; and

    e) testing each subclass C1 and C2 and deciding based on a predefined criteria, whether each class of the set of classes should be split any further; and

    , in case of any further split requirement, repeating said steps b) through e) thus increasing the number of elements in said set of classes.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×