Data classification apparatus and method thereof

  • US 7,072,873 B1
  • Filed: 11/09/1999
  • Issued: 07/04/2006
  • Est. Priority Date: 11/09/1998
  • Status: Expired due to Fees
First Claim

1. A data classification apparatus comprising:

  • an input device for receiving a plurality of training classified examples and at least one unclassified example;

    a memory for storing said classified and unclassified examples;

    an output terminal for outputting a predicted classification for said at least one unclassified example; and

    a processor for identifying the predicted classification of said at least one unclassified example, wherein the processor includes:

    classification allocation means for allocating potential classifications to each said unclassified example and for generating a plurality of classification sets, each said classification set containing said plurality (l) of training classified examples with their classification and said at least one unclassified example (l+1) with its said allocated potential classification;

    assay means including an example valuation device which determines individual strangeness values ($\alpha_i$) for each said training classified example ($i = 1, 2, \ldots, l$) and said at least one unclassified example ($i = l+1$) having an allocated potential classification ($y$), the assay means determining a single strangeness value ($d(y)$) valid under the independently and identically distributed assumption for each said classification set in dependence on said individual strangeness values ($\alpha_i$) of each example by the formula

    $$d(y) = \frac{\left|\{\, i : \alpha_i \ge \alpha_{l+1} \,\}\right|}{l+1}, \quad \text{where } i = 1, 2, \ldots, l, l+1;$$

    a comparative device for selecting the classification set to which the most likely allocated potential classification for said at least one unclassified example belongs, wherein said predicted classification output by the output terminal is said most likely allocated classification according to said single strangeness values assigned by said assay means; and

    a strength of prediction monitoring device for determining a confidence value for said predicted classification on the basis of said single strangeness value assigned by said assay means to one of said classification sets to which the second most likely allocated potential classification of said at least one unclassified example belongs.
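
The procedure recited in the claim can be sketched in a few lines of Python. This is an illustrative sketch only, not the patented implementation. The claim does not fix how the individual strangeness values are computed, so the `strangeness` function here (distance of a one-dimensional example from the mean of the examples sharing its label) is a hypothetical stand-in. Taking the confidence as one minus the second-largest d(y) is likewise an assumption: the claim says only that the confidence is based on that value. The names `p_value`, `strangeness`, and `classify` are invented for this sketch.

```python
def p_value(alphas: list[float]) -> float:
    """d(y) = |{i : alpha_i >= alpha_{l+1}}| / (l+1).

    The last entry of `alphas` belongs to the unclassified example (i = l+1).
    """
    last = alphas[-1]
    return sum(1 for a in alphas if a >= last) / len(alphas)


def strangeness(examples: list[tuple[float, str]]) -> list[float]:
    """Hypothetical strangeness measure (an assumption, not from the claim):
    distance of each example from the mean of all examples with its label."""
    alphas = []
    for x, y in examples:
        same_label = [xj for xj, yj in examples if yj == y]
        alphas.append(abs(x - sum(same_label) / len(same_label)))
    return alphas


def classify(training: list[tuple[float, str]], x_new: float, labels: list[str]):
    """Return (predicted label, confidence, d(y) per candidate label).

    Requires at least two candidate labels.
    """
    d = {}
    for y in labels:
        # Classification set: the l training examples plus the new
        # example with the allocated potential classification y.
        classification_set = training + [(x_new, y)]
        d[y] = p_value(strangeness(classification_set))
    ranked = sorted(d, key=d.get, reverse=True)
    predicted = ranked[0]
    # Assumed confidence rule: 1 minus d(y) of the second most likely label.
    confidence = 1.0 - d[ranked[1]]
    return predicted, confidence, d


training = [(0.0, "a"), (1.0, "a"), (2.0, "a"),
            (10.0, "b"), (11.0, "b"), (12.0, "b")]
predicted, confidence, d = classify(training, 1.5, ["a", "b"])
print(predicted, confidence, d)
```

With this toy data the new example at 1.5 sits among the "a" examples, so allocating it the label "b" makes it the strangest member of that classification set and d("b") falls to 1/7, the smallest value the formula can produce for seven examples.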
