×

Method and apparatus for auditing training supersets

  • US 7,107,266 B1
  • Filed: 10/24/2001
  • Issued: 09/12/2006
  • Est. Priority Date: 11/09/2000
  • Status: Expired due to Term
First Claim
Patent Images

1. A computer assisted method of auditing a superset of training data, the superset comprising examples of documents having one or more preexisting category assignments, the method including:

  • partitioning the superset into at least two disjoint sets, including a test set and a training set, wherein the test set includes one or more test documents and the training set includes examples of documents belonging to at least two categories;

    automatically categorizing the test documents using the training set;

    calculating a metric of confidence based on results of the categorizing step and comparing the automatic category assignments for the test documents to the preexisting category assignments; and

    reporting the test documents and preexisting category assignments that are suspicious and the automatic category assignments that appear to be missing from the test documents, based on the metric of confidence.

View all claims
  • 4 Assignments
Timeline View
Assignment View
    ×
    ×