×

Method for transforming data elements within a classification system based in part on input from a human annotator/expert

  • US 7,756,800 B2
  • Filed: 12/14/2006
  • Issued: 07/13/2010
  • Est. Priority Date: 12/14/2006
  • Status: Expired due to Fees
First Claim
Patent Images

1. A method for transforming computer stored records comprised of at least one data item in a document in a data collection or an element instance within a document into a classification system, the method comprising:

  • retrieving documents stored as computer records;

    associating with any element instance within the data item in the document a set of distinctive features indicative of accurate identification and classification of the element instance;

    providing a training set comprising a subset of elements within the document labeled with class labels;

    providing a learning method comprised of a basic learning algorithm and also trained with annotated elements to produce a result of predicting class labels to be assigned to unlabeled elements;

    identifying an element instance within the unlabeled elements including predicting a class of selected element instances;

    computing a confidence factor that a selected element instance is accurately identified by the predicted class;

    for a selected element instance having a confidence factor less than zero, querying a concept evolution command emitting human annotator/expert for a true class label of the instance;

    the querying comprising the annotator/expert generating a concept evolution command for redefining the unlabeled element wherein the generating the concept evolution command comprises adjusting at least one associated feature of the training set in an incremental manner;

    the adjusting at least one associated feature of the training set comprising a local approach concept evolution comprising associating the local model for each evolution event including a concept evolution command wherein the associating local model comprises the maintenance of the concept evolution directed acyclic graph (DAG) and corresponding an event model to an internal node of the concept evolution DAG; and

    extending the training set to include an as yet not included true labeled instance and iterating the identifying and computing for another element instance with a low confidence factor;

    wherein the learning model comprises a global approach to reshape a list of the classes and extends the set of features, or wherein the learning method comprises a local approach that creates a local model of one or few events, the definition set of classes remains unchanged, and the training set can be extended with new examples.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×