×

SYSTEM AND METHOD TO EXTRACT MODELS FROM SEMI-STRUCTURED DOCUMENTS

  • US 20120078969A1
  • Filed: 09/24/2010
  • Published: 03/29/2012
  • Est. Priority Date: 09/24/2010
  • Status: Active Grant
First Claim
Patent Images

1. A method for producing a global model describing a collection of documents comprising:

  • accessing a collection of documents, the collection of documents comprising labeled documents and unlabeled documents;

    receiving input identifying indicative words for classifications;

    generating a classification model;

    classifying documents of the collection of documents to produce classified documents of one or more types;

    extracting concepts from the classified documents;

    generating a global model from the concepts; and

    outputting the global model.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×