×

Secure information classification

  • US 8,751,424 B1
  • Filed: 12/15/2011
  • Issued: 06/10/2014
  • Est. Priority Date: 12/15/2011
  • Status: Active Grant
First Claim
Patent Images

1. A method for using a system to manage documents sensitive or classified content with a predetermined classifier threshold, comprising:

  • (a) extracting, from a security policy guide or other informal set of rules, a list of text features;

    (b) enabling interaction with a user configuring the system to create a rule-based classifier based on the list of text features and one or more synonymous features that capture sensitive or classified information in the security policy guide or the other informal set of rules;

    (c) applying the rule-based classifier to one or more selected documents to tag a set of documents with the sensitive or classified information they contain to generate tagged documents;

    (d) training a statistical text classifier using the tagged documents generated in (c) as a training set;

    (e) applying the statistical text classifier to the training set to suggest additional documents that should be tagged and to generate additional text features for detecting the sensitive or classified information;

    (f) providing the additional documents and the additional text features to a user interface for review and comparison by the user to update the training set and the list of text features and the one or more synonymous features;

    (g) refining the rule-based classifier based on the training set, the list of text features, and the one or more synonymous features generated in (f); and

    (h) repeating operations (b) through (g) until a classification scheme satisfies the predetermined classifier threshold.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×