×

SYSTEMS AND METHODS FOR GENERATING MACHINE LEARNING-BASED CLASSIFIERS FOR DETECTING SPECIFIC CATEGORIES OF SENSITIVE INFORMATION

  • US 20120303558A1
  • Filed: 07/26/2011
  • Published: 11/29/2012
  • Est. Priority Date: 05/23/2011
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented method for generating machine learning-based classifiers for detecting specific categories of sensitive information, at least a portion of the method being performed by a computing device comprising at least one processor, the method comprising:

  • identifying a plurality of specific categories of sensitive information to be protected by a data loss prevention (DLP) system;

    obtaining a training data set for each specific category of sensitive information that comprises a plurality of positive examples of data that fall within the specific category of sensitive information and a plurality of negative examples of data that do not fall within the specific category of sensitive information;

    using machine learning to train, based on an analysis of the training data sets, at least one machine learning-based classifier that is configured to detect items of data that contain one or more of the plurality of specific categories of sensitive information;

    deploying the machine learning-based classifier within the DLP system to enable the DLP system to detect and protect items of data that contain one or more of the plurality of specific categories of sensitive information in accordance with at least one DLP policy of the DLP system.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×