MANAGING THE CREATION, DETECTION, AND MAINTENANCE OF SENSITIVE INFORMATION
Abstract
A method, information processing system, and computer program storage product for managing information within an electronic file are provided. A plurality of information sets within an electronic file is analyzed. At least one of the information sets is compared to at least one statistical classification model. The statistical classification model includes one or more probabilities associated with a plurality of analyzed information sets that indicate a likelihood that a respective analyzed information set is classified sensitive information. The at least one information set is determined to substantially match at least one analyzed information set in the statistical classification model. The probability associated with the at least one analyzed information set is determined to be above a threshold. The at least one information set is classified as sensitive information in response to determining that the probability is above the threshold.
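The flow summarized in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the model contents, the use of `difflib.SequenceMatcher` as the "substantially matches" test, the `min_similarity` cutoff of 0.8, and the probability threshold of 0.7 are all assumptions chosen for illustration, and the names `MODEL`, `best_match`, and `classify` are hypothetical.

```python
from difflib import SequenceMatcher

# Hypothetical statistical classification model: each previously analyzed
# information set maps to the probability that it is sensitive.
# The entries and values are illustrative only.
MODEL = {
    "social security number": 0.95,
    "quarterly revenue forecast": 0.80,
    "lunch menu": 0.05,
}


def best_match(info_set, model, min_similarity=0.8):
    """Return (analyzed_set, probability) for the closest model entry,
    or None when no entry is similar enough to count as a match.

    String similarity stands in for "substantially matches" here;
    this is an assumption, not the definition used in the source.
    """
    best, best_score = None, 0.0
    for analyzed, probability in model.items():
        score = SequenceMatcher(None, info_set.lower(), analyzed).ratio()
        if score > best_score:
            best, best_score = (analyzed, probability), score
    return best if best_score >= min_similarity else None


def classify(info_sets, model, threshold=0.7):
    """Return the information sets classified as sensitive.

    An information set is sensitive when it matches an analyzed set
    whose associated probability is above the threshold.
    """
    sensitive = []
    for info_set in info_sets:
        match = best_match(info_set, model)
        if match is not None and match[1] > threshold:
            sensitive.append(info_set)
    return sensitive
```

Calling `classify(["Social Security Number", "Lunch menu", "weather report"], MODEL)` returns only the first item: the second matches a low-probability entry, and the third matches nothing in the model closely enough.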
109 Citations
20 Claims
1. A method for managing information within an electronic file, the method comprising:
analyzing a plurality of information sets within an electronic file;
comparing at least one of the information sets in the plurality of information sets to at least one statistical classification model, wherein the statistical classification model includes one or more probabilities associated with a plurality of analyzed information sets that indicate a likelihood that a respective analyzed information set is classified sensitive information;
determining that the at least one information set substantially matches at least one analyzed information set in the statistical classification model;
determining that a probability associated with the at least one analyzed information set is above a given threshold; and
classifying, in response to determining that the probability associated with the at least one analyzed information set is above a given threshold, the at least one information set as sensitive information.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10
11. An information processing system for managing information within an electronic file, the information processing system comprising:
a memory;
a processor communicatively coupled to the memory; and
an information manager communicatively coupled to the memory and the processor, wherein the information manager is adapted to:
analyze a plurality of information sets within an electronic file;
compare at least one of the information sets in the plurality of information sets to at least one statistical classification model, wherein the statistical classification model includes one or more probabilities associated with a plurality of analyzed information sets that indicate a likelihood that a respective analyzed information set is classified sensitive information;
determine that the at least one information set substantially matches at least one analyzed information set in the statistical classification model;
determine that a probability associated with the at least one analyzed information set is above a given threshold; and
classify, in response to the probability associated with the at least one analyzed information set being above a given threshold, the at least one information set as sensitive information.
Dependent claims: 12, 13, 14, 15
16. A computer program storage product for managing information within an electronic file, the computer program storage product comprising instructions for:
analyzing a plurality of information sets within an electronic file;
comparing at least one of the information sets in the plurality of information sets to at least one statistical classification model, wherein the statistical classification model includes one or more probabilities associated with a plurality of analyzed information sets that indicate a likelihood that a respective analyzed information set is classified sensitive information;
determining that the at least one information set substantially matches at least one analyzed information set in the statistical classification model;
determining that a probability associated with the at least one analyzed information set is above a given threshold; and
classifying, in response to determining that the probability associated with the at least one analyzed information set is above a given threshold, the at least one information set as sensitive information.
Dependent claims: 17, 18, 19, 20
Specification