Strategies for sanitizing data items
First Claim
1. A method for sanitizing restricted data items in a data set to prevent the revelation of the restricted data items, comprising:
- transferring an original data set from a production environment to a sanitizer, the original data set characterized by a state;
sanitizing the original data set using the sanitizer, while preserving the state of the original data set, comprising;
identifying the locations of the restricted data items in the original data set;
identifying at least one sanitizing tool to apply to the restricted data items which have been located in the original data set; and
applying said at least one sanitizing tool to the restricted data items which have been located in the original data set to provide a sanitized data set; and
forwarding the sanitized data set to a target environment.
2 Assignments
0 Petitions
Accused Products
Abstract
Strategies are described for sanitizing a data set, having the effect of obscuring restricted data in the data set to maintain its secrecy. The strategies operate by providing a production data set to a sanitizer. The sanitizer applies a data directory table to identify the location of restricted data items in the data set and to identify the respective sanitization tools to be applied to the restricted data items. The sanitizer then applies the identified sanitization tools to the identified restricted data items to produce a sanitized data set. A test environment receives the sanitized data set and performs testing, data mining, or some other application on the basis of the sanitized data set. Performing sanitization on a sanitized version of the production data set is advantageous because it preserves the state of the production data set. The data directory table also provides a flexible mechanism for applying sanitization tools to the production data set.
94 Citations
25 Claims
-
1. A method for sanitizing restricted data items in a data set to prevent the revelation of the restricted data items, comprising:
-
transferring an original data set from a production environment to a sanitizer, the original data set characterized by a state;
sanitizing the original data set using the sanitizer, while preserving the state of the original data set, comprising;
identifying the locations of the restricted data items in the original data set;
identifying at least one sanitizing tool to apply to the restricted data items which have been located in the original data set; and
applying said at least one sanitizing tool to the restricted data items which have been located in the original data set to provide a sanitized data set; and
forwarding the sanitized data set to a target environment. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15)
-
-
16. A system for sanitizing restricted data items in a data set to prevent the revelation of the restricted data items, comprising:
-
a production environment which relies on a production data set to perform its allotted functions;
a sanitizer configured to receive an original data set based on the production data set and to sanitize the original data set by;
(a) identifying the locations of the restricted data items in the original data set;
(b) identifying at least one sanitizing tool to apply to the restricted data items which have been located in the original data set; and
(c) applying said at least one sanitizing tool to the restricted data items which have been located in the original data set to provide a sanitized data set; and
a target environment configured to receive the sanitized data set, wherein the sanitizing preserves a state of the original data set.
-
-
17. A sanitizer for sanitizing restricted data items in a data set to prevent the revelation of the restricted data items, comprising:
-
a sanitizing module configured to receive an original data set based on a production data set used in a production environment, and to sanitize the original data set by;
(a) identifying the locations of the restricted data items in the original data set;
(b) identifying at least one sanitizing tool to apply to the restricted data items which have been located in the original data set; and
(c) applying said at least one sanitizing tool to the restricted data items which have been located in the original data set to provide a sanitized data set,wherein the sanitizing module is configured to sanitize the original data set while preserving a state of the original data set. - View Dependent Claims (18, 19, 20, 21, 22, 23, 24, 25)
-
Specification