Systems and methods for classifying and transferring information in a storage network
First Claim
Patent Images
1. A method for accessing data associated with a first computing device using a second computing device, comprising:
- accessing data associated with a first computing device;
receiving metadata associated with the accessed data, wherein the received metadata includes metadata other than file system metadata and other than metadata identifying logical locations of the data;
storing the received metadata at a second computing device that is distinct from the first computing device;
analyzing the metadata stored at the second computing device to identify a set of the accessed data, wherein the set of the accessed data is less than the accessed data;
based on the analysis of the metadata, associating one or more data classifications with a subset of the accessed data, wherein—
the subset of accessed data is less than the set of accessed data, andthe one or more data classifications describe characteristics of the accessed data,receiving a data management request, wherein the data management request includes one or more criteria and comprises a request to—
identify the subset of the accessed data based on the one or more data classifications, andperform a data storage operation on the identified subset of the accessed data; and
,performing the data storage operation requested by the data management request on the identified subset of the accessed data.
4 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods for data classification to facilitate and improve data management within an enterprise are described. The disclosed systems and methods evaluate and define data management operations based on data characteristics rather than data location, among other things. Also provided are methods for generating a data structure of metadata that describes system data and storage operations. This data structure may be consulted to determine changes in system data rather than scanning the data files themselves.
-
Citations
18 Claims
-
1. A method for accessing data associated with a first computing device using a second computing device, comprising:
-
accessing data associated with a first computing device; receiving metadata associated with the accessed data, wherein the received metadata includes metadata other than file system metadata and other than metadata identifying logical locations of the data; storing the received metadata at a second computing device that is distinct from the first computing device; analyzing the metadata stored at the second computing device to identify a set of the accessed data, wherein the set of the accessed data is less than the accessed data; based on the analysis of the metadata, associating one or more data classifications with a subset of the accessed data, wherein— the subset of accessed data is less than the set of accessed data, and the one or more data classifications describe characteristics of the accessed data, receiving a data management request, wherein the data management request includes one or more criteria and comprises a request to— identify the subset of the accessed data based on the one or more data classifications, and perform a data storage operation on the identified subset of the accessed data; and
,performing the data storage operation requested by the data management request on the identified subset of the accessed data. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A system to perform a data management operation on data stored at a first computing device using a second computing device, comprising:
-
a processor; a data discovery component configured to discover data stored on the first computing device, wherein metadata is associated with the discovered data; a data classification component configured to classify the discovered data with one or more classifications that identify characteristics of the discovered data, wherein the classification component is further configured to— analyze the metadata to identify a set of the discovered data, wherein the set of data includes less than the discovered data; based on the analysis of the metadata, associate one or more data classifications with a subset of the discovered data, wherein— the subset of data includes less than the set of data, and the metadata includes metadata other then file system metadata and other than metadata identifying logical locations of the discovered data; a metadata storing component configured to store the metadata and the classifications on the second computing device; and a data management operation component configured to perform a data management operation that first identifies data, and then performs an action on the identified data, wherein the data management operation component reduces a number of accesses of the first computing device by using the classifications stored within the second computing device to identify data related to the data management operation without accessing the first computing device. - View Dependent Claims (13, 14, 15)
-
-
16. A computer-readable storage medium encoded with instructions for accessing data stored at a first data processing system using a second data processing system, wherein the instructions when executed by a processor cause the processor to perform a method comprising:
-
identifying data stored on the first data processing system; categorizing the identified data to create one or more data categorizations that specify information about the identified data for finding the data in response to a data management request, wherein the one or more data categorizations include descriptive data other than file system metadata and other than metadata identifying logical locations of the identified data, wherein the categorizing includes— analyzing metadata associated with the identified data to identify a set of the identified data, wherein the set of data is less than the identified data, and, based on the analysis of the metadata, associating the one or more data categorizations with a subset of the identified data, wherein the subset of data is less than the set of data; storing the descriptive data on the second data processing system; and analyzing the descriptive data at the second data processing system to find data in response to a data management request, wherein an analysis regarding the data at the first data processing system is performed without accessing the data at the first data processing system. - View Dependent Claims (17, 18)
-
Specification