Systems and methods for classifying and transferring information in a storage network
First Claim
Patent Images
1. A computer-implemented method for accessing data through a distributed metabase, comprising:
- receiving a request to access data stored on a storage device;
selecting at least one of multiple distributed metabases containing metadata related to the data stored on the storage device, wherein the metadata includes metadata other than file system metadata and other than metadata identifying logical locations of the data, and wherein the multiple metabases store metadata related to the data stored on the storage device, and wherein the selecting includes—
determining characteristics of the multiple distributed metabases,evaluating the characteristics against one or more criteria, andselecting the at least one metabase based on the evaluation;
analyzing the metadata stored at the at least one metabase to identify a set of data, wherein the set of data is less than the data stored on the storage device;
based on the analysis of the metadata, associating one or more data classifications with a subset of data, wherein—
the subset of data is less than the set of data, andthe one or more data classifications describe characteristics of at least some of the data stored on the storage device; and
,accessing the subset of data based on the metadata contained by the at least one selected metabase and the associated one or more data classifications.
4 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods for data classification to facilitate and improve data management within an enterprise are described. The disclosed systems and methods evaluate and define data management operations based on data characteristics rather than data location, among other things. Also provided are methods for generating a data structure of metadata that describes system data and storage operations. This data structure may be consulted to determine changes in system data rather than scanning the data files themselves.
257 Citations
22 Claims
-
1. A computer-implemented method for accessing data through a distributed metabase, comprising:
-
receiving a request to access data stored on a storage device; selecting at least one of multiple distributed metabases containing metadata related to the data stored on the storage device, wherein the metadata includes metadata other than file system metadata and other than metadata identifying logical locations of the data, and wherein the multiple metabases store metadata related to the data stored on the storage device, and wherein the selecting includes— determining characteristics of the multiple distributed metabases, evaluating the characteristics against one or more criteria, and selecting the at least one metabase based on the evaluation; analyzing the metadata stored at the at least one metabase to identify a set of data, wherein the set of data is less than the data stored on the storage device; based on the analysis of the metadata, associating one or more data classifications with a subset of data, wherein— the subset of data is less than the set of data, and the one or more data classifications describe characteristics of at least some of the data stored on the storage device; and
,accessing the subset of data based on the metadata contained by the at least one selected metabase and the associated one or more data classifications. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A computer-implemented method for accessing data through a distributed metabase, the method comprising:
-
receiving a request to access data stored on a storage device; selecting at least one of multiple distributed metabases based on at least one characteristic of the multiple metabases, wherein the metabases contain metadata associated with the data stored on the storage device, wherein the metadata includes metadata other than file system metadata and other than metadata identifying logical locations of the data, wherein the metadata facilitates performance of data storage operations on the data, and wherein the characteristic of each of the multiple metabases includes a past, current, or projected level of activity associated with each of the metabases; analyzing the metadata stored at the at least one metabase to identify a set of data, wherein the set of data is less than the data stored on the storage device; based on the analysis of the metadata, associating one or more data classifications with a subset of data, wherein— the subset of data is less than the set of data, and the one or more data classifications describe characteristics of at least some of the data stored on the storage device; and
,accessing the subset of data based on the metadata contained by the at least one selected metabase and the associated one or more data classifications. - View Dependent Claims (14, 15, 16, 17)
-
-
18. A method for accessing data based on metadata stored in a network, the method comprising:
-
receiving multiple requests to access data stored on at least one of multiple storage devices accessible via the network; selecting at least one of multiple distributed metabases containing metadata related to data stored on the multiple storage devices, wherein the metadata includes metadata other than file system metadata and other than metadata identifying logical locations of the data, and wherein the classifications include information used to facilitate performance of a data storage operation on the data; sending at least one of the multiple requests to the at least one selected metabase, wherein sending the at least one request includes not sending at least some of the multiple requests to the multiple storage devices; and analyzing the metadata stored at the at least one selected metabase to identify a set of data, wherein the set of data is less than the data stored on the storage device; based on the analysis of the metadata, associating one or more data classifications with a subset of data, wherein— the subset of data is less than the set of data, and the one or more data classifications describe characteristics of at least some of the data stored on the storage device; and
,accessing the subset of data based on the metadata contained by the at least one selected metabase and the associated one or more data classifications.
-
-
19. A non-transitory computer-readable medium storing instruction, which when performed by a data processing device, perform a method for accessing data based on metadata stored in a network, the method comprising:
-
receiving multiple requests to access data stored on one or more storage devices accessible via the network; selecting at least one of multiple distributed metabases containing metadata related to the data stored on the one or more storage devices, wherein the metadata includes metadata other than file system metadata and other than metadata identifying logical locations of the data, and wherein the classifications include information used to facilitate performance of a data storage operation on the data; sending at least one of the multiple requests to the at least one selected metabase; and processing the metadata stored at the at least one selected metabase to identify a set of data, wherein the set of data is less than the data stored on the storage device; based on the processing of the metadata, associating one or more data classifications with a subset of data, wherein— the subset of data is less than the set of data, and the one or more data classifications describe characteristics of at least some of the data stored on the storage device; and
,accessing the subset of data based on the metadata contained by the at least one selected metabase and the associated one or more data classifications. - View Dependent Claims (20, 21, 22)
-
Specification