Metabase for facilitating data classification
Abstract
Systems and methods for managing electronic data are disclosed. Various data management operations can be performed based on a metabase formed from metadata. Such metadata can be identified from an index of data interactions generated by a journaling module, and obtained from their associated data objects stored in one or more storage devices. In various embodiments, such processing of the index and storing of the metadata can facilitate, for example, enhanced data management operations, enhanced data identification operations, enhanced storage operations, data classification for organizing and storing the metadata, cataloging of metadata for the stored metadata, and/or user interfaces for managing data. In various embodiments, the metabase can be configured in different ways. For example, the metabase can be stored separately from the data objects so as to allow obtaining of information about the data objects without accessing the data objects or a data structure used by a file system.
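The flow described in the abstract (an index of data interactions is processed, metadata is read from the touched data objects, and the result is kept apart from the file system) can be illustrated with a minimal sketch. Everything named here is hypothetical and not taken from the patent: `JournalEntry`, `build_metabase`, the choice of size and modification time as the metadata, and the use of a plain dictionary as the metabase are assumptions made only for illustration.

```python
import os
import tempfile
from dataclasses import dataclass


@dataclass
class JournalEntry:
    """One recorded data interaction (hypothetical journal format)."""
    path: str
    operation: str  # e.g. "read" or "write"


def build_metabase(journal):
    """Classify journal entries and collect metadata for each touched object.

    Assumption: only "read" and "write" interactions become metabase entries,
    and the metadata of interest is the object's size and modification time.
    """
    metabase = {}
    for entry in journal:
        if entry.operation not in ("read", "write"):
            continue  # not classified as a metabase entry in this sketch
        stat = os.stat(entry.path)  # access the data object once to read its metadata
        record = metabase.setdefault(entry.path, {"operations": set()})
        record["operations"].add(entry.operation)
        record["size"] = stat.st_size
        record["modified"] = stat.st_mtime
    return metabase


def demo():
    """Build a metabase for a temporary file, then query it after the file is gone."""
    with tempfile.TemporaryDirectory() as root:
        path = os.path.join(root, "report.txt")
        with open(path, "w") as handle:
            handle.write("hello")
        journal = [
            JournalEntry(path, "write"),
            JournalEntry(path, "read"),
            JournalEntry(path, "rename"),  # ignored by the classifier
        ]
        metabase = build_metabase(journal)
    # The directory has been deleted; the answer comes from the metabase alone.
    return metabase[path]["size"], sorted(metabase[path]["operations"])
```

In this sketch the final lookup is served entirely from the dictionary, which mirrors the abstract's point that information about data objects can be obtained without accessing the objects themselves.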
32 Claims
1. A method for managing electronic data in one or more storage devices, the method comprising:
- monitoring data interactions performed by a file system between a memory and at least one primary storage device, wherein the data interactions comprise at least read-only operations and write operations;
- receiving from the monitoring module an index of the data interactions made by the one or more applications to one or more data objects on the at least one primary storage device;
- classifying data interactions that comprise at least the read-only operations and write operations as entries for a metabase;
- accessing the data objects on the primary storage device associated with the classified data interactions to obtain metadata associated with the data interactions from the data objects; and
- storing in a first metabase the metadata obtained from the data objects on the primary storage device, wherein the first metabase is stored separately from the file system associated;
- migrating a plurality of data objects from the primary storage device to a secondary storage device and migrating at least a portion of the associated metadata in the first metabase to a second metabase, wherein the second metabase is stored separately from the first metabase and separately from the file system;
- retaining in the first metabase, metadata about the data objects migrated to the secondary storage device such that information about the data objects migrated to the secondary device can be obtained from either the first metabase or the second metabase without accessing the data objects on the secondary storage device via the file system.
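The migrating and retaining steps of claim 1 can be pictured with a short sketch. It is only one possible reading: the function names `migrate` and `lookup`, the dictionary-based metabases, and the decision to keep just the size and a location marker in the first metabase are all assumptions, not details stated in the claim.

```python
def migrate(paths, first_metabase, second_metabase):
    """Record a migration of data objects to secondary storage.

    Assumption: the full record is copied to the second metabase, while the
    first metabase retains a reduced record (size and location only).
    """
    for path in paths:
        record = first_metabase[path]
        second_metabase[path] = dict(record, location="secondary")
        first_metabase[path] = {"size": record["size"], "location": "secondary"}


def lookup(path, first_metabase, second_metabase):
    """Answer a query from either metabase without touching the stored object."""
    for metabase in (first_metabase, second_metabase):
        if path in metabase:
            return metabase[path]
    return None
```

Because a reduced record stays in the first metabase, a query about a migrated object can be answered from either store, and neither path goes through the file system.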
15. A system for managing electronic data in a network environment, the system comprising:
- at least one primary storage device configured to store one or more data objects;
- a journal file populated by a monitoring module, wherein the monitoring module is configured to monitor data interactions performed by a file system between a memory and the at least one primary storage device, the journal file comprising a list of data interactions between at least one application and at least one of the data objects, wherein the data interactions comprise read-only operations and write operations;
- a classification module executing in one or more computer processors, the classification module configured to process the journal file to classify data interactions with the data objects that comprise at least read-only operations and write operations as entries for a metabase, the classification module being further configured to obtain the metadata associated with the data objects from the data objects;
- a first metabase comprising a plurality of records, wherein the first metabase is stored separately from the file system, and wherein at least one record in the first metabase stores the metadata associated with the data objects stored on the primary storage device, such that information about the data objects can be obtained from the first metabase without accessing the data objects via the file system;
- at least one secondary storage device configured to store one or more data objects; and
- a second metabase that stores metadata associated with the migration of one or more of the data objects from the at least one primary storage device to the at least one secondary storage device, wherein the second metabase is stored separately from the first metabase and separately from the file system, wherein at least a portion of the metadata in the second metabase is retained in the first metabase, such that information about the data objects migrated to the secondary device can be obtained from either the first metabase or the second metabase without accessing the data objects on the secondary storage device via the file system.
Specification