Systems and methods for classifying and transferring information in a storage network
First Claim
1. A method for updating a centralized database of metadata in a network comprising multiple computing devices storing multiple data objects, wherein the method is performed by one or more computing systems, each computing system having a processor and memory, the method comprising:
- receiving at a first time from one or more of the multiple computing devices multiple entries for the centralized database of metadata,wherein an entry includes information identifying a data object and information identifying one or more data classifications associated with the data object, wherein the one or more data classifications are generated using one or more characteristics of the data object, andwherein one or more of the multiple computing devices had been disconnected from the network for a period of time prior to the first time;
determining one or more rules based on one or more policies to apply to the multiple entries, wherein the one or more policies provide an order of preference by which the multiple entries are to be integrated into the centralized database of metadata;
applying the one or more rules to the multiple entries;
based on the application of the one or more rules, after the period of time where the one or more of the multiple computing devices were disconnected from the network, determining by the one or more computing systems, the order of preference by which to integrate the multiple entries into the centralized database of metadata; and
integrating the multiple entries into the centralized database of metadata according to the determined order of preference, wherein integrating includes—
searching, within the centralized database of metadata, for metadata entries previously associated with a first data object;
if metadata entries previously associated with the first data object are not found, then;
adding metadata entries associated with the first data object to the centralized database of metadata;
orif metadata entries previously associated with the first data object are found, then;
updating the metadata entries in the centralized database of metadata with information associated with data classifications for the data objects based on the information identifying the first data object and information identifying the one or mere data classifications associated with the first data object.
4 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods for data classification to facilitate and improve data management within an enterprise are described. The disclosed systems and methods evaluate and define data management operations based on data characteristics rather than data location, among other things. Also provided are methods for generating a data structure of metadata that describes system data and storage operations. This data structure may be consulted to determine changes in system data rather than scanning the data files themselves.
-
Citations
20 Claims
-
1. A method for updating a centralized database of metadata in a network comprising multiple computing devices storing multiple data objects, wherein the method is performed by one or more computing systems, each computing system having a processor and memory, the method comprising:
-
receiving at a first time from one or more of the multiple computing devices multiple entries for the centralized database of metadata, wherein an entry includes information identifying a data object and information identifying one or more data classifications associated with the data object, wherein the one or more data classifications are generated using one or more characteristics of the data object, and wherein one or more of the multiple computing devices had been disconnected from the network for a period of time prior to the first time; determining one or more rules based on one or more policies to apply to the multiple entries, wherein the one or more policies provide an order of preference by which the multiple entries are to be integrated into the centralized database of metadata; applying the one or more rules to the multiple entries; based on the application of the one or more rules, after the period of time where the one or more of the multiple computing devices were disconnected from the network, determining by the one or more computing systems, the order of preference by which to integrate the multiple entries into the centralized database of metadata; and integrating the multiple entries into the centralized database of metadata according to the determined order of preference, wherein integrating includes— searching, within the centralized database of metadata, for metadata entries previously associated with a first data object; if metadata entries previously associated with the first data object are not found, then; adding metadata entries associated with the first data object to the centralized database of metadata;
orif metadata entries previously associated with the first data object are found, then; updating the metadata entries in the centralized database of metadata with information associated with data classifications for the data objects based on the information identifying the first data object and information identifying the one or mere data classifications associated with the first data object. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A computer system for updating a database of metadata associated with data objects stored in a data store, the computer system comprising:
-
a processor; a memory; a data object monitoring component, wherein the data object monitoring component is configured to monitor a data store containing data objects for changes to the data store; a data object classification component, in communication with the data object monitoring component, wherein the data object classification component is configured to; receive from the data object monitoring component information identifying multiple data objects; determine one or more characteristics of the multiple data objects; and generate, based on the one or more characteristics, one or more data classifications for the multiple data objects; a database search component, in communication with the data object classification component, wherein the database search component is configured to search a database for database entries associated with the multiple data objects, wherein the database entries store information identifying multiple data objects and one or more data classifications for the multiple data objects; and a metadata integration component in communication with the database object classification component, wherein the metadata generation component is configured to; determine multiple database entries, wherein the multiple database entries include the information identifying the multiple data objects and the one or more data classifications for the multiple data objects; receive one or more rules to apply to the multiple database entries, wherein the one or more rules provide an order of preference by which the multiple database entries are to be integrated into the apply the one or more rules to the multiple database entries; based on the application of the one or more rules, determine the order of preference by which to integrate the multiple database entries into the database; and integrate the multiple database entries into the database according to the determined order of preference Wherein the metadata integration component integrates the multiple database entries into the database beginning at a first time, and Wherein the database had been disconnected from the computer system for a period of time prior to the first time. - View Dependent Claims (11, 12, 13, 14, 15)
-
-
16. A computer system for updating a database of metadata associated with data stored in a data store, the computer system comprising:
-
a processor; a memory; a data monitoring component, wherein the data monitoring component is configured to monitor a data store containing data objects for changes to the data objects; a data classification component, in communication with the data monitoring component, wherein the data classification component is configured to; receive information from the data monitoring component that identifies multiple data objects; and determine information that describes one or more characteristics associated with the multiple data objects; and generate, based on the one or more characteristics, one or more data classifications for the multiple data objects; a metadata generation component in communication with the data classification component, wherein the metadata generation component is configured to generate multiple metadata entries for the multiple data objects based on the one or more data classifications associated with the multiple data objects; a metadata update component in communication with the metadata generation component, wherein the metadata update component is configured to; receive one or more rules to apply to the multiple metadata entries, wherein the one or more rules provide an order of preference by which the multiple metadata entries are to be integrated into the database of metadata; apply the one or more rules to the multiple metadata entries; based on the application of the one or more rules, determine the order of preference by which to integrate the multiple metadata entries into the database of metadata; and integrate the multiple metadata entries into the database of metadata according to the determined order of preference; and Wherein the metadata update component integrates the multiple metadata entries into the database of metadata beginning at a first time, and Wherein the database of metadata had been disconnected from the computer system for a period of time prior to the first time a data storage component, wherein the data storage component is configured to store a secondary copy of the data objects in a secondary data store along with a secondary copy of the multiple metadata entries generated by the metadata generation component. - View Dependent Claims (17)
-
-
18. A computer-readable storage medium whose stored contents cause a data storage system to perform a method for updating metadata in a metabase, the method comprising:
-
receiving an indication of changes made to data objects contained by a data store managed by the data storage system; and for data objects that have changed; identifying one or more characteristics of the changed data objects; based upon the identified one or more characteristics, generating one or more classifications for the changed data objects; generating multiple metadata entries, wherein a metadata entry includes information identifying a changed data object and the one or more classifications generated for the chanced data object; receiving one or more rules to apply to the multiple metadata entries, wherein the one or more rules provide an order of preference by which the multiple metadata entries are to be integrated into a metabase associated with the data store; applying the one or more rules to the multiple metadata entries; based on the application of the one or more rules, determining the order of preference by which to integrate the multiple metadata entries into the metabase; and integrating the multiple metadata entries into the metabase according to the determined order of preference b3 Wherein the multiple metadata entries are integrated into the metabase beginning at a first time, and Wherein the metabase had been disconnected from the data storage system for a period of time prior to the first time. - View Dependent Claims (19, 20)
-
Specification