Data Classification Aware Object Storage
First Claim
1. A non-transitory computer-readable storage medium storing computer-executable instructions that when executed by a computer cause the computer to perform a method, the method comprising:
- accessing data that is to be stored in an object store, where the object store is configured with two or more data destinations, where different data destinations have different data storage policies;
classifying the data by identifying a value for an attribute of the data;
selecting a data storage policy associated with a member of the two or more data destinations based, at least in part, on the value of the attribute, andproviding the data to a member of the two or more data destinations that is associated with the data storage policy.
7 Assignments
0 Petitions
Accused Products
Abstract
Example apparatus and methods process data that is going to be stored in an object store. The object store may have multiple data destinations (e.g., “buckets”). Different buckets have different data storage policies that control, for example, how many copies of the data will be made, whether the data will be stored onsite or offsite, or other storage parameters. Data may be classified by identifying a value for an attribute (e.g., file type, file source) of the data. A storage policy associated with a bucket may then be selected based on the attribute. Once the storage policy has been selected, then the data may be provided to a bucket associated with the storage policy. The number of buckets, data classifications, or storage policies may be updated by adaptive parameterization that considers the amount or type of data observed and stored in the object store.
49 Citations
21 Claims
-
1. A non-transitory computer-readable storage medium storing computer-executable instructions that when executed by a computer cause the computer to perform a method, the method comprising:
-
accessing data that is to be stored in an object store, where the object store is configured with two or more data destinations, where different data destinations have different data storage policies; classifying the data by identifying a value for an attribute of the data; selecting a data storage policy associated with a member of the two or more data destinations based, at least in part, on the value of the attribute, and providing the data to a member of the two or more data destinations that is associated with the data storage policy. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15)
-
-
16. An apparatus, comprising:
-
a processor; a memory; a set of logics that control storage of data in an object store configured with two or more buckets, where the set of logics cause an item to be stored in a member of the two or more buckets based, at least in part, on a set of data classifications; and an interface that connects the processor, the memory, and the set of logics; the set of logics comprising; a first logic that produces a classification of the item to be stored by the object store; a second logic that selects a bucket from the two or more buckets based, at least in part, on the classification; and a third logic that controls the item to be provided to the bucket. - View Dependent Claims (17, 18, 19, 20)
-
-
21. An object store, comprising:
-
means for partitioning an object store into a plurality of data stores, where a member of the plurality of data stores is associated with a unique addressable namespace and a set of storage parameters; means for dynamically establishing the set of storage parameters for a member of the plurality of data stores; means for identifying a set of attributes for a file to be stored in a member of the plurality of data stores; and means for selecting a member of the plurality of data stores to store the file based, at least in part, on a comparison of the set of attributes and the set of storage parameters for the member of the plurality of data stores.
-
Specification