Rule-based tiered storage system
First Claim
Patent Images
1. A device, comprising:
- a processor communicatively coupled to a memory, the processor configured to;
receive a set of relevance classes for tagging data segments;
receive a first data segment including a first metadata to be stored in a storage unit, wherein the storage unit comprises at least two storage tiers;
wherein the first metadata comprises a set of variables;
wherein the set of variables includes a location of the first data segment in a directory structure, a group to which a user of the first data segment belongs, and at least one or more of;
a user of the data segment, a size of the data segment, an access control list for the data segment, a time of generation of the data segment, a time of last modification of the data segment, a time of last access of the data segment, a name of the data segment, an indication of whether structured or unstructured data are contained in the data segment, and an indication of a required data retention time;
receive other metadata for other data segments stored in the storage unit;
determine an event associated with the first data segment, the event comprising an occurrence of one or more pre-defined events;
remove at least one relevance class from the set of relevance classes to create an adjusted set of relevance classes based on the event;
determine an observable comprising two or more variables of the set of variables for the first data segment;
determine, for each value of the observable, a distribution of stored data segments across the adjusted set of relevance classes utilizing an agglomerative information bottleneck algorithm;
assign a relevance class out of the adjusted set of relevance classes to the first data segment based on one or more results of the agglomerative information bottleneck algorithm;
determine a storage tier of the at least two storage tiers to store the first data segment based on the other metadata of the other data segments stored in the storage tier, the relevance class, the first metadata of the first data segment, and a content of the first data segment;
determine a protection level for the first data segment based on one or more of the first metadata of the first data segment, the other metadata of the other data segments stored in the storage unit, and the content of the first data segment; and
storing the first data segment on the storage tier with the protection level, wherein the first data segment is encoded with the protection level by an encoder prior to being stored on the storage tier.
1 Assignment
0 Petitions
Accused Products
Abstract
A method for storing a data segment in a storage tier of a storage unit comprising at least two storage tiers includes receiving the data segment to be stored including metadata; receiving metadata of data segments stored in the storage unit; and determining the storage tier to store the received data segment to and a protection level dependent on the metadata received and dependent on the metadata of the received data segment.
82 Citations
4 Claims
-
1. A device, comprising:
-
a processor communicatively coupled to a memory, the processor configured to; receive a set of relevance classes for tagging data segments; receive a first data segment including a first metadata to be stored in a storage unit, wherein the storage unit comprises at least two storage tiers; wherein the first metadata comprises a set of variables; wherein the set of variables includes a location of the first data segment in a directory structure, a group to which a user of the first data segment belongs, and at least one or more of;
a user of the data segment, a size of the data segment, an access control list for the data segment, a time of generation of the data segment, a time of last modification of the data segment, a time of last access of the data segment, a name of the data segment, an indication of whether structured or unstructured data are contained in the data segment, and an indication of a required data retention time;receive other metadata for other data segments stored in the storage unit; determine an event associated with the first data segment, the event comprising an occurrence of one or more pre-defined events; remove at least one relevance class from the set of relevance classes to create an adjusted set of relevance classes based on the event; determine an observable comprising two or more variables of the set of variables for the first data segment; determine, for each value of the observable, a distribution of stored data segments across the adjusted set of relevance classes utilizing an agglomerative information bottleneck algorithm; assign a relevance class out of the adjusted set of relevance classes to the first data segment based on one or more results of the agglomerative information bottleneck algorithm; determine a storage tier of the at least two storage tiers to store the first data segment based on the other metadata of the other data segments stored in the storage tier, the relevance class, the first metadata of the first data segment, and a content of the first data segment; determine a protection level for the first data segment based on one or more of the first metadata of the first data segment, the other metadata of the other data segments stored in the storage unit, and the content of the first data segment; and storing the first data segment on the storage tier with the protection level, wherein the first data segment is encoded with the protection level by an encoder prior to being stored on the storage tier.
-
-
2. A non-transitory computer readable storage medium having computer readable instructions stored thereon that, when executed by a computer, implement a method comprising:
-
receiving a first data segment including metadata to be stored in a storage unit, wherein the storage unit comprises at least two storage tiers; wherein the metadata of the first data segment comprises a set of variables; wherein the set of variables includes a location of the first data segment in a directory structure, a group to which a user of the first data segment belongs, and at least one or more of; a user of the data segment, a size of the data segment, an access control list for the data segment, a time of generation of the data segment, a time of last modification of the data segment, a time of last access of the data segment, a name of the data segment, an indication of whether structured or unstructured data are contained in the data segment, and an indication of a required data retention time; receiving other metadata for other data segments stored in the storage unit; determining an event associated with the first data segment, the event comprising an occurrence of one or more pre-defined events; removing at least one relevance class from a set of relevance classes to create an adjusted set of relevance classes based on the event; determining an observable comprising two or more variables of the set of variables for the first data segment; determining, for each value of the observable, a distribution of stored data segments across the adjusted set of relevance classes utilizing an agglomerative information bottleneck algorithm; assigning a relevance class out of the adjusted set of relevance classes to the first data segment based on one or more results of the agglomerative information bottleneck algorithm; determining a storage tier of the at least two storage tiers to store the first data segment based on the other metadata of the other data segments stored in the storage tier, the relevance class, the first metadata of the first data segment, and a content of the first data segment; determining a protection level to apply to the first data segment based on one or more of the first metadata of the first data segment, the other metadata of the other data segments stored in the storage unit, and the content of the received data segment; and storing the first data segment on the storage tier with the protection level, wherein the first data segment is encoded with the protection level by an encoder prior to being stored on the storage tier. - View Dependent Claims (3, 4)
-
Specification