Enhancing Analytics Performance Using Distributed Multi-Tiering
Abstract
Embodiments of the invention relate to cluster-centric tiered storage with a flexible tier definition to support performance of transactions. Object data is distributed in a multi-tiered shared-nothing cluster. Hierarchical tiers of data storage are assigned different roles within the hierarchy. The tiers are managed globally across the cluster and objects are placed in tiers according to a flexible tier definition. The probability of object access is computed for objects, and objects are moved to different tiers responsive to the computation to minimize system runtime. The location of an object is further optimized in response to an access request.
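The abstract's core step (compute a probability of access, then pick a tier under a flexible tier definition) can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration: the `Tier` fields, the tier names, the threshold values, and the smoothed-frequency estimate are hypothetical and are not taken from the patent, which does not specify how the probability is calculated.

```python
from dataclasses import dataclass


@dataclass
class Tier:
    """One storage tier. All fields are hypothetical illustration names."""
    name: str               # e.g. "memory", "ssd", "hdd"
    role: str               # workload characteristic the tier is meant to serve
    min_probability: float  # adjustable cut-off: lowest access probability admitted


def access_probability(obj_hits: int, total_requests: int,
                       n_objects: int, smoothing: float = 1.0) -> float:
    """Assumed estimator: smoothed share of requests that hit this object."""
    return (obj_hits + smoothing) / (total_requests + smoothing * n_objects)


def select_tier(prob: float, tiers: list[Tier]) -> Tier:
    """Return the first (fastest) tier whose cut-off the probability meets.

    `tiers` must be ordered fastest first; the last tier is the fallback.
    """
    for tier in tiers:
        if prob >= tier.min_probability:
            return tier
    return tiers[-1]


# A "flexible tier definition" is modelled here as plain data that can be
# edited (thresholds changed, tiers added) without touching the logic.
TIERS = [
    Tier("memory", "random reads", 0.50),
    Tier("ssd", "mixed access", 0.10),
    Tier("hdd", "sequential scans", 0.00),
]

hot = access_probability(obj_hits=9, total_requests=10, n_objects=2)
cold = access_probability(obj_hits=0, total_requests=10, n_objects=2)
hot_tier = select_tier(hot, TIERS)
cold_tier = select_tier(cold, TIERS)
```

Keeping the tier definition as a data table rather than hard-coded branches is one simple way to make it adjustable at run time; a real cluster would also need capacity limits and cross-node coordination, which this sketch leaves out.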
25 Claims
1. A method comprising:
in a computer cluster in communication with a storage system, the storage system organized in a hierarchy, assigning a different role to each storage tier within the hierarchy, each role related to a workload characteristic;
managing the tiers across the cluster, the management responsive to a flexible tier definition within the hierarchy;
calculating probability of access to a data object, selecting a tier for placement of the data object based upon the calculation, and placing the data object in one of the storage tiers in the hierarchy responsive to the flexible tier definition; and
computing placement of a new data object within the hierarchy to minimize system runtime, including optimizing placement of the new data object in response to an access request such that I/O response time is optimized.
Dependent claims: 2, 3, 4, 5, 6, 7, 8
9. A computer program product for use with objects in a computer cluster, the computer program product comprising a computer-readable storage medium having program code embodied therewith, the program code being executable by a processor to:
organize a storage system in communication with the cluster in a hierarchy, including assigning a different role to each storage tier within the hierarchy, each role related to a workload characteristic;
manage the tiers across the cluster, responsive to a flexible tier definition within the hierarchy;
calculate probability of access to a data object, select a tier for placement of the data object, the selection based on the calculation, and place the data object in one of the storage tiers in the hierarchy in response to the flexible tier definition; and
compute placement of a new data within the hierarchy to minimize system runtime, including optimizing the location of the new object in response to an access request.
Dependent claims: 10, 11, 12, 13, 14, 15
16. A system comprising:
two or more nodes in a computer cluster in communication with a storage system, the storage system organized in a hierarchy and a flexible tier definition within the hierarchy, each tier assigned a different role within the hierarchy;
a filesystem in communication with the storage system, the file system to manage the tiers across the cluster responsive to the flexible tier definition, including;
placement of one or more objects in a select tier in the hierarchy responsive to the flexible tier definition; and
compute new data placement within the hierarchy to minimize system runtime, including to optimize the location of the new object in response to an access request.
Dependent claims: 17, 18, 19, 20
21. A method comprising:
in a computer cluster in communication with a storage system organized in a hierarchy, assigning different roles to each storage tier within the hierarchy;
placing one or more objects in a select storage tier in the hierarchy responsive to a flexible tier definition; and
managing the tiers globally across the cluster, including one or more objects assigned to a tier, the management responsive to a flexible tier definition within the hierarchy, including computing data placement within the hierarchy to minimize system runtime, and calculating probability of access to the object and optimizing the location of the object in response to an access request.
Dependent claims: 22, 23, 24, 25
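The last element shared by the independent claims, optimizing an object's location in response to an access request, can be pictured as re-evaluating placement each time an object is read. The sketch below is only one possible reading under stated assumptions: the `PlacementTracker` class, the `TIERS` table, its thresholds, and the use of a raw hit frequency as the access probability are all hypothetical and do not come from the patent text.

```python
from collections import Counter

# Hypothetical tier table, fastest first: (tier name, minimum access probability).
TIERS = [("memory", 0.50), ("ssd", 0.10), ("hdd", 0.00)]


class PlacementTracker:
    """Tracks per-object hits and re-evaluates placement on each access."""

    def __init__(self) -> None:
        self.hits = Counter()   # object id -> number of accesses seen
        self.total = 0          # accesses seen across all objects
        self.location = {}      # object id -> current tier name

    def _tier_for(self, obj_id: str) -> str:
        # Assumed estimator: fraction of all observed accesses that hit this object.
        prob = self.hits[obj_id] / self.total if self.total else 0.0
        for name, threshold in TIERS:
            if prob >= threshold:
                return name
        return TIERS[-1][0]

    def place_new(self, obj_id: str) -> str:
        """Place a new object using whatever statistics exist so far."""
        self.location[obj_id] = self._tier_for(obj_id)
        return self.location[obj_id]

    def on_access(self, obj_id: str) -> tuple[str, bool]:
        """Record an access and move the object if its target tier changed."""
        self.hits[obj_id] += 1
        self.total += 1
        target = self._tier_for(obj_id)
        moved = self.location.get(obj_id) != target
        self.location[obj_id] = target
        return target, moved


tracker = PlacementTracker()
first_tier = tracker.place_new("a")      # no history yet, so the slowest tier
promoted = tracker.on_access("a")        # sole accessed object: promoted
```

Note that only the accessed object is re-evaluated here; other objects keep their tier until they are read again. A fuller treatment of "minimize system runtime" would weigh tier capacity and migration cost, which this sketch deliberately ignores.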
Specification