System and method for implementing a hierarchical data storage system
First Claim
Patent Images
1. A data storage system, comprising:
- a first data storage layer, the first data storage layer comprising data that can be accessed at a high data rate with a low latency;
a second data storage layer, said second data storage layer comprising de-duplicated data evicted from the first data storage layer, wherein the second storage layer has a higher retrieval latency than the first data storage layer;
a barrier storage area, the barrier storage area comprising a subset of the de-duplicated data that is stored for a settlement period, wherein the settlement period represents a time after which the de-duplicated data is made accessible; and
a third data storage layer coupled to the barrier storage area and configured to receive the subset of the de-duplicated data, said third data storage layer having a higher retrieval latency than said second data storage layer;
wherein a first data item is stored in said first, second, or third data storage layer based upon frequency of use of said first data item.
4 Assignments
0 Petitions
Accused Products
Abstract
A system and method for efficiently storing data both on-site and off-site in a cloud storage system. Data read and write requests are received by a cloud data storage system. The cloud storage system has at least three data storage layers. A first high-speed layer, a second efficient storage layer, and a third off-site storage layer. The first high-speed layer stores data in raw data blocks. The second efficient storage layer divides data blocks from the first layer into data slices and eliminates duplicate data slices. The third layer stores data slices at an off-site location.
38 Citations
20 Claims
-
1. A data storage system, comprising:
-
a first data storage layer, the first data storage layer comprising data that can be accessed at a high data rate with a low latency; a second data storage layer, said second data storage layer comprising de-duplicated data evicted from the first data storage layer, wherein the second storage layer has a higher retrieval latency than the first data storage layer; a barrier storage area, the barrier storage area comprising a subset of the de-duplicated data that is stored for a settlement period, wherein the settlement period represents a time after which the de-duplicated data is made accessible; and a third data storage layer coupled to the barrier storage area and configured to receive the subset of the de-duplicated data, said third data storage layer having a higher retrieval latency than said second data storage layer; wherein a first data item is stored in said first, second, or third data storage layer based upon frequency of use of said first data item. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A system, comprising:
-
a first data storage layer in a memory, the first data storage layer comprising data blocks; a second data storage layer in the memory, the second data storage layer comprising de-duplicated data evicted from the first data storage layer, wherein the second storage layer has a higher retrieval latency than the first data storage layer; and a barrier storage area in the memory, the barrier storage area comprising a subset of the de-duplicated data that is stored for a settlement period before being deleted, wherein the subset of the de-duplicated data is selected for transfer to a third storage layer operated by a cloud storage provider, wherein the third data storage layer has a higher retrieval latency than the second data storage layer. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19)
-
-
20. A memory device having program instructions stored thereon that, upon execution by a computer system, cause the computer system to provide:
-
a first data storage layer comprising data blocks; a second data storage layer comprising de-duplicated data evicted from the first data storage layer, wherein the second storage layer has a higher retrieval latency first data storage layer; and a barrier storage area comprising a subset of the de-duplicated data that is stored for a settlement period after which the subset of the de-duplicated data is deleted, wherein the subset of the de-duplicated data is selected for transfer to a third storage layer operated by a cloud storage provider via a network, wherein the third data storage layer has a higher retrieval latency than the second data storage layer, wherein the settlement period is configurable dependent upon the cloud storage provider being used, wherein the settlement period represents a time after which the de-duplicated data stored by the cloud storage provider is made accessible, and wherein a given data item is stored in the first, second, or third data storage layer based upon frequency of use of the given data item.
-
Specification