REPLICATION TECHNIQUES WITH CONTENT ADDRESSABLE STORAGE
Abstract
A CAS data storage system with one or more source CAS data storage spaces and one or more destination CAS data storage spaces, and a communication line therebetween, receives input data at the source storage space for local storage and for replication to the destination CAS storage space. CAS metadata is used in the replication procedure between the two separate CAS storage spaces. Thus, data at the source storage space is used to form an active buffer for transfer to the destination storage space, the active buffer holding a hash result of the respective data item and a storage address. The system detects whenever there is more than one data item in said active buffer sharing a same storage address and upon such detection transfers a respective hash result of only the last of the data items.
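The coalescing behaviour described in the abstract can be illustrated with a minimal sketch. It assumes the active buffer is an ordered sequence of (address, hash result) pairs; the function name `coalesce_active_buffer` and the use of SHA-256 are illustrative assumptions, not details taken from the patent.

```python
import hashlib


def coalesce_active_buffer(entries):
    """Keep only the last hash result recorded for each storage address.

    `entries` is an iterable of (address, hash_result) pairs in arrival order.
    The layout and the function name are hypothetical, chosen for illustration.
    """
    latest = {}
    for address, hash_result in entries:
        # A later write to the same address supersedes the earlier one.
        # Removing and re-inserting keeps the output ordered by last write.
        latest.pop(address, None)
        latest[address] = hash_result
    return list(latest.items())


def hash_of(data):
    # SHA-256 is an assumption; the source does not name a hash function.
    return hashlib.sha256(data).hexdigest()


active_buffer = [
    (0x10, hash_of(b"first write")),
    (0x20, hash_of(b"other block")),
    (0x10, hash_of(b"second write")),  # same address as the first entry
]
to_transfer = coalesce_active_buffer(active_buffer)
```

Only two hash results are queued for transfer here, because the earlier write to address 0x10 has been overwritten before replication.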
25 Claims
1. A CAS data storage system comprising:
- at least one source CAS data storage space;
- at least one destination CAS data storage space; and
- a communication line connecting said source storage space and said destination storage space,
and wherein input data for storage in said system arrives at said source storage space for storage at said source storage space and to be replicated to said destination storage space, the source storage space further comprising an active buffer of incoming data for replication to said destination storage space, said active buffer configured to hold for each of a plurality of data items of said incoming data, a hash result of the respective data item and an address, the system being configured to detect whenever there is more than one data item in said active buffer sharing a same address and upon such detection to transfer a respective hash result of only a last of said data items sharing said same address, to said destination storage space.
Dependent claims: 2, 3, 4, 5, 6, 7.
8. A CAS data storage system comprising at least one source CAS data storage space and a destination CAS data storage space, and a communication line connecting said source storage space and said destination storage space, and wherein input data for storage in said system arrives at said source storage space for storage in said source storage space and replication in said destination storage space, and wherein a hash key of a data item is transferred to said destination storage space, the hash key enabling the destination storage space to determine whether said data corresponding to the data item is already present at said destination storage device and does not require data transfer, said destination storage space being configured to send a signal indicating whether data transfer is required.
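The hash-key exchange recited in claim 8 can be sketched as a two-step handshake. This is a minimal illustration under stated assumptions: the class `DestinationStore`, its methods `offer` and `receive`, the helper `replicate`, and the choice of SHA-256 are all hypothetical and do not come from the patent.

```python
import hashlib


class DestinationStore:
    """Hypothetical destination CAS space: content is indexed by its hash key."""

    def __init__(self):
        self.blocks = {}     # hash key -> data content
        self.addresses = {}  # storage address -> hash key

    def offer(self, address, hash_key):
        """Receive a hash key and signal whether the data itself is required."""
        if hash_key in self.blocks:
            # Content is already present: record the mapping, skip the transfer.
            self.addresses[address] = hash_key
            return False
        return True

    def receive(self, address, hash_key, data):
        """Store transferred data after checking it matches the offered key."""
        if hashlib.sha256(data).hexdigest() != hash_key:
            raise ValueError("data does not match the offered hash key")
        self.blocks[hash_key] = data
        self.addresses[address] = hash_key


def replicate(destination, address, data):
    """Source side: send the hash key first, then the data only if requested."""
    hash_key = hashlib.sha256(data).hexdigest()  # SHA-256 is an assumption
    transfer_required = destination.offer(address, hash_key)
    if transfer_required:
        destination.receive(address, hash_key, data)
    return transfer_required


destination = DestinationStore()
first = replicate(destination, 1, b"payload")   # data must be sent
second = replicate(destination, 2, b"payload")  # only the hash key is sent
```

In this sketch the second replication of identical content costs one hash key on the line rather than the full data item.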
12. A system comprising two CAS data storage spaces with synchronous replication therebetween, one of said storage spaces being a source CAS data storage space and another of said storage spaces being a destination CAS data storage space, the system further comprising a communication line connecting said source storage space and said destination storage space and having a latency, communication over said communication line comprising sending data and acknowledgements in a synchronous cycle, wherein input data for storage in said system arrives at said source storage space for storage in said source storage space and replication to said destination storage space, and wherein a hash key of a data item is transferred to said destination storage space followed within said cycle by starting to transfer corresponding data without awaiting a corresponding acknowledgement, the hash key enabling the destination storage space to determine whether said corresponding data is already present at said destination storage device and does not require data transfer, and to send an acknowledgement in said respective synchronous cycle indicating whether said data transfer is required, said acknowledgement usable at said source storage space to carry out one member of the group comprising discontinuing said started transfer of said corresponding data and acknowledging to a sending application.
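The synchronous variant in claim 12 starts sending data before the acknowledgement for the hash key arrives, then discontinues if the acknowledgement reports that the data is already present. The simulation below is a rough sketch of that timing only. The function name, the chunked transfer, and the `ack_delay_chunks` parameter (line latency expressed as the number of chunks sent before the acknowledgement arrives) are illustrative assumptions.

```python
def synchronous_cycle(data, destination_has_data, chunk_size=4, ack_delay_chunks=2):
    """Simulate one synchronous cycle with speculative data transfer.

    Returns (chunks_sent, outcome). The hash key is assumed to have been sent
    at the start of the cycle; this function models only the data that follows.
    """
    chunks = [data[i:i + chunk_size] for i in range(0, len(data), chunk_size)]
    sent = 0
    for index in range(len(chunks)):
        # The acknowledgement arrives after `ack_delay_chunks` chunks are sent.
        ack_arrived = index >= ack_delay_chunks
        if ack_arrived and destination_has_data:
            # Destination already holds the content: discontinue the transfer.
            return sent, "discontinued"
        sent += 1
    return sent, "completed"


payload = b"0123456789abcdef"  # 16 bytes, i.e. four 4-byte chunks
duplicate_result = synchronous_cycle(payload, destination_has_data=True)
new_data_result = synchronous_cycle(payload, destination_has_data=False)
```

When the destination needs the data, no time is lost waiting for the acknowledgement; when it does not, only the chunks sent during the latency window are wasted.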
13. A CAS data storage system in which data items are stored as data content alongside hash keys identifying said data content, said data items being de-referenced on deletion, said system being configured to reuse said de-referenced data items by reading said hash keys.
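Reuse of de-referenced data, as recited in claim 13, can be sketched with a reference count kept next to each hash key. The class `CasStore`, the `physical_writes` counter, and the use of SHA-256 are assumptions made for illustration; the patent does not prescribe this layout.

```python
import hashlib


class CasStore:
    """Hypothetical CAS space that keeps de-referenced content for reuse."""

    def __init__(self):
        self.content = {}         # hash key -> data content
        self.refcount = {}        # hash key -> number of live references
        self.physical_writes = 0  # how many times content was actually written

    def write(self, data):
        key = hashlib.sha256(data).hexdigest()  # SHA-256 is an assumption
        if key in self.content:
            # Content is still on the media, possibly de-referenced earlier:
            # reuse it by reading its hash key instead of writing again.
            self.refcount[key] += 1
        else:
            self.content[key] = data
            self.refcount[key] = 1
            self.physical_writes += 1
        return key

    def delete(self, key):
        # Deletion only de-references; content and hash key remain in place.
        if self.refcount.get(key, 0) > 0:
            self.refcount[key] -= 1


store = CasStore()
key = store.write(b"block")
store.delete(key)                  # de-referenced, content retained
reused_key = store.write(b"block")  # matched by hash key, no second write
```

A real system would eventually reclaim de-referenced space; that policy is outside the scope of this sketch.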
14. A CAS data storage method wherein input data arrives at a source CAS data storage space for storage at said source storage space and replication to a destination CAS data storage space over a communication line, the method comprising filling an active buffer, at said source storage space, with incoming data for transfer to said destination storage space, said active buffer holding for each of a plurality of data items of said incoming data, a hash result of the respective data item and a storage address, the method further comprising detecting whenever there is more than one data item in said active buffer sharing a same storage address and upon such detection transferring to said replication storage space a respective hash result of only a last of said data items sharing said same storage address.
20. A CAS data storage method for at least one source CAS data storage space and a destination CAS data storage space, and a communication line connecting said source storage space and said destination storage space, and wherein input data for storage in said system arrives at said source storage space for storage in said destination storage space, the method comprising:
- transferring a hash key of a data item to said destination storage space, the hash key enabling the destination storage space to determine whether data corresponding to the data item is already present at said destination storage device and thus does not require data transfer; and
- sending a signal indicating whether said destination storage space requires data transfer.
Dependent claims: 21, 22.
23. A synchronized CAS data storage method for at least one source CAS data storage space and a destination CAS data storage space, and a communication line connecting said source storage space and said destination storage space and having a latency, communication over said communication line comprising sending data and acknowledgements in synchronous cycles, the method comprising:
- receiving input data at said source storage space for storage in said destination storage space;
- transferring a hash key of a data item of said input data to said destination storage space in one of said synchronous cycles;
- within said one synchronous cycle, starting to transfer corresponding data without awaiting a corresponding acknowledgement, the hash key enabling the destination storage space to determine whether it already has said corresponding data and does not require data transfer;
- sending an acknowledgement in said respective synchronous cycle indicating whether said data transfer is required; and
- one member of the group consisting of: discontinuing said started transfer of said corresponding data if said data transfer is not required, and sending an external acknowledgement to a source application.
24. A CAS data storage method in which data items are stored as data content alongside hash keys identifying said data content, said data items being de-referenced on deletion, the method comprising reusing said de-referenced data items by comparing hash keys corresponding to said de-referenced data to hash keys of incoming data items.
25. A first CAS storage space configured to generate and use CAS metadata for storing data items internally therein, and a second CAS storage space configured to generate and use CAS metadata for storing data items internally therein, the first CAS storage space being connected via a communication line to said second CAS storage space and configured to communicate metadata of said first CAS storage space to said second CAS storage space over said communication link in a replication procedure to replicate said first CAS storage space at said second CAS storage space, said CAS metadata comprising a hash key of a data item, the hash key corresponding to a data item to be transferred and enabling the destination storage space to determine whether data corresponding to the data item is already present at said destination storage device and thereby not requiring data transfer, said destination storage space being configured to send a signal indicating whether data transfer is required for said replication.
Specification