DATA STORAGE OPTIMIZATION USING REPLICATION STATISTICS TO AUTOMATICALLY GENERATE NVMe STREAM IDENTIFIERS
First Claim
1. A method for optimizing storage of data in a data storage replication system including a source site and a target site, comprising:
- for a plurality of write requests received from the source site;
determining transfer statistics corresponding to each of the write requests; and
updating a table with the transfer statistics;
grouping pages in the table having common transfer statistics;
assigning, at the target site, a unique non-volatile memory express (NVMe) stream identifier (ID) to each of the groups;
identifying, at the target site, grouped pages based on the assigned NVMe stream ID;
selecting a storage optimization technique for each of the groups based on the common transfer statistics; and
storing the data of the write requests for each of the groups according to the selected optimization technique;
wherein the transfer statistics include a frequency of updates for the address ranges, the frequency of updates determined, in an asynchronous replication process, by modifications made to a dirty tree of address-to-hash metadata.
3 Assignments
0 Petitions
Accused Products
Abstract
An aspect of optimizing storage of data in a data replication system includes, for a plurality of write requests received from a source site, determining transfer statistics corresponding to each of the write requests and updating a table with the transfer statistics. An aspect also includes grouping pages in the table having common transfer statistics, assigning a unique non-volatile memory express (NVMe) stream identifier (ID) to each of the groups, and identifying grouped pages based on the assigned NVMe stream ID. An aspect further includes selecting a storage optimization technique for each of the groups based on the common transfer statistics and storing data of the write requests for each of the groups according to the selected optimization technique.
0 Citations
20 Claims
-
1. A method for optimizing storage of data in a data storage replication system including a source site and a target site, comprising:
-
for a plurality of write requests received from the source site; determining transfer statistics corresponding to each of the write requests; and updating a table with the transfer statistics; grouping pages in the table having common transfer statistics; assigning, at the target site, a unique non-volatile memory express (NVMe) stream identifier (ID) to each of the groups; identifying, at the target site, grouped pages based on the assigned NVMe stream ID; selecting a storage optimization technique for each of the groups based on the common transfer statistics; and storing the data of the write requests for each of the groups according to the selected optimization technique; wherein the transfer statistics include a frequency of updates for the address ranges, the frequency of updates determined, in an asynchronous replication process, by modifications made to a dirty tree of address-to-hash metadata. - View Dependent Claims (2, 3, 6, 7)
-
-
4. (canceled)
-
5. (canceled)
-
8. A system for optimizing storage of data in a data storage replication system including a source site and a target site, comprising:
-
a memory comprising computer-executable instructions; and a processor executing the computer-executable instructions, the computer-executable instructions when executed by the processor cause the processor to perform operations comprising; for a plurality of write requests received from the source site; determining transfer statistics corresponding to each of the write requests; and updating a table with the transfer statistics; grouping pages in the table having common transfer statistics; assigning, at the target site, a unique non-volatile memory express (NVMe) stream identifier (ID) to each of the groups; identifying, at the target site, grouped pages based on the assigned NVMe stream ID; selecting a storage optimization technique for each of the groups based on the common transfer statistics; and storing the data of the write requests for each of the groups according to the selected optimization technique; wherein the transfer statistics include a frequency of updates for the address ranges, the frequency of updates determined, in an asynchronous replication process, by modifications made to a dirty tree of address-to-hash metadata. - View Dependent Claims (9, 10, 13, 14)
-
-
11. (canceled)
-
12. (canceled)
-
15. A computer program product embodied on a non-transitory computer readable medium, the computer program product including instructions that, when executed by a computer causes the computer to perform operations for optimizing storage of data in a data storage replication system including a source site and a target site, the operations comprising:
-
for a plurality of write requests received at the source site; determining transfer statistics corresponding to each of the write requests; and updating a table with the transfer statistics; grouping pages in the table having common transfer statistics; assigning, at the target site, a unique non-volatile memory express (NVMe) stream identifier (ID) to each of the groups; identifying, at the target site, grouped pages based on the assigned NVMe stream ID; selecting a storage optimization technique for each of the groups based on the common transfer statistics; and storing the data of the write requests for each of the groups according to the selected optimization technique; wherein the transfer statistics include a frequency of updates for the address ranges, the frequency of updates determined, in an asynchronous replication process, by modifications made to a dirty tree of address-to-hash metadata. - View Dependent Claims (16, 17, 20)
-
-
18. (canceled)
-
19. (canceled)
Specification