Iterating in parallel for deduplication
First Claim
Patent Images
1. A method for use in iterating in parallel for deduplication, the method comprising:
- selecting a collection of sections from a set of storage extents based on an iteration scheme, wherein a deduplication domain includes the set of storage extents, wherein each storage extent of the set of storage extents is apportioned into a set of sections, wherein each section of the collection of sections includes subset of the contents of a storage extent of the set of storage extents, wherein the contents of a storage extent includes a set of data blocks;
arranging each section of the collection of sections in an ordered arrangement based on the iteration scheme selected from a list of iteration schemes for applying a deduplicating technique, wherein the list of iteration schemes include a parallel iteration scheme, wherein each iteration scheme of the list of iteration schemes indicates a manner in which the set of storage extents is iterated for applying the deduplicating technique, wherein the ordered arrangement indicates an order in which each section of the collection of sections is processed for applying the deduplicating technique; and
based on the ordered arrangement, applying the deduplicating technique in parallel to each section of the collection of sections, wherein data blocks in each section of the collection of sections are deduplicated in parallel based on the ordered arrangement, wherein a number of sections of the collection of sections selected for applying the deduplicating technique in parallel are based on a set of processes used for deduplication, wherein each section of the number of sections is iterated by a process of the set of processes.
9 Assignments
0 Petitions
Accused Products
Abstract
A method is used in iterating in parallel for deduplication. Based on an iteration scheme, a collection of sections is selected from a set of storage extents. Each section of the collection of sections includes subset of the contents of a storage extent of the set of storage extents. Based on the iteration scheme, each section of the collection of sections is arranged in an ordered arrangement. Based on the ordered arrangement, a deduplicating technique is applied in parallel to each section of the collection of sections.
-
Citations
20 Claims
-
1. A method for use in iterating in parallel for deduplication, the method comprising:
-
selecting a collection of sections from a set of storage extents based on an iteration scheme, wherein a deduplication domain includes the set of storage extents, wherein each storage extent of the set of storage extents is apportioned into a set of sections, wherein each section of the collection of sections includes subset of the contents of a storage extent of the set of storage extents, wherein the contents of a storage extent includes a set of data blocks; arranging each section of the collection of sections in an ordered arrangement based on the iteration scheme selected from a list of iteration schemes for applying a deduplicating technique, wherein the list of iteration schemes include a parallel iteration scheme, wherein each iteration scheme of the list of iteration schemes indicates a manner in which the set of storage extents is iterated for applying the deduplicating technique, wherein the ordered arrangement indicates an order in which each section of the collection of sections is processed for applying the deduplicating technique; and based on the ordered arrangement, applying the deduplicating technique in parallel to each section of the collection of sections, wherein data blocks in each section of the collection of sections are deduplicated in parallel based on the ordered arrangement, wherein a number of sections of the collection of sections selected for applying the deduplicating technique in parallel are based on a set of processes used for deduplication, wherein each section of the number of sections is iterated by a process of the set of processes. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A system for use in iterating in parallel for deduplication, the system comprising a processor configured to:
-
select a collection of sections from a set of storage extents based on an iteration scheme, wherein a deduplication domain includes the set of storage extents, wherein each storage extent of the set of storage extents is apportioned into a set of sections, wherein each section of the collection of sections includes subset of the contents of a storage extent of the set of storage extents, wherein the contents of a storage extent includes a set of data blocks; arrange each section of the collection of sections in an ordered arrangement based on the iteration scheme selected from a list of iteration schemes for applying a deduplicating technique, wherein the list of iteration schemes include a parallel iteration scheme, wherein each iteration scheme of the list of iteration schemes indicates a manner in which the set of storage extents is iterated for applying the deduplicating technique, wherein the ordered arrangement indicates an order in which each section of the collection of sections is processed for applying the deduplicating technique; and apply, based on the ordered arrangement, the deduplicating technique in parallel to each section of the collection of sections, wherein data blocks in each section of the collection of sections are deduplicated in parallel based on the ordered arrangement, wherein a number of sections of the collection of sections selected for applying the deduplicating technique in parallel are based on a set of processes used for deduplication, wherein each section of the number of sections is iterated by a process of the set of processes. - View Dependent Claims (15, 16, 17, 18, 19, 20)
-
Specification