Multi stream deduplicated backup of collaboration server data
First Claim
Patent Images
1. A method of backing up data, comprising:
- receiving an indication associated with a backup of a collaboration server dataset;
dividing the dataset into a prescribed number of subsets; and
using a plurality of separate threads to back up the subsets in parallel, wherein the plurality of threads are configured to provide data included in the corresponding subset to a corresponding de-duplicating backup process instance that is configured to perform de-duplication processing with respect to the subset and a corresponding subset associated with a prior backup, wherein the corresponding subset was determined by walking an associated directory in the prescribed order at a prior time with which the prior backup is associated.
9 Assignments
0 Petitions
Accused Products
Abstract
Techniques to backup collaboration server data are disclosed. An indication to begin backup of a collaboration server dataset is received. An associated directory is walked in a prescribed order to divide the dataset into a prescribe number of approximately equal-sized subsets. A separate subset-specific thread is used to back up the subsets in parallel. In some embodiments in which the collaboration data is stored in multiple volumes, a volume-based approach is used to back up the volumes in parallel, e.g., one volume per thread. In some embodiments, transaction logs are backed up in parallel with volumes of collaboration data.
-
Citations
18 Claims
-
1. A method of backing up data, comprising:
receiving an indication associated with a backup of a collaboration server dataset;
dividing the dataset into a prescribed number of subsets; and
using a plurality of separate threads to back up the subsets in parallel, wherein the plurality of threads are configured to provide data included in the corresponding subset to a corresponding de-duplicating backup process instance that is configured to perform de-duplication processing with respect to the subset and a corresponding subset associated with a prior backup, wherein the corresponding subset was determined by walking an associated directory in the prescribed order at a prior time with which the prior backup is associated.- View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
11. A system to back up collaboration server data, comprising:
- a memory of other storage device configured to store a directory associated with a collaboration server dataset; and
a processor configured to;receive an indication associated with a backup of a collaboration server dataset;
divide the dataset into a prescribed number of subsets; and
user a plurality of separate threads to back up the subsets in parallel, wherein the plurality of threads are configured to provide data included in the corresponding subset to a corresponding de-duplicating backup process instance that is configured to perform de-duplication processing with respect to the subset and a corresponding subset associated with a prior backup, wherein the corresponding subset was determined by walking an associated directory in the prescribed order at a prior time with which the prior backup is associated. - View Dependent Claims (12, 13, 14, 15, 16, 17)
- a memory of other storage device configured to store a directory associated with a collaboration server dataset; and
-
18. A computer program product to back up data, the computer program product being embodied in a non-transitory computer readable storage medium and comprising computer instructions for:
receiving an indication associated with a backup of a collaboration server dataset;
dividing the dataset into a prescribed number of subsets; and
using a plurality of separate threads to back up the subsets in parallel, wherein the plurality of threads are configured to provide data included in the corresponding subset to a corresponding de-duplicating backup process instance that is configured to perform de-duplication processing with respect to the subset and a corresponding subset associated with a prior backup, wherein the corresponding subset was determined by walking an associated directory in the prescribed order at a prior time with which the prior backup is associated.
Specification