Sequential access storage and data de-duplication
Abstract
An example method includes determining a configuration of two or more partitions for a sequential access medium. At least one partition stores data de-duplication data structures while at least one other partition stores a repository of unique data blocks associated with the data structures. The method also includes controlling a data de-duplication computer to configure the sequential access medium according to the configuration. The method includes producing an output sequence for writing the data structures and a set of unique data blocks associated with the set of data structures to the sequential access medium as configured with the two or more partitions. One embodiment includes controlling a data de-duplication computer to write the data de-duplication data structures and the set of unique data blocks to the sequential access medium according to the output sequence.
14 Claims
1. A method, comprising:
- producing an input sequence for acquiring, from a sequential access medium associated with a sequential access device, a set of unique data blocks associated with a data de-duplication application,
- where the set of unique data blocks are identified in one or more data reconstitution recipes,
- where the input sequence depends, at least in part, on the order in which the set of unique data blocks appear on the sequential access medium,
- where the input sequence is a sorted list of unique data blocks,
- where the sorted list is sorted in order from a unique data block that will be encountered first during a sequential read from the sequential access medium to the unique data block that will be encountered last during the sequential read from the sequential access medium,
- where the sequential access medium is configured with two partitions,
- where the sequential access medium stores one or more data de-duplication data structures in a first partition, and stores a repository of one or more unique data blocks associated with the data de-duplication data structures in a second partition,
- where the order of data structures in the partition storing the data de-duplication data structures is one of, index first and recipes second, and recipes first and index second, and
- controlling a data de-duplication computer to acquire the set of unique data blocks from the sequential access medium according to the input sequence,
- where the sequential access medium is linear tape, and
- where the sequential access device is a tape drive.
Dependent claims: 2, 3, 4, 5, 6
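The ordering described in claim 1 can be illustrated with a short sketch. It is not the patented implementation; it only assumes that each unique block has a known position in the repository partition. The names `build_input_sequence`, `recipes`, and `tape_positions` are hypothetical and do not appear in the source.

```python
def build_input_sequence(recipes, tape_positions):
    """Return the unique block ids named by the recipes, ordered by tape position.

    recipes        -- iterable of recipes, each a list of block ids (assumed shape)
    tape_positions -- dict mapping block id -> position in the repository
                      partition (assumed to be known from the index)
    """
    # Collect each block once, even if several recipes reference it.
    needed = {block_id for recipe in recipes for block_id in recipe}
    # Sort from the block encountered first in a sequential read to the one
    # encountered last, so the tape can be read in a single forward pass.
    return sorted(needed, key=lambda block_id: tape_positions[block_id])


if __name__ == "__main__":
    recipes = [["c", "a", "c"], ["b", "a"]]
    tape_positions = {"a": 30, "b": 10, "c": 20, "d": 5}
    print(build_input_sequence(recipes, tape_positions))
```

Block `d` is on the tape but is not named by any recipe, so it is left out of the sequence.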
7. A method, comprising:
- determining a configuration for a sequential access medium to store one or more data de-duplication data structures and a repository of unique data blocks associated with the one or more data de-duplication data structures, where the sequential access medium is linear tape, where the sequential access medium is configurable with more than two partitions, and where the configuration depends, at least in part, on a number of partitions available to create on the sequential access medium;
- controlling a data de-duplication computer to configure the sequential access medium according to the configuration;
- producing an input sequence for acquiring, from the sequential access medium, a retrieval set of unique data blocks associated with a data de-duplication application; and
- controlling the data de-duplication computer to acquire the retrieval set of unique blocks from the sequential access medium according to the input sequence.
Dependent claims: 8, 9
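As a rough illustration of a configuration that depends on the number of partitions available, the sketch below picks a layout for two partitions or for three or more. The function name `choose_configuration`, the dictionary layout, and the decision to reject fewer than two partitions are assumptions made for this example, not details taken from the claims.

```python
def choose_configuration(partitions_available, index_first=True):
    """Map partition numbers to the items stored there (hypothetical layout)."""
    if partitions_available < 2:
        # Assumption: this sketch only covers partitioned media.
        raise ValueError("this sketch assumes at least two partitions")

    # The data structures may be ordered index-then-recipes or the reverse.
    structures = ["index", "recipes"] if index_first else ["recipes", "index"]

    if partitions_available >= 3:
        # One data structure per partition, repository in a third partition.
        return {0: [structures[0]], 1: [structures[1]], 2: ["repository"]}

    # Two partitions: both data structures share the first partition.
    return {0: structures, 1: ["repository"]}


if __name__ == "__main__":
    print(choose_configuration(2))
    print(choose_configuration(3, index_first=False))
```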
10. An apparatus, comprising:
- a processor;
- a memory; and
- an interface connecting the processor, the memory, and a set of logics, the set of logics comprising:
- a configuration logic that prepares a sequential access medium to store a set of data de-duplication data structures and a repository of data blocks associated with the set of data de-duplication data structures, where preparing the sequential access medium comprises partitioning the sequential access medium into three or more partitions, where the sequential access medium is linear tape; and
- a write logic that writes the set of data de-duplication data structures and the repository of data blocks to the sequential access medium, where one member of the set of data de-duplication data structures is written to one partition, where a second member of the set of data de-duplication data structures is written to a second, different partition, and where the repository of data blocks is written to a third, different partition.
Dependent claims: 11, 12, 13, 14
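The three-partition write pattern in claim 10 might be modeled as below. This is a minimal in-memory sketch, not a tape driver: the tape is a plain dictionary, and `write_to_partitions`, the partition numbers 0 to 2, and the choice of index and recipes as the two data structure members are all assumptions made for illustration.

```python
def write_to_partitions(index, recipes, blocks):
    """Simulate writing two data structures and a block repository to
    three different partitions of a (hypothetical, in-memory) tape."""
    tape = {0: [], 1: [], 2: []}
    tape[0].append(index)      # one member of the data structure set
    tape[1].append(recipes)    # a second member, in a different partition
    for block in blocks:       # the repository goes to a third partition
        tape[2].append(block)  # appended in order, as on a sequential medium
    return tape


if __name__ == "__main__":
    layout = write_to_partitions(b"idx", b"rcp", [b"x", b"y"])
    for partition, contents in layout.items():
        print(partition, contents)
```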
Specification