Controlling tape layout for de-duplication
Abstract
Example methods and apparatus concern creating and/or manipulating a tape-specific layout for a tape-based de-duplication repository. One example apparatus includes a processor, a memory, a set of logics, and an interface to connect the processor, the memory, and the set of logics. The apparatus may include a tape layout logic configured to determine the tape-specific layout for a de-duplication data set. The tape-specific layout may be based on forensic data acquired for the de-duplication data set. The apparatus may also include a write logic configured to write the de-duplication data set to a tape according to the tape-specific layout. The forensic data may identify, for example, the order in which sub-blocks are accessed, reference counts, access frequency, access groups, and other access information.
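The idea of deriving a layout from forensic data can be illustrated with a minimal sketch. It is not the patented method: the `SubBlock` fields, the `plan_tape_layout` function, and the ordering policy (most-referenced access groups first, each group kept contiguous) are all assumptions chosen for illustration. The only grounding is that the abstract names reference counts and access groups as possible forensic data, and that tape is a sequential medium, so sub-blocks read together benefit from being written next to each other.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SubBlock:
    """Hypothetical record for one sub-block plus its forensic data."""
    block_id: str
    size: int          # size in bytes
    ref_count: int     # number of larger blocks that reference this sub-block
    access_group: int  # assumed label for sub-blocks that tend to be read together


def plan_tape_layout(sub_blocks):
    """Return a list of (start_offset, block_id) pairs in write order.

    Assumed policy: keep each access group contiguous, write groups with the
    highest total reference count first, and within a group write the most
    referenced sub-blocks first.
    """
    # Collect sub-blocks by access group.
    groups = {}
    for block in sub_blocks:
        groups.setdefault(block.access_group, []).append(block)

    # Order groups by total reference count, highest first.
    ordered_groups = sorted(
        groups.values(),
        key=lambda group: sum(b.ref_count for b in group),
        reverse=True,
    )

    # Assign sequential offsets, as a tape would be written front to back.
    layout = []
    offset = 0
    for group in ordered_groups:
        for block in sorted(group, key=lambda b: b.ref_count, reverse=True):
            layout.append((offset, block.block_id))
            offset += block.size
    return layout


blocks = [
    SubBlock("a", size=10, ref_count=1, access_group=0),
    SubBlock("b", size=20, ref_count=5, access_group=1),
    SubBlock("c", size=5, ref_count=3, access_group=1),
    SubBlock("d", size=7, ref_count=2, access_group=0),
]
layout = plan_tape_layout(blocks)
```

Here group 1 (total reference count 8) is placed ahead of group 0 (total 3), so `b` and `c` land at the front of the tape and `d` and `a` follow. A real write logic would hand this plan to a tape drive; that step is omitted because it depends on hardware and drivers the text does not describe.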
30 Claims
1. A non-transitory computer-readable medium storing computer-executable instructions that when executed by a computer control the computer to perform a method, the method comprising:
accessing a de-duplication data set, where the de-duplication data set includes one or more of, a set of sub-blocks produced by a de-duplication process, an index produced by the de-duplication process, the index storing index entries related to members of the set of sub-blocks, and re-creation information produced by the de-duplication process, the re-creation information storing data for re-creating a larger data block from two or more sub-blocks in the set of sub-blocks;
identifying a de-duplication attribute associated with the de-duplication data set;
determining a tape-specific layout for at least a portion of the de-duplication data set based, at least in part, on the de-duplication attribute; and
controlling a tape drive to write at least a portion of the de-duplication data set to a first tape according to the tape-specific layout.
Dependent claims: 2 to 22.
23. An apparatus, comprising:
a processor;
a memory;
a set of logics; and
an interface to connect the processor, the memory, and the set of logics, the set of logics comprising;
a tape layout logic configured to determine a tape-specific layout for a de-duplication data set based on forensic data acquired for the de-duplication data set; and
a write logic configured to write the de-duplication data set to a tape according to the tape-specific layout.
Dependent claims: 24 to 29.
30. A system, comprising:
means for identifying an access pattern associated with a de-duplication data set; and
means for controlling a layout for a tape on which the de-duplication is to be stored, where the layout depends, at least in part, on the access pattern.
Specification