Detecting and managing orphan files between primary and secondary data stores for content addressed storage
First Claim
1. In a distributed computing system having a primary storage device, a secondary storage device and a policy engine server, the policy engine server being configured to manage migration of files between the primary and secondary storage devices by migrating data files on the primary storage device to the secondary storage device and replacing the data files on the primary storage device with placeholder files, a method for locating orphan files within the secondary storage device that are not properly referenced by placeholder files on the primary storage device, the method comprising:
- identifying a secondary file on a secondary storage device, wherein the secondary file corresponds to a data file on a primary storage device that has been replaced with a placeholder file, the secondary file being associated with file identification data, wherein the file identification data includes a content address that references the placeholder file on the primary storage device;
identifying the placeholder file on the primary storage device, the placeholder file including an offline reference that identifies a particular secondary file on the secondary storage device; and
determining if the offline reference of the placeholder file validly references the content address of the secondary file, wherein the secondary file is identified as an orphan file on the secondary storage device if the secondary file is different from the particular secondary file identified by the offline reference of the placeholder file.
9 Assignments
0 Petitions
Accused Products
Abstract
A method and system for locating and eliminating orphan files within a secondary storage device. The method includes identifying a secondary file on a secondary storage device, the secondary file being associated with file identification data, wherein the file identification data includes a content address, identifying a placeholder file on a primary storage device, the placeholder file being associated with an offline reference, and determining if the offline reference of the placeholder file validly references the content address of the secondary file.
106 Citations
20 Claims
-
1. In a distributed computing system having a primary storage device, a secondary storage device and a policy engine server, the policy engine server being configured to manage migration of files between the primary and secondary storage devices by migrating data files on the primary storage device to the secondary storage device and replacing the data files on the primary storage device with placeholder files, a method for locating orphan files within the secondary storage device that are not properly referenced by placeholder files on the primary storage device, the method comprising:
-
identifying a secondary file on a secondary storage device, wherein the secondary file corresponds to a data file on a primary storage device that has been replaced with a placeholder file, the secondary file being associated with file identification data, wherein the file identification data includes a content address that references the placeholder file on the primary storage device; identifying the placeholder file on the primary storage device, the placeholder file including an offline reference that identifies a particular secondary file on the secondary storage device; and determining if the offline reference of the placeholder file validly references the content address of the secondary file, wherein the secondary file is identified as an orphan file on the secondary storage device if the secondary file is different from the particular secondary file identified by the offline reference of the placeholder file. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A content addressable data migration storage system for identifying orphan files, comprising:
-
a primary storage device employing one of location addressable storage or content addressable storage and configured to store and detect placeholder files, wherein the placeholder files replace data files that have been migrated to a secondary storage device, each of the placeholder files including an offline reference for identifying a secondary file; the secondary storage device employing content addressable storage and configured to store secondary files that are the data files that have been migrated from the primary storage device, each of the secondary files including a content address or a location address and at least one online reference to a placeholder file on the primary storage device; and a policy engine server including a file management module for identifying secondary files that are not identified by any offline references of placeholder files on the primary storage as orphan files wherein orphan files identify secondary files in the secondary storage device that has no corresponding placeholder file in the primary storage device. - View Dependent Claims (11, 12, 13, 14, 15)
-
-
16. In a distributed computer having a primary storage device, a secondary storage device and a policy engine server, the policy engine server being configured to manage migration of files between the primary storage device and the secondary storage device by migrating data files on the primary storage device to the secondary storage device and replacing the data files on the primary storage device with placeholder files, a computer program product configured to implement a method of locating and eliminating orphan files within the secondary storage device that are not properly referenced by placeholder files on the primary storage device, the computer program product comprising one or more computer readable media having stored thereon computer executable instructions that, when executed by a processor, cause the policy engine server to perform the following:
-
identify a secondary file on a secondary storage device, wherein the secondary file corresponds to a data file on a primary storage device that has been replaced with a placeholder file, the secondary file being associated with file identification data, wherein the file identification data includes a content address that references the placeholder file identify the placeholder file on the primary storage device, the placeholder file including an offline reference that identifies a particular secondary file on the secondary storage device; and determine if the offline reference of the placeholder file validly references the content address of the secondary file, wherein the secondary file is identified as an orphan file on the secondary storage device if the secondary file is not the particular secondary file identified by the offline reference of the placeholder file. - View Dependent Claims (17, 18, 19, 20)
-
Specification