Automated file error classification and correction in a hierarchical storage management system
Abstract
Disclosed is a system to diagnose and handle errors in an automated hierarchical storage management system. A host system requests an access operation on a first file, such as a logical volume, in a first storage device, such as a magnetic hard disk drive. A server processes the host request to determine whether the first file is resident in the first storage device. The server initiates a recall of a second file, such as a physical volume, in a second storage device, such as tape cartridges, optical disks, etc., corresponding to the first file upon determining that the first file is not resident in the first storage device. The second file is then copied from the second storage device to the first file in the first storage device upon determining that the second file is accessible. The server further determines whether the recall of the second file has failed. Upon determining that the recall has failed, the server checks a table in a memory within the server to determine whether there is error information listed for the second file involved in the failed recall. The server then takes appropriate action based on the error information in the table.
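
The flow summarized in the abstract can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the patented implementation: the `Server` class, its `cache`, `library`, `unreadable` and `error_table` fields, the `RecallError` exception, and the `max_retries` limit are all hypothetical names introduced here. The first storage device is modeled as a dictionary of resident files, the second storage device as a dictionary of recallable files, and the in-memory table maps a file name to a callable that performs the corrective operations.

```python
from dataclasses import dataclass, field


class RecallError(Exception):
    """Raised when a file cannot be recalled from the second storage device (hypothetical)."""


@dataclass
class Server:
    cache: dict = field(default_factory=dict)        # first storage device: name -> data
    library: dict = field(default_factory=dict)      # second storage device: name -> data
    unreadable: set = field(default_factory=set)     # files whose recall currently fails
    error_table: dict = field(default_factory=dict)  # in-memory table: name -> corrective action

    def recall(self, name):
        """Copy the second file into the first storage device, or raise RecallError."""
        if name in self.unreadable or name not in self.library:
            raise RecallError(name)
        self.cache[name] = self.library[name]

    def access(self, name, max_retries=1):
        """Serve a host request, consulting the error table when a recall fails."""
        for attempt in range(max_retries + 1):
            if name in self.cache:                   # first file is resident
                return self.cache[name]
            try:
                self.recall(name)
            except RecallError:
                action = self.error_table.get(name)
                if action is None or attempt == max_retries:
                    raise                            # nothing listed, or retries exhausted
                action(self, name)                   # perform the listed operations, then retry
        return self.cache[name]
```

For example, registering an action that clears the fault (say, `lambda server, name: server.unreadable.discard(name)`) lets `access` succeed on the retry, while a failed recall with no table entry propagates `RecallError` to the caller. The bounded retry count is a design choice of this sketch; the abstract does not state one.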
187 Citations
29 Claims
1. A method to diagnose and handle an error in an automated hierarchical storage management system, comprising:
a) processing a host request for an access operation on a first file in a first storage device with a server to determine whether the first file is resident in the first storage device;
b) initiating a recall with the server of a second file in a second storage device corresponding to the first file upon determining that the first file is not resident in the first storage device;
c) copying the second file from the second storage device to the first file in the first storage device upon determining that the second file is accessible;
d) determining whether the recall of the second file in the second storage device has failed;
e) initiating a process to check a table in a memory within the server to determine information indicating operations to perform if the recall of the second file failed; and
f) notifying the server to retry the access operation after performing the operations.
9. A method to diagnose and handle an error in an automated hierarchical storage management system, comprising:
(a) requesting with a host system an access operation on a first file in a first storage device;
(b) processing the host request with a server to determine whether the first file is resident in the first storage device;
(c) initiating a recall with the server of a second file in a second storage device corresponding to the first file upon determining that the first file is not resident in the first storage device;
(d) copying the second file from the second storage device to the first file in the first storage device upon determining that the second file is accessible;
(e) determining whether the recall of the second file in the second storage device has failed;
(f) checking with the server a table in a memory within the server to determine whether there is error information that indicates that there is a permanent read error in the second file if the recall of the second file has failed;
(g) moving the first file to a third file in the first storage device;
(h) recovering as much data as possible from the second file;
(i) copying the data recovered from the second file to the first file in the first storage device; and
(j) retrying the access operation on the first file.
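
Steps (g) through (i) of claim 9 can be illustrated with a small sketch. Everything named here is an assumption made for illustration: the second file is modeled as a list of blocks in which `None` marks a block that cannot be read, `read_block` and `ReadError` are hypothetical helpers, the `.saved` suffix for the third file is invented, and zero-filling lost blocks is only one possible policy (the claim says only that as much data as possible is recovered).

```python
class ReadError(Exception):
    """Raised for an unreadable block on the second storage device (hypothetical)."""


def read_block(volume, index):
    """Return one block of the second file; None marks a damaged block in this sketch."""
    block = volume[index]
    if block is None:
        raise ReadError(index)
    return block


def recover_with_permanent_error(cache, library, name, block_size=4):
    """Sketch of steps (g)-(i) for a file flagged with a permanent read error.

    Returns the indices of the blocks that could not be recovered.
    """
    # (g) move the first file to a third file so its current contents are preserved
    if name in cache:
        cache[name + ".saved"] = cache.pop(name)

    # (h) recover as much data as possible, block by block
    recovered, lost = [], []
    for i in range(len(library[name])):
        try:
            recovered.append(read_block(library[name], i))
        except ReadError:
            recovered.append(b"\x00" * block_size)  # placeholder for lost data (assumption)
            lost.append(i)

    # (i) copy the recovered data to the first file in the first storage device
    cache[name] = b"".join(recovered)
    return lost
```

After this function returns, the caller would retry the access operation on the first file, as in step (j). Returning the list of lost block indices lets the caller report how much of the file was salvaged.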
11. A computer system to diagnose and handle an error in an automated hierarchical storage management system when a host system attempts an access operation, comprising:
(a) a server in communication with the host system, wherein the server includes means for processing a request by the host system to access a first file;
(b) a first storage device in communication with the server;
(c) a second storage device in communication with the server;
(d) means, performed by the server, for determining whether the first file is resident in the first storage device;
(e) means, performed by the server, for initiating a recall of a second file in a second storage device corresponding to the first file upon determining that the first file is not resident in the first storage device;
(f) means for copying the second file from the second storage device to the first file in the first storage device upon determining that the second file is accessible;
(g) means, performed by the server, for determining whether the recall of the second file in the second storage device has failed;
(h) a memory in the server storing a table containing error information for files in the second storage device;
(i) means, performed by the server, for initiating a process to check the table to determine information indicating operations to perform if the recall of the second file has failed; and
(j) means for notifying the server to retry the access operation on the first file after performing the operations.
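
Elements (h) and (i) of claim 11 describe a table that holds error information per file and yields the operations to perform. One way to picture such a table is sketched below. The error categories other than the permanent read error, the operation names, and the `ErrorTable` class are all hypothetical; the claims do not enumerate them.

```python
from enum import Enum, auto


class ErrorKind(Enum):
    """Illustrative error classifications; only the permanent read error appears in the claims."""
    MISSING_CARTRIDGE = auto()
    DRIVE_FAULT = auto()
    PERMANENT_READ_ERROR = auto()


# Hypothetical mapping from an error classification to the operations to perform.
OPERATIONS = {
    ErrorKind.MISSING_CARTRIDGE: ["audit_library", "remount"],
    ErrorKind.DRIVE_FAULT: ["switch_drive", "remount"],
    ErrorKind.PERMANENT_READ_ERROR: ["move_first_file", "recover_data", "copy_recovered_data"],
}


class ErrorTable:
    """In-memory table of error information for files in the second storage device."""

    def __init__(self):
        self._entries = {}  # file name -> ErrorKind

    def record(self, name, kind):
        """Store error information for a file."""
        self._entries[name] = kind

    def operations_for(self, name):
        """Return the operations listed for a file, or an empty list if none is recorded."""
        kind = self._entries.get(name)
        return list(OPERATIONS[kind]) if kind is not None else []
```

With this layout, a failed recall of a file triggers a single lookup, `table.operations_for(name)`, and the server runs the returned operations in order before the access operation is retried.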
19. A computer system to diagnose and handle an error in an automated hierarchical storage management system when a host system attempts an access operation, comprising:
a server in communication with the host system, wherein the server includes means for processing a request by the host system to access a first file;
a first storage device in communication with the server;
a second storage device in communication with the server;
means, performed by the server, for determining whether the first file is resident in the first storage device;
means, performed by the server, for initiating a recall of a second file in a second storage device corresponding to the first file upon determining that the first file is not resident in the first storage device;
means for copying the second file from the second storage device to the first file in the first storage device upon determining that the second file is accessible;
means, performed by the server, for determining whether the recall of the second file in the second storage device has failed;
a memory in the server storing a table containing error information for files in the second storage device;
means, performed by the server, for checking the table to determine whether the table includes error information that indicates that there is a permanent read error in the second file if the recall of the second file has failed; and
means for moving the first file to a third file in the first storage device;
means for recovering as much data as possible from the second file;
means for copying the data recovered from the second file to the first file in the first storage device, thereby replacing the contents of the first file with the data recovered from the second file; and
means, performed by the server, for retrying the access operation on the first file.
21. An article of manufacture for use in programming a server to diagnose and handle errors in an automated hierarchical storage management system, the article of manufacture comprising a computer readable storage medium having computer objects embodied therein that cause the server to perform the steps of:
(a) processing a request from a host system to access a first file in a first storage device;
(b) determining whether the first file is resident in the first storage device;
(c) initiating a recall of a second file in a second storage device corresponding to the first file upon determining that the first file is not resident in the first storage device;
(d) requesting to copy the second file from the second storage device to the first file in the first storage device upon determining that the second file is accessible;
(e) determining whether the recall of the second file in the second storage device has failed;
(f) initiating a process to check a table in a memory within the server to determine information indicating operations to perform if the recall of the second file has failed; and
(g) notifying the server to retry the access operation on the first file after performing the operations.
28. An article of manufacture for use in programming a server to diagnose and handle errors in an automated hierarchical storage management system, the article of manufacture comprising a computer readable storage medium having computer objects embodied therein that cause the server to perform the steps of:
processing a request from a host system to access a first file in a first storage device;
determining whether the first file is resident in the first storage device;
initiating a recall of a second file in a second storage device corresponding to the first file upon determining that the first file is not resident in the first storage device;
requesting to copy the second file from the second storage device to the first file in the first storage device upon determining that the second file is accessible;
determining whether the recall of the second file in the second storage device has failed;
checking a table in a memory within the server to determine whether there is error information that indicates that there is a permanent read error in the second file if the recall of the second file has failed;
moving the first file to a third file in the first storage device;
initiating an operation to recover as much data as possible from the second file;
copying the data recovered from the second file to the first file in the first storage device; and
retrying the access operation on the first file.
Specification