System and method for error handling and recovery
Abstract
A system and method for error handling and recovery in a content distribution system is described in which errors corresponding to failed file operations (e.g., file transfer errors, file delete errors) are placed in an error queue. Error analysis logic reads the errors from the error queue and makes a determination as to whether the file operation errors are recoverable errors based on an error recovery policy. If the error analysis logic determines that recovery is possible, then one or more error recovery procedures are invoked. The procedures may be specific to the content delivery system (e.g., “Server X was down on 1/20 between 10:20 and 11:00 AM”), or may be more general (e.g., “attempt file transfers 5 times before quitting”). If it is determined that an error is not automatically recoverable, then the error is included in an error report.
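The flow described in the abstract can be sketched roughly as follows. This is only an illustrative sketch, not the patented implementation: `FileOpError`, `RecoveryPolicy`, `process_queue`, and the `retry` callback are hypothetical names, and the only policy rule modeled is the general "attempt file transfers 5 times before quitting" example.

```python
from collections import deque
from dataclasses import dataclass


@dataclass
class FileOpError:
    # Hypothetical record: a file operation portion plus an error code portion.
    operation: str      # e.g. "transfer" or "delete"
    path: str
    server: str
    error_code: int
    attempts: int = 1   # how many times the operation has been tried so far


@dataclass
class RecoveryPolicy:
    # Hypothetical general rule: give up once max_attempts tries have been made.
    max_attempts: int = 5

    def is_recoverable(self, err: FileOpError) -> bool:
        return err.attempts < self.max_attempts


def process_queue(queue, policy, retry):
    """Make one pass over the error queue; return the errors for the report."""
    report = []
    for _ in range(len(queue)):
        err = queue.popleft()
        if not policy.is_recoverable(err):
            report.append(err)   # not automatically recoverable
            continue
        err.attempts += 1
        if not retry(err):
            queue.append(err)    # still failing: keep it for the next pass
    return report
```

Calling `process_queue` on a timer would correspond to reading the queue periodically; `retry` is assumed to return `True` when the reattempted file operation succeeds.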
12 Claims
1. An error recovery method comprising:
logging one or more file operation errors in an error queue in a content distribution network, said file operation errors including a file operation portion and an error code portion;
periodically reading said file operation errors from said error queue;
determining whether automatic error recovery is possible based on an error recovery policy;
performing an automated error recovery procedure if error recovery is possible;
wherein said error recovery policy includes information as to when specified portions of said network were inoperative, information as to when particular file servers were inoperative; and
information as to whether a file associated with said file operation error was not available on a specified source server.
2. The method as in claim 1 further comprising:
generating a report if error recovery is not possible.
3. The method as in claim 1 wherein said file operation errors comprise file transfer errors.
4. The method as in claim 1 wherein said file operation errors are file delete errors.
5. The method as in claim 1 wherein one of said error recovery procedures comprises:
reattempting file operations corresponding to said file operation errors if said file operations were previously attempted a number of times less than a predetermined threshold value.
6. The method as in claim 1 wherein one of said error recovery procedures comprises:
determining whether a group of said file operation errors have identical error causes over a finite period of time; and
reattempting file operations corresponding to said group of file operation errors.
7. An article of manufacture including a sequence of instructions which, when executed on a processor, cause the processor to:
log one or more file operation errors in an error queue in a content distribution network, said file operation errors including a file operation portion and an error code portion;
read said file operation errors from said error queue;
determine whether automatic error recovery is possible based on an error recovery policy; and
perform an automated error recovery procedure if error recovery is possible, wherein said error recovery policy includes information as to whether a file associated with said file operation error was not available on a specified source server.
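8. The article of manufacture as in claim 7 including further instructions which cause said processor to:
generate a report if error recovery is not possible.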
9. The article of manufacture as in claim 7 wherein said error recovery policy includes information as to when specified portions of said network were inoperative.
10. The article of manufacture as in claim 7 wherein said error recovery policy includes information as to when particular file servers were inoperative.
11. The article of manufacture as in claim 7 including further instructions defining an error recovery procedure which cause said processor to:
reattempt file operations corresponding to said file operation errors if said file operations were previously attempted a number of times less than a predetermined threshold value.
12. The article of manufacture as in claim 7 including further instructions defining an error recovery procedure which cause said processor to:
determine whether a group of said file operation errors have identical error causes over a finite period of time; and
reattempt file operations corresponding to said group of file operation errors.
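Two of the policy checks recited in the claims, known server downtime and groups of errors with identical causes over a finite period, might look like the sketch below. It rests on stated assumptions rather than on the specification: `Outage`, `explained_by_outage`, and `groups_with_same_cause` are hypothetical names, an identical error code stands in for an "identical error cause", and `min_size` is an invented parameter.

```python
from collections import defaultdict
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass(frozen=True)
class Outage:
    # Hypothetical policy entry: a server known to be down during [start, end].
    server: str
    start: datetime
    end: datetime


def explained_by_outage(server, failed_at, outages):
    """True if the failure falls inside a recorded outage for that server."""
    return any(
        o.server == server and o.start <= failed_at <= o.end for o in outages
    )


def groups_with_same_cause(errors, window, min_size=2):
    """Group (error_code, failed_at, path) tuples by error code.

    Assumption: the same error code is treated as the same cause. A group is
    kept only if it has at least min_size members and all of its timestamps
    fall within `window` of each other.
    """
    by_code = defaultdict(list)
    for code, failed_at, path in errors:
        by_code[code].append((failed_at, path))
    result = {}
    for code, items in by_code.items():
        times = [t for t, _ in items]
        if len(items) >= min_size and max(times) - min(times) <= window:
            # Paths are returned in order of failure time.
            result[code] = [p for _, p in sorted(items)]
    return result
```

An error that `explained_by_outage` attributes to a recorded outage, or that belongs to a group returned by `groups_with_same_cause`, would be a candidate for reattempting the file operation rather than for the error report.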
Specification