Job interrupt at predetermined boundary for enhanced recovery

US 4,847,749 A
Filed: 06/13/1986
Issued: 07/11/1989
Est. Priority Date: 06/13/1986
Status: Expired due to Fees

First Claim

Patent Images

1. A method of restarting a computer system in the event of a failure, the computer system running jobs and having directories relating to data, a main storage area, and at least one direct access storage device, the method comprising the steps of:

detecting the failure;

saving an image of main storage into a nonvolatile storage area in response to detection of the failure;

correcting the failure;

reloading the main storage image into said main storage after correction of the failure;

marking jobs for interruption at a predetermined system boundary; and

running jobs for a predetermined time to permit jobs to attain the predetermined system boundary such that directories are in a known state.

View all claims

1 Assignment

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A recovery mechanism restarts jobs following correction of a system failure and automatically marks the jobs for interruption at a logical boundary. The logical boundary is above logical file updating functions such that logical files are in a known state when jobs reach the boundary. When a system failure is detected which has not yet resulted in lost data, an image of working memory, including hardware status is saved on nonvolatile storage. After the failure has been resolved, the system is initially loaded with operating programs (IPL) and working memory is reloaded from the nonvolatile storage. All jobs which were reloaded are marked for interrupt at a machine instruction boundary, and processing is started. After all jobs have reached the boundary, or a predetermined time has elapsed, processing is stopped and the system is re-IPLed. There are few system index recoveries to be performed, since most jobs reached a point where logical files were synchronized with corresponding data.

111 Citations

15 Claims

1. A method of restarting a computer system in the event of a failure, the computer system running jobs and having directories relating to data, a main storage area, and at least one direct access storage device, the method comprising the steps of:
- detecting the failure;
  
  saving an image of main storage into a nonvolatile storage area in response to detection of the failure;
  
  correcting the failure;
  
  reloading the main storage image into said main storage after correction of the failure;
  
  marking jobs for interruption at a predetermined system boundary; and
  
  running jobs for a predetermined time to permit jobs to attain the predetermined system boundary such that directories are in a known state.

2. A method of restarting a computer system in the event of an undesirable condition, the computer system having logical files relating to data stored on a plurality of storage devices, and tasks and jobs running on the system from a main storage, the jobs having the capability to change logical files when running below a predetermined logical boundary, the method comprising the steps of:
- detecting the undesirable condition which has not yet caused a data loss;
  
  saving an image of main storage into a nonvolatile storage area;
  
  correcting the undesirable condition;
  
  reloading the main storage image into said main storage;
  
  marking jobs for interruption at the predetermined logical system boundary; and
  
  running jobs for a predetermined time to permit most jobs to attain the predetermined system boundary such that logical files are in a known state.
- View Dependent Claims (3, 4, 5, 6, 7)
- - 3. The method of claim 2 wherein a machine check task is in control of the system prior to the step of saving an image of main storage and wherein the machine check task prevents other tasks from gaining control of the system.
  - 4. The method of claim 3 wherein, the saved image of said main storage contains an indication of the job to begin operation with when said main storage is reloaded, and an indication of what point in the task to begin operating.
  - 5. The method of claim 2 wherein during the predetermined time, jobs having reached the boundary are so marked.
  - 6. The method of claim 5 wherein the jobs are logically chained together and periodically searched to determine if they are marked as having reached the boundary.
  - 7. The method of claim 6 wherein upon the search finding all jobs having been marked as reaching the boundary, the system is reset with initialization programming.

8. A computer system having data directories relating to data stored on said system, the system having a main working storage area which has a job queue from which jobs are selected for operation upon by the system, and at least one selected for operation upon by the system, and at least one nonvolatile storage device, the system being restartable following an undesirable system condition, the system comprising:
- means for interrupting the system from operating on the jobs;
  
  means responsive to the means for interrupting the system for saving an image of said main working storage including a representation of the status of the system with respect to the job the system is presently operating upon;
  
  means coupled to said main working storage for reloading the image of said main working storage following correction of the undesirable system condition;
  
  means coupled to said main working storage for marking jobs for interruption at a predetermined system boundary, above which data directories are not normally changed;
  
  means coupled to said main working storage for restarting system operation on jobs where the jobs were interrupted using the reloaded main working storage image; and
  
  means coupled to said main working storage for monitoring jobs running on the system to determine when the jobs have reached the predetermined system boundary such that directories are in a known state.

9. A computer system having data directories relating to data stored on said system, the system having a main working storage area which has a task queue from which tasks and jobs are selected for operation upon by the system, wherein jobs are tasks capable of changing directories, the system comprising:
- means coupled to said task queue for marking jobs in the queue for interruption at a predetermined system boundary, above which data directories are not normally changed; and
  
  means coupled to said task queue for monitoring jobs running on the system to determine when the jobs have reached the predetermined system boundary such that directories are in a known state.
- View Dependent Claims (10, 11, 12, 13, 14, 15)
- - 10. The computer system of claim 9 wherein tasks are linked by address information, forming a list, and each job on the task queue is represented by a task dispatching element comprising task identification information and linkage information.
  - 11. The computer system of claim 10 wherein each task dispatching element contains a boundary interruption flag which is set by the means for marking jobs if the task is a job to be interrupted at the boundary.
  - 12. The computer system of claim 11 and further comprising means coupled to the task dispatching queue for removing jobs from the task dispatching queue upon said jobs reaching the boundary.
  - 13. The computer system of claim 12 wherein the means for removing jobs sets a job reached boundary flag in the task dispatching element of each job reaching the boundary if the boundary interruption flag is set.
  - 14. The computer system of claim 13 wherein the means for monitoring jobs periodically searches through the linked list of tasks to determine fi the job reached boundary flag in the jobs in the linked list is set.
  - 15. The computer system of claim 14 wherein processing of tasks on the task dispatching queue continues when the means for monitoring jobs encounters a job in the linked list which does not have its job reached boundary flag set.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
International Business Machines Corporation
Original Assignee
International Business Machines Corporation
Inventors
Collins, Robert W., Dickes, Steven M., Larson, Carle J., Weinschenk, Russell J., Wottreng, Peter M., Effle, James S., Davidson, William S.
Primary Examiner(s)
Zache, Raulfe B.
Assistant Examiner(s)
Harrell, Robert B.

Application Number

US06/873,909
Time in Patent Office

1,124 Days
Field of Search

364/900 MS File, 364/200 MS File
US Class Current

714/6.12
CPC Class Codes

G06F 11/1435 using file system or storag...

Job interrupt at predetermined boundary for enhanced recovery

First Claim

1 Assignment

0 Petitions

Accused Products

Abstract

111 Citations

15 Claims

Specification

Use Cases

Quick Links

Others

Job interrupt at predetermined boundary for enhanced recovery

First Claim

1 Assignment

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

111 Citations

15 Claims

Specification

Subscription Required

Use Cases

Quick Links

Others