Distributed computation recovery management system and method

US 5,325,528 A
Filed: 04/22/1993
Issued: 06/28/1994
Est. Priority Date: 04/22/1993
Status: Expired due to Term

First Claim

Patent Images

1. In a computer system having multiple application processes that interactively perform a distributed computation, the steps of the method comprising:

modeling said multiple application processes as finite state machines by storing in a computer memory in said computer system model data corresponding to each application process, said model data identifying a set of states, identifying some states of said application process as final states from which the corresponding application process is allowed to terminate and identifying other states as intermediate states from which the corresponding application process must not be allowed to terminate;

said stored model data for each application process further including state transition data identifying state transitions between identified states of said each application process as being enabled by receiving a message from another application process, by unreliably sending a message to a destination external of said each application process, and by reliably sending a message to a destination external of said each application process;

said computer system modifying said model data by selecting, in accordance with a set of predefined state transition modification criteria, ones of said state transitions enabled by unreliably sending a message, and changing said state transition data to indicate that selected state transitions are enabled by reliably sending said message;

said computer system further modifying said model data by converting ones of said intermediate states into final states, said intermediate states converted into final states being selected in accordance with a predefined set of state modification criteria; and

said computer system when executing each application process, recording on stable storage information identifying reliably sent messages and information identifying state transitions by said each application process, said identifying information being recorded in accordance with which states are identified as being intermediate states in said modified model data.

View all claims

3 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A protocol analysis system is provided with data specifying the defined states of processes participating in a distributed computation. State transitions between states are specified as being enabled by (A) receiving a message, (B) unreliably sending a message, or (C) performing an external action such as reliably sending a message. The specification data also identifies process states known to be final states, and all other states are initially denoted as intermediate states. The protocol analysis system determines if any intermediate states can be re-categorized as final states. Then it determines if any state transitions initially identified as unreliable send operations must be treated as derived external actions, and thus made reliable. Thirdly, for each derived external action, the states of the affected application process must be re-evaluated so as to determine if derived final states need to be converted into intermediate states. The resulting determinations as to which states are final states and which messages must be reliable sent are recorded and used to govern execution of the application process. When executing the application process, state transitions entering and leaving intermediate states are normally recorded on stable storage before the state transition is carried out and reliably sent messages are normally recorded on stable storage before being sent. A number of run-time journal optimization techniques reduce the number of state transitions and messages that need to be stored on stable storage.

Citations

5 Claims

1. In a computer system having multiple application processes that interactively perform a distributed computation, the steps of the method comprising:
- modeling said multiple application processes as finite state machines by storing in a computer memory in said computer system model data corresponding to each application process, said model data identifying a set of states, identifying some states of said application process as final states from which the corresponding application process is allowed to terminate and identifying other states as intermediate states from which the corresponding application process must not be allowed to terminate;
  
  said stored model data for each application process further including state transition data identifying state transitions between identified states of said each application process as being enabled by receiving a message from another application process, by unreliably sending a message to a destination external of said each application process, and by reliably sending a message to a destination external of said each application process;
  
  said computer system modifying said model data by selecting, in accordance with a set of predefined state transition modification criteria, ones of said state transitions enabled by unreliably sending a message, and changing said state transition data to indicate that selected state transitions are enabled by reliably sending said message;
  
  said computer system further modifying said model data by converting ones of said intermediate states into final states, said intermediate states converted into final states being selected in accordance with a predefined set of state modification criteria; and
  
  said computer system when executing each application process, recording on stable storage information identifying reliably sent messages and information identifying state transitions by said each application process, said identifying information being recorded in accordance with which states are identified as being intermediate states in said modified model data.
- View Dependent Claims (2, 3, 4, 5)
- - 2. The method of claim 1, said recording step including:
    - said computer system identifying, in accordance with a predefined set of journalling optimization criteria, states of each application process that can be asynchronously journalled and identifying other states of each application process that require synchronous journalling; and
      
      said computer system synchronously journalling said states identified as requiring synchronous journalling and asynchronously journalling said states identified as being capable of journalled asynchronously;
  - 3. The method of claim 2, said recording step further including:
    - said computer system identifying a sequence of more than two states, including identifying a first state and a last state in said sequence of states, that meet predefined journalling avoidance criteria;
      
      said computer system journalling only said identified first and last states of said sequence of states.
  - 4. The method of claim 3, said recording step further including:
    - said computer system, when said identified sequence of states includes at least one state transition enabled by reliably sending a message, delaying journalling of said last state until receipt of said reliably sent message has been acknowledged by its recipient.
  - 5. The method of claim 4, said recording step further including:
    - said computer system, when said identified sequence of states includes at least one state transition enabled by receiving a reliably sent message for another application process, delaying sending an acknowledgement of receiving said reliably sent message to said other application process until said last state has been journalled.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Hewlett-Packard Development Company, L.P. (HP Inc.)
Original Assignee
Digital Equipment Corporation (HP Inc.)
Inventors
Klein, Johannes
Primary Examiner(s)
Heckler, Thomas M.

Application Number

US08/051,483
Time in Patent Office

432 Days
Field of Search

395/650, 395/700, 364/DIG. 1 MS File
US Class Current

719/313
CPC Class Codes

G06F 11/1417 Boot up procedures

H04L 9/40 Network security protocols

Distributed computation recovery management system and method

First Claim

3 Assignments

0 Petitions

Accused Products

Abstract

Citations

5 Claims

Specification

Solutions

Use Cases

Quick Links

Distributed computation recovery management system and method

First Claim

3 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

5 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links