Self-evolving distributed system performance using a system health index

  • US 7,698,239 B2
  • Filed: 04/28/2006
  • Issued: 04/13/2010
  • Est. Priority Date: 03/25/2005
  • Status: Active Grant
First Claim

1. A method of improving the performance of a distributed computing system comprising:

  • determining an ideal system health index that is a quantifiable representation of a user-defined performance characteristic of the distributed system, the distributed system comprising a peer-to-peer network including a plurality of nodes, the ideal system health index including a plurality of system performance metrics collected from the plurality of nodes, each metric modified by one or more system variables;

    propagating one or more system performance metrics through an artificial intelligence system, the system performance metrics measured during operation of the distributed system, the propagating including;

    multiplying each system performance metric by one or more first weights,

    processing each weighted system performance metric by one or more functions within one or more hidden layers of the artificial intelligence system, each function accounting for the dependence of each system performance metric on one or more other system performance metrics, and

    converting each weighted and processed system performance metric to a plurality of improved system variables by multiplying each weighted and processed system performance metric by one or more second weights;

    modifying one or more system variables with the plurality of improved system variables;

    determining a system health index by aggregating the system performance metrics after modifying one or more system variables with the plurality of improved system variables;

    comparing the system health index to the ideal system health index to derive a health index error value; and

    when the health index error value is above a threshold;

    identifying a slow or failing node of the plurality of nodes in the peer-to-peer network based on the system health index,

    correcting the plurality of improved distributed system variables of the slow or failing node by applying the health index error value to the plurality of improved distributed system variables of the slow or failing node;

    if the health index error value is below the threshold, saving the improved distributed system variables,

    modifying the artificial intelligence system by applying a learning algorithm to the artificial intelligence system to reduce the health index error value, the learning algorithm using the health index error value,

    re-propagating the system performance metrics through the modified artificial intelligence system to derive a second plurality of improved system variables, and

    applying the second plurality of improved system variables to at least the slow or failing node.
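
The steps recited in the claim can be illustrated with a minimal sketch. It is not the patented implementation, and the claim does not specify any of the choices made here. It assumes a single hidden layer with `tanh` as the hidden-layer function, a weighted mean as the aggregation that yields the health index, an absolute difference as the health index error value, and the lowest per-node index as the test for a slow or failing node. The names `propagate`, `health_index`, and `evaluate` are hypothetical. The learning step that modifies the weights is left out because the claim does not say which learning algorithm is used.

```python
import math
from typing import Dict, List, Optional, Tuple


def propagate(metrics: List[float],
              first_weights: List[List[float]],
              second_weights: List[List[float]]) -> List[float]:
    """Forward pass: metrics -> one hidden layer -> improved system variables."""
    # Each hidden unit sums the metrics scaled by first weights, so one unit
    # can mix several metrics. tanh is an assumed hidden-layer function.
    hidden = [math.tanh(sum(w * m for w, m in zip(row, metrics)))
              for row in first_weights]
    # Each output scales the hidden activations by second weights.
    return [sum(w * h for w, h in zip(row, hidden)) for row in second_weights]


def health_index(metrics: List[float], metric_weights: List[float]) -> float:
    """Aggregate metrics into one number (weighted mean, an assumption)."""
    total = sum(w * m for w, m in zip(metric_weights, metrics))
    return total / sum(metric_weights)


def evaluate(node_metrics: Dict[str, List[float]],
             metric_weights: List[float],
             ideal_index: float,
             threshold: float) -> Tuple[float, float, Optional[str]]:
    """Return (system index, error value, slow-or-failing node or None)."""
    per_node = {name: health_index(m, metric_weights)
                for name, m in node_metrics.items()}
    # System-wide index: plain mean of the per-node indices (an assumption).
    system_index = sum(per_node.values()) / len(per_node)
    error = abs(ideal_index - system_index)
    # Only flag a node when the error exceeds the threshold; the node with
    # the lowest index is treated as the slow or failing one.
    worst = None
    if error > threshold:
        worst = min(per_node, key=lambda name: per_node[name])
    return system_index, error, worst
```

Calling `evaluate` with per-node metric lists, an ideal index, and a threshold returns the node to correct, or `None` when the error is within the threshold and the improved variables could simply be saved.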

  • 2 Assignments