×

Automatically identifying an optimal set of attributes to facilitate generating best practices for configuring a networked system

  • US 7,685,277 B2
  • Filed: 01/21/2008
  • Issued: 03/23/2010
  • Est. Priority Date: 01/21/2008
  • Status: Expired due to Fees
First Claim
Patent Images

1. A computer-implemented method of automatically identifying an optimal set of attributes of entities to facilitate generating best practices for configuring a networked system, comprising:

  • ranking, by a computing system and based on a plurality of information gain values, a plurality of entity types of a plurality of entities included in said networked system, wherein said ranking said plurality of entity types comprises;

    determining an information entropy value H(T) as

    j =

    1
    n


    p

    ( j )


    log 2

    ( 1 p

    ( j )
    )
    ,
    wherein T is an entity type of said plurality of entity types, wherein p(j) is a probability of an occurrence of an outcome j of T, and wherein n is a total number of possible outcomes of T,identifying a set of entities that report a problem, wherein said set of entities is included in said plurality of entities,determining, subsequent to said identifying said set of entities, P true data paths that are internal to said set of entities, wherein Pi true data paths of said P true data paths are included in an i-th partition of k partitions of a configuration of said networked system, and wherein k>

    1,determining, subsequent to said identifying said set of entities, N false data paths that are external to said set of entities, wherein Ni false data paths of said N false data paths are included in said i-th partition of said configuration,determining an entropy value Entropy(Pi,Ni) as - ( P i P i +

    N i
    ×



    log 2

    P i P i +

    N i
    + N i P i +

    N i
    ×

    log 2

    N i P i +

    N i
    )
    ,
    determining an information gain value Gain(T) of said plurality of information gain values as H

    ( T )
    -

    i =

    1
    k


    P i +

    N i
    P

    +

    N
    ×



    Entropy

    ( P i , N i )
    ,
    wherein Gain(T) indicates an information gain of said entity type T, andordering said entity type T relative to one or more other entity types of said plurality of entity types based on Gain(T);

    determining, by said computing system and subsequent to said ranking, a first classification accuracy relative to a first entity type, wherein said first entity type is a highest ranked entity type of said plurality of entity types based on said ranking or is a first aggregate entity type associated with two or more entity types of said plurality of entity types;

    selecting, by said computing system and subsequent to said determining said first measurement, a second entity type of said plurality of entity types, wherein said selecting is based on said ranking;

    performing, by said computing system, a database join operation on a first set of one or more attributes of one or more entities of said first entity type and a second set of one or more attributes of one or more entities of said second entity type, wherein a result of said performing is a second aggregate entity type;

    determining, by said computing system, second classification accuracy relative to said second aggregate entity type;

    determining, by said computing system, that said second classification accuracy is less than or equal to said first classification accuracy;

    identifying, by said computing system and in response to said determining that said second measurement is less than or equal to said first measurement, an optimal set of one or more attributes as said first set of one or more attributes, wherein said optimal set contributes to a problem associated with said networked system;

    storing said optimal set of one or more attributes in a data repository coupled to said computing system;

    generating a set of best practices for configuring said networked system to avoid said problem; and

    following said set of best practices by a configuration of said networked system, wherein a result of said following said set of best practices is said configuration excluding any entity having said optimal set of one or more attributes.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×