Automatically identifying an optimal set of attributes to facilitate generating best practices for configuring a networked system
First Claim
1. A computer-implemented method of automatically identifying an optimal set of attributes of entities to facilitate generating best practices for configuring a networked system, comprising:
ranking, by a computing system and based on a plurality of information gain values, a plurality of entity types of a plurality of entities included in said networked system;
determining, by said computing system and subsequent to said ranking, a first classification accuracy relative to a first entity type, wherein said first entity type is a highest ranked entity type of said plurality of entity types based on said ranking or is a first aggregate entity type associated with two or more entity types of said plurality of entity types;
selecting, by said computing system and subsequent to said determining said first measurement, a second entity type of said plurality of entity types, wherein said selecting is based on said ranking;
performing, by said computing system, a database join operation on a first set of one or more attributes of one or more entities of said first entity type and a second set of one or more attributes of one or more entities of said second entity type, wherein a result of said performing is a second aggregate entity type;
determining, by said computing system, a second classification accuracy relative to said second aggregate entity type;
determining, by said computing system, that said second classification accuracy is less than or equal to said first classification accuracy;
identifying, by said computing system and in response to said determining that said second measurement is less than or equal to said first measurement, an optimal set of one or more attributes as said first set of one or more attributes, wherein said optimal set contributes to a problem associated with said networked system; and
storing said optimal set in a data repository coupled to said computing system.
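The steps recited in claim 1 can be read as a greedy loop: start from the top-ranked entity type, join in the next-ranked type, and stop as soon as the join fails to improve classification accuracy. The following is a minimal sketch under stated assumptions, not the patented implementation. The table layout (one row per host, keyed by host name), the names `join`, `accuracy`, and `select_attributes`, the sample data, and the use of a majority-vote lookup classifier scored on its own training data are all hypothetical choices made for illustration; the claim does not specify a classifier or a join key. The ranking is taken as an input here.

```python
from collections import Counter, defaultdict

def join(left, right):
    """Join two tables (key -> attribute dict) on their shared key (assumed: host name)."""
    return {k: {**left[k], **right[k]} for k in left.keys() & right.keys()}

def accuracy(table, labels):
    """Accuracy of a majority-vote lookup classifier, scored on the same rows.

    This is a stand-in for whatever classifier a real system would use.
    """
    if not table:
        return 0.0
    groups = defaultdict(Counter)
    for key, attrs in table.items():
        groups[tuple(sorted(attrs.items()))][labels[key]] += 1
    correct = sum(max(counts.values()) for counts in groups.values())
    return correct / len(table)

def select_attributes(ranked_types, tables, labels):
    """Greedy selection: keep joining ranked entity types while accuracy improves."""
    current = tables[ranked_types[0]]          # first entity type
    best = accuracy(current, labels)           # first classification accuracy
    for name in ranked_types[1:]:              # second entity type, by rank
        candidate = join(current, tables[name])    # second aggregate entity type
        score = accuracy(candidate, labels)        # second classification accuracy
        if score <= best:                      # no improvement: keep the first set
            break
        current, best = candidate, score       # the aggregate becomes the first type
    attrs = sorted({a for row in current.values() for a in row})
    return attrs, best

# Hypothetical sample data: six hosts, three of which exhibit the problem.
labels = {"h1": "problem", "h2": "problem", "h3": "problem",
          "h4": "ok", "h5": "ok", "h6": "ok"}
tables = {
    "hba":    {h: {"hba_fw": v} for h, v in zip(labels, ["1.0"] * 4 + ["2.0"] * 2)},
    "switch": {h: {"sw_os": v} for h, v in zip(labels, ["a", "a", "a", "b", "a", "a"])},
    "disk":   {h: {"disk_model": "x"} for h in labels},
}
optimal, score = select_attributes(["hba", "switch", "disk"], tables, labels)
```

In this sample, joining `switch` onto `hba` raises accuracy from 5/6 to 1.0, while joining `disk` adds nothing, so the loop stops and reports `hba_fw` and `sw_os`. A real system would then persist `optimal` to its data repository, as the final step of the claim recites.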
Abstract
A method and system for automatically identifying an optimal set of attributes of entities included in a networked system. Entity types are ranked based on information gain. A first classification accuracy relative to a first entity type is determined. The first entity type is the top-ranked entity type or a first aggregate entity type. A second entity type is selected based on the ranking. A database join of a first set of attributes associated with the first entity type and a second set of attributes associated with the second entity type is performed. A second classification accuracy relative to a second aggregate entity type generated by the join is determined. In response to determining that the second classification accuracy is not greater than the first classification accuracy, an optimal set of attributes contributing to a problem in the networked system is identified as the first set of attributes.
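The ranking step relies on information gain, which in its standard form is the reduction in label entropy obtained by splitting on an attribute: IG(Y; X) = H(Y) - sum over v of p(X = v) * H(Y | X = v). The sketch below illustrates that standard definition only. How a whole entity type with several attributes is scored is not stated in the text above, so the sketch assumes one attribute column per entity type; the function names and sample values are hypothetical.

```python
import math
from collections import Counter, defaultdict

def entropy(labels):
    """Shannon entropy, in bits, of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total) for n in Counter(labels).values())

def information_gain(values, labels):
    """Label entropy minus the weighted entropy remaining after splitting on values."""
    by_value = defaultdict(list)
    for value, label in zip(values, labels):
        by_value[value].append(label)
    remainder = sum(len(ys) / len(labels) * entropy(ys) for ys in by_value.values())
    return entropy(labels) - remainder

def rank_entity_types(columns, labels):
    """Order entity types from highest to lowest information gain.

    columns maps an entity type name to one attribute value per labeled system
    (assumption: a single attribute per entity type).
    """
    gains = {name: information_gain(vals, labels) for name, vals in columns.items()}
    return sorted(gains, key=gains.get, reverse=True)

# Hypothetical sample: six systems, three with the problem and three without.
labels = ["problem"] * 3 + ["ok"] * 3
columns = {
    "disk":   ["x"] * 6,                        # constant, so it carries no information
    "switch": ["a", "a", "a", "b", "a", "a"],
    "hba":    ["1.0"] * 4 + ["2.0"] * 2,
}
ranking = rank_entity_types(columns, labels)
```

Here `hba` separates the two classes best, `switch` helps a little, and `disk` not at all, so the ranking places them in that order.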
18 Citations
20 Claims
1. A computer-implemented method of automatically identifying an optimal set of attributes of entities to facilitate generating best practices for configuring a networked system, comprising:
ranking, by a computing system and based on a plurality of information gain values, a plurality of entity types of a plurality of entities included in said networked system;
determining, by said computing system and subsequent to said ranking, a first classification accuracy relative to a first entity type, wherein said first entity type is a highest ranked entity type of said plurality of entity types based on said ranking or is a first aggregate entity type associated with two or more entity types of said plurality of entity types;
selecting, by said computing system and subsequent to said determining said first measurement, a second entity type of said plurality of entity types, wherein said selecting is based on said ranking;
performing, by said computing system, a database join operation on a first set of one or more attributes of one or more entities of said first entity type and a second set of one or more attributes of one or more entities of said second entity type, wherein a result of said performing is a second aggregate entity type;
determining, by said computing system, a second classification accuracy relative to said second aggregate entity type;
determining, by said computing system, that said second classification accuracy is less than or equal to said first classification accuracy;
identifying, by said computing system and in response to said determining that said second measurement is less than or equal to said first measurement, an optimal set of one or more attributes as said first set of one or more attributes, wherein said optimal set contributes to a problem associated with said networked system; and
storing said optimal set in a data repository coupled to said computing system.
Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
11. A computing system comprising a processor and a computer-readable memory unit coupled to said processor, said memory unit containing instructions that when executed by said processor implement a method of automatically identifying an optimal set of attributes of entities to facilitate generating best practices for configuring a networked system, wherein said method comprises:
ranking, based on a plurality of information gain values, a plurality of entity types of a plurality of entities included in said networked system;
determining, subsequent to said ranking, a first classification accuracy relative to a first entity type, wherein said first entity type is a highest ranked entity type of said plurality of entity types based on said ranking or is a first aggregate entity type associated with two or more entity types of said plurality of entity types;
selecting, subsequent to said determining said first measurement, a second entity type of said plurality of entity types, wherein said selecting is based on said ranking;
performing a database join operation on a first set of one or more attributes of one or more entities of said first entity type and a second set of one or more attributes of one or more entities of said second entity type, wherein a result of said performing is a second aggregate entity type;
determining a second classification accuracy relative to said second aggregate entity type;
determining that said second classification accuracy is less than or equal to said first classification accuracy;
identifying, in response to said determining that said second measurement is less than or equal to said first measurement, an optimal set of one or more attributes as said first set of one or more attributes, wherein said optimal set contributes to a problem associated with said networked system; and
storing said optimal set in a data repository coupled to said computing system.
Dependent Claims (12, 13, 14, 15)
16. A computer program product, comprising a computer non-transitory storage medium having a computer readable program code embodied therein, said computer readable program code containing instructions that when executed by a processor of a computing system implement a method of automatically identifying an optimal set of attributes of entities to facilitate generating best practices for configuring a networked system, said method comprising:
ranking, based on a plurality of information gain values, a plurality of entity types of a plurality of entities included in said networked system;
determining, subsequent to said ranking, a first classification accuracy relative to a first entity type, wherein said first entity type is a highest ranked entity type of said plurality of entity types based on said ranking or is a first aggregate entity type associated with two or more entity types of said plurality of entity types;
selecting, subsequent to said determining said first measurement, a second entity type of said plurality of entity types, wherein said selecting is based on said ranking;
performing a database join operation on a first set of one or more attributes of one or more entities of said first entity type and a second set of one or more attributes of one or more entities of said second entity type, wherein a result of said performing is a second aggregate entity type;
determining a second classification accuracy relative to said second aggregate entity type;
determining that said second classification accuracy is less than or equal to said first classification accuracy;
identifying, in response to said determining that said second measurement is less than or equal to said first measurement, an optimal set of one or more attributes as said first set of one or more attributes, wherein said optimal set contributes to a problem associated with said networked system; and
storing said optimal set in a data repository coupled to said computing system.
Dependent Claims (17, 18, 19, 20)
Specification