System and method of data mining
First Claim
Patent Images
1. A method for formulating a set of rules representing a situation, comprising:
- finding within a collection of data related to said situation a representative collection of data comprising attribute patterns and associated conclusions;
forming said set of rules by;
a) comparing a selected attribute pattern to all other attribute patterns associated with conclusions different than that of said selected attribute pattern in said representative collection to match irrelevant attribute elements between said selected attribute pattern and said compared attribute patterns;
b) removing said irrelevant attribute elements from said selected attribute pattern; and
repeating a) and b) for each attribute pattern in said representative collection.
1 Assignment
0 Petitions
Accused Products
Abstract
A system and method for mining data for intelligent patterns uses logical techniques to isolate unique patterns and reduce the patterns to a consistent set of rules representing the data. The system and method eliminates attributes in the patterns that do not contribute to an associated conclusion and are deemed irrelevant. This approach does not splinter significant patterns within the data, as may occur with statistical approaches. In addition, the system and method identifies areas of incomplete data that are not recognized in other methods.
-
Citations
30 Claims
-
1. A method for formulating a set of rules representing a situation, comprising:
-
finding within a collection of data related to said situation a representative collection of data comprising attribute patterns and associated conclusions;
forming said set of rules by;
a) comparing a selected attribute pattern to all other attribute patterns associated with conclusions different than that of said selected attribute pattern in said representative collection to match irrelevant attribute elements between said selected attribute pattern and said compared attribute patterns;
b) removing said irrelevant attribute elements from said selected attribute pattern; and
repeating a) and b) for each attribute pattern in said representative collection. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A system for formulating a set of rules representing a situation, comprising:
-
a storage media containing a set of data records related to said situation;
each of said data records includes an attribute pattern and an associated conclusion;
a processor operable to manipulate said set of data records to form a representative collection of attribute patterns and associated conclusions storable on said storage media;
said processor being further operable to manipulate said representative collection to remove attribute elements from each of said attribute patterns that are irrelevant to said associated conclusions to form a set of rules storable on said storage media; and
said processor is further operable to remove redundant ones of said rules from said set of rules to provide a complete and consistent rule set. - View Dependent Claims (11, 12, 13, 14, 15, 17, 20, 21, 22, 23, 27, 29, 30)
-
-
16. A computer readable memory storing a program code executable to form a set of rules representing a situation, said program code comprising:
-
a first code section executable to find within a collection of data related to said situation a representative collection of data comprising attribute patterns and associated conclusions;
a second code section executable to compare a selected attribute pattern to all other attribute patterns associated with conclusions different than that of said selected attribute pattern in said representative collection to match irrelevant attribute elements between said selected attribute pattern and said compared attribute patterns;
a third code section executable to remove said irrelevant attribute elements from said selected attribute pattern; and
a fourth code section containing logic executable to repeat said second and third code sections for each attribute pattern to form a set of rules.
-
-
18. A network of interconnected computers storing program code and data for forming a set of rules representing a situation, comprising:
-
a storage media coupled to said network and containing a set of data records related to said situation;
each of said data records includes an attribute pattern and an associated conclusion;
a processor coupled to said network and operable to manipulate said set of data records to form a representative collection of attribute patterns and associated conclusions storable on said storage media;
said processor being further operable to manipulate said representative collection to remove attribute elements from each of said attribute patterns that are irrelevant to said associated conclusions to form a set of rules storable on said storage media; and
said processor is further operable to remove redundant ones of said rules from said set of rules to provide a complete and consistent rule set.
-
-
19. A method for forming a set of rules representing a situation, comprising:
-
finding all non-redundant fact patterns related to said situation in a data set;
identifying at least one attribute in each fact pattern that contributes to a respective conclusion associated with said fact pattern; and
forming said set of rules using said identified attributes and said respective associated conclusions.
-
-
24. A carrier medium containing a program code executable to form a set of rules representing a situation, said program code comprising:
-
a first code section executable to find within a collection of data related to said situation a representative collection of data comprising attribute patterns and associated conclusions;
a second code section executable to compare a selected attribute pattern to all other attribute patterns associated with conclusions different than that of said selected attribute pattern in said representative collection to match irrelevant attribute elements between said selected attribute pattern and said compared attribute patterns;
a third code section executable to remove said irrelevant attribute elements from said selected attribute pattern; and
a fourth code section containing logic executable to repeat said second and third code sections for each attribute pattern to form a set of rules.
-
-
25. A processor operable to execute a program code from a storage memory, said program code comprising:
-
a first code section executable to find, within a collection of data related to a situation, a representative collection of data comprising attribute patterns and associated conclusions;
a second code section executable to compare a selected attribute pattern to all other attribute patterns associated with conclusions different than that of said selected attribute pattern in said representative collection to match irrelevant attribute elements between said selected attribute pattern and said compared attribute patterns;
a third code section executable to remove said irrelevant attribute elements from said selected attribute pattern;
a fourth code section containing logic executable to repeat said second and third code sections for each attribute pattern to form a set of rules; and
a fifth code section executable to remove redundant rules from said set of rules.
-
-
26. A method for formulating a set of rules representing a situation, comprising:
-
obtaining a set of data records related to said situation, each data record containing a set of attributes and an associated conclusion;
forming a first set of mutually exclusive attribute patterns from said data records, each attribute pattern being associated with a respective conclusion group containing at least one conclusion;
maintaining a count of data records associated each conclusion in each respective conclusion group;
forming a second set of attribute patterns from said first set, each attribute pattern in said second set being associated with a preferred conclusion chosen from said respective associated conclusion group, said attribute patterns in said second set containing attributes relevant to said situation, said second set of attribute patterns being formed by;
a) creating in said second set a copy of a selected attribute pattern with an associated preferred conclusion from said first set;
b) comparing said copied selected attribute pattern to all other attribute patterns in said first set having associated preferred conclusions different from said associated preferred conclusion of said copied selected attribute pattern thereby identifying any attributes of said copied selected attribute pattern that are irrelevant to said situation;
c) removing said irrelevant attributes from said copied selected attribute pattern in said second set; and
repeating a), b) and c) for each attribute pattern in said first set to form said second set of attribute patterns comprising said set of rules.
-
-
28. A method for formulating a set of rules representing a situation, comprising:
-
obtaining a set of data records, each data record containing a set of attributes forming an attribute pattern and an associated conclusion;
forming from said set of data records a first set of mutually exclusive attribute patterns each associated with a conclusion group containing at least one conclusion, said first set of attribute patterns being formed by;
a) placing a copy of an initial attribute pattern and an initial associated conclusion from an initial data record into said first set of attribute patterns, said initial associated conclusion being placed in a conclusion group in said first set of attribute patterns, and initializing a first conclusion count for said initial associated conclusion placed in said first conclusion group;
b) reading an attribute pattern and an associated conclusion from a selected data record;
c) comparing said read attribute pattern to all attribute patterns of said first set of attribute patterns;
d) if said read attribute pattern matches none of said first set of attribute patterns, adding said read attribute pattern and said read associated conclusion from said selected data record into said first set of attribute patterns, said read associated conclusion being placed in another conclusion group associated with said read attribute pattern added to said first set of attribute patterns, and initializing another conclusion count for said read associated conclusion in said another associated conclusion group;
e) if a match between said read attribute pattern and said first set of attribute patterns is found and if said read associated conclusion is already in a conclusion group associated with said matched attribute pattern in said first set of attribute patterns, incrementing a conclusion count for said read associated conclusion in said conclusion group associated with said matched attribute pattern, and if said read associated conclusion is not already in said conclusion group associated with said matched attribute pattern, adding said read associated conclusion to said conclusion group associated with said matched attribute pattern and initializing a conclusion count for said added read associated conclusion;
f) selecting another data record and reading an attribute pattern and an associated conclusion from said selected data record; and
repeating c) through f) until all attribute patterns for said set of data records are exhausted;
selecting a representative conclusion from each of said conclusion groups as a preferred conclusion based on criteria including said conclusion counts;
forming a second set of attribute patterns, each associated with respective preferred conclusions, said attribute patterns in said second set containing attributes relative to said situation, said second set of attribute patterns being formed by;
g) placing a copy of a selected attribute pattern and said associated preferred conclusion from said first set of attribute patterns into said second set of attribute patterns and comparing said copied selected attribute pattern to all other attribute patterns in said first set of attribute patterns having associated preferred conclusions different from said associated preferred conclusion of said copied selected attribute pattern thereby identifying any attributes of said copied selected attribute pattern that are irrelevant to said situation;
h) removing said irrelevant attributes from said copied selected attribute pattern in said second set; and
repeating g) and h) for each attribute pattern in said first set of attribute patterns to form said second set of attribute patterns, said second set of attribute patterns and associated preferred conclusions forming said set of rules.
-
Specification