Data analyzing method and system
Abstract
A data analyzing method and system for generating a rule based on data items in a data base, wherein the rule expresses relational features of the data items. The invention includes a user interface and a rule generation module. In response to an input from the user via the user interface, the rule generation module selects data items for use in a conditional clause and a conclusion clause of a rule from the data items stored in the data base. When the selected data items have numerical values, it converts the numerical values into symbolic values. It then creates plural candidate rules, each expressing a correlation between selected data items in a rule form having one or plural sets of item names and symbolic values. The rule generation module further calculates a criterion for evaluating the strength of correlation between data items in each of the candidate rules, determines the one or plural candidate rules having the highest calculated criterion, and outputs those candidate rules to the user via the user interface.
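The abstract states that numerical values are converted into symbolic values but does not say how. The following is a minimal sketch of one possible conversion, assuming equal-width binning; the function name `to_symbolic`, the labels `low`/`medium`/`high`, and the sample temperatures are hypothetical and do not come from the patent.

```python
from typing import List, Sequence


def to_symbolic(values: Sequence[float],
                labels: Sequence[str] = ("low", "medium", "high")) -> List[str]:
    """Map each numerical value to a symbolic label using equal-width bins.

    Equal-width binning is an assumption made for illustration only; the
    source says just that numerical values become symbolic values.
    """
    lo, hi = min(values), max(values)
    if hi == lo:
        # All values are identical, so every value falls in the first bin.
        return [labels[0]] * len(values)
    width = (hi - lo) / len(labels)
    result = []
    for v in values:
        # Clamp the index so that the maximum value lands in the last bin.
        idx = min(int((v - lo) / width), len(labels) - 1)
        result.append(labels[idx])
    return result


# Hypothetical numerical data item.
temperatures = [12.0, 18.5, 25.0, 31.0, 40.0]
symbols = to_symbolic(temperatures)
```

After this step every data item holds symbolic values only, so item name and symbolic value pairs can be formed uniformly for numerical and non-numerical items.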
44 Claims
1. A data analyzing method, utilized in an information processing system including a user interface and a rule generation module, of generating a rule based on a plurality of data items stored in a data base, said rule expressing relational features of said data items, said method comprising the steps, executed by said information processing system, of:
selecting data items for use in an IF clause and a THEN clause of an IF, THEN rule from said data items stored in said data base;
when said selected data items have numerical values, converting said numerical values into symbolic values;
creating plural candidate rules in said rule generation module, each rule expressing a correlation between selected data items in a rule form having one or plural sets of item names and symbolic values;
calculating a criterion for evaluating strength of correlation between data items in each of said candidate rules;
determining one or plural candidate rules having highest calculated criterion from said candidate rules; and
outputting said one or plural candidate rules having the highest calculated criterion.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11
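The creating, calculating, and determining steps of claim 1 can be sketched as follows. This is an illustration under stated assumptions, not the patented method: the claim does not define the criterion, so the sketch uses leverage, P(IF and THEN) - P(IF) * P(THEN), as a stand-in measure of correlation strength. The names `leverage` and `best_rules` and the sample records are hypothetical.

```python
import math
from itertools import product
from typing import Dict, List, Tuple

Record = Dict[str, str]
Pair = Tuple[str, str]          # (item name, symbolic value)
Rule = Tuple[Pair, Pair]        # (IF pair, THEN pair)


def leverage(records: List[Record], rule: Rule) -> float:
    """Assumed criterion: P(IF and THEN) - P(IF) * P(THEN)."""
    (if_item, if_val), (then_item, then_val) = rule
    n = len(records)
    p_if = sum(r[if_item] == if_val for r in records) / n
    p_then = sum(r[then_item] == then_val for r in records) / n
    p_both = sum(r[if_item] == if_val and r[then_item] == then_val
                 for r in records) / n
    return p_both - p_if * p_then


def best_rules(records: List[Record], if_items: List[str],
               then_item: str) -> List[Rule]:
    """Create candidate rules, score each, and keep those with the top score."""
    then_values = sorted({r[then_item] for r in records})
    candidates: List[Rule] = []
    for if_item in if_items:
        if_values = sorted({r[if_item] for r in records})
        for if_val, then_val in product(if_values, then_values):
            candidates.append(((if_item, if_val), (then_item, then_val)))
    scored = [(leverage(records, c), c) for c in candidates]
    top = max(score for score, _ in scored)
    # Ties are kept, so one or several rules may be returned.
    return [rule for score, rule in scored if math.isclose(score, top)]


# Hypothetical records whose values are already symbolic.
records = [
    {"temperature": "high", "load": "high", "fault": "yes"},
    {"temperature": "high", "load": "low", "fault": "yes"},
    {"temperature": "high", "load": "high", "fault": "yes"},
    {"temperature": "low", "load": "high", "fault": "no"},
    {"temperature": "low", "load": "low", "fault": "no"},
    {"temperature": "low", "load": "high", "fault": "no"},
]
best = best_rules(records, ["temperature", "load"], "fault")
```

With this sample data two rules tie for the highest score (IF temperature = high THEN fault = yes, and IF temperature = low THEN fault = no), which matches the claim's wording of "one or plural candidate rules".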
12. A data analyzing method, utilized in an information processing system including a user interface and a rule generation module, of generating a rule based on a plurality of data items stored in a data base, said rule expressing relational features of said data items, said method comprising the steps, executed by said information processing system, of:
selecting one conclusion item as a data item for use in a conclusion clause of a rule from said data items stored in said data base;
selecting condition items as data items for use in a condition clause of said rule from said data items stored in said data base;
when said selected data items have numerical values, converting said numerical values into symbolic values;
combining one or plural sets of item names and symbolic values thereof in the condition items to create a condition clause, combining an item name and a symbolic value thereof in the conclusion items to create a conclusion clause, and combining the thus-created condition and conclusion clauses to create plural candidate rules;
calculating a criterion for evaluating said candidate rules;
determining one or plural candidate rules out of said candidate rules having a high evaluation level; and
outputting said one or plural candidate rules determined by said determining step as having the high evaluation level.
Dependent claims: 13, 14, 15, 16, 17, 18, 19, 20, 21, 22
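Claim 12 builds condition clauses from one or plural (item name, symbolic value) sets and pairs each with a single conclusion clause. A rough sketch of that combination step follows. The evaluation level used here (confidence multiplied by support) is an assumption, since the claim does not define the criterion; the limit of two pairs per clause, the function names, and the sample records are also hypothetical.

```python
from itertools import combinations, product
from typing import Dict, List, Tuple

Record = Dict[str, str]
Pair = Tuple[str, str]            # (item name, symbolic value)
Clause = Tuple[Pair, ...]         # condition clause: one or more pairs
Rule = Tuple[Clause, Pair]        # (condition clause, conclusion clause)


def condition_clauses(records: List[Record], condition_items: List[str],
                      max_pairs: int = 2) -> List[Clause]:
    """Combine one or several (item, value) pairs into condition clauses."""
    values = {i: sorted({r[i] for r in records}) for i in condition_items}
    clauses: List[Clause] = []
    for size in range(1, max_pairs + 1):
        for items in combinations(condition_items, size):
            for vals in product(*(values[i] for i in items)):
                clauses.append(tuple(zip(items, vals)))
    return clauses


def evaluate(records: List[Record], clause: Clause, conclusion: Pair) -> float:
    """Assumed evaluation level: confidence * support (not from the patent)."""
    matching = [r for r in records if all(r[i] == v for i, v in clause)]
    if not matching:
        return 0.0
    item, value = conclusion
    hits = sum(r[item] == value for r in matching)
    return (hits / len(matching)) * (hits / len(records))


def top_rules(records: List[Record], condition_items: List[str],
              conclusion_item: str, k: int = 3) -> List[Rule]:
    """Pair every condition clause with every conclusion value, keep the k best."""
    conclusions = [(conclusion_item, v)
                   for v in sorted({r[conclusion_item] for r in records})]
    candidates = [(c, t)
                  for c in condition_clauses(records, condition_items)
                  for t in conclusions]
    ranked = sorted(candidates, key=lambda rule: evaluate(records, *rule),
                    reverse=True)
    return ranked[:k]


# Hypothetical records whose values are already symbolic.
records = [
    {"temperature": "high", "load": "high", "fault": "yes"},
    {"temperature": "high", "load": "high", "fault": "yes"},
    {"temperature": "high", "load": "low", "fault": "no"},
    {"temperature": "low", "load": "high", "fault": "no"},
    {"temperature": "low", "load": "low", "fault": "no"},
    {"temperature": "low", "load": "low", "fault": "no"},
]
top = top_rules(records, ["temperature", "load"], "fault", k=2)
```

Capping the clause size with `max_pairs` is a design choice to keep the number of candidates manageable, because the candidate count grows quickly with the number of condition items and symbolic values.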
23. A data analyzing system which groups data of numerical or symbolic values as values of plural data items into one record, then inputs and analyzes data including a plurality of said records, and creates and outputs a rule expressing features of said data, said system comprising:
means for selecting data items for use in an IF clause and a THEN clause of an IF, THEN rule out of said plural data items;
means for converting the numerical values into symbolic values, when the selected data items have numerical values;
means for creating plural candidate rules expressing a correlation between selected data items in a rule form having one or plural sets of item names and symbolic values;
means for calculating a criterion for evaluating the strength of the correlation expressed by said candidate rules;
means for determining one or plural candidate rules having the highest of said criterion out of the created plural candidate rules; and
means for outputting the thus-determined candidate rules having the highest criterion.
24. A data analyzing system comprising:
a user interface for interfacing with a user;
a memory for storing a data base including a plurality of data items; and
a rule generation module for generating a rule based on said data items in said data base, said rule expressing relational features of said data items;
wherein said rule generation module, in response to an input from said user via said user interface, selects data items for use in an IF clause and a THEN clause of an IF, THEN rule from said data items stored in said data base;
converts any numerical values into symbolic values, when said selected data items have numerical values;
creates plural candidate rules each expressing a correlation between selected data items in a rule form having one or plural sets of item names and symbolic values;
calculates a criterion for evaluating strength of correlation between data items in each of said candidate rules;
determines one or plural candidate rules having highest calculated criterion from said candidate rules; and
outputs said one or plural candidate rules to said user via said user interface.
Dependent claims: 25, 26, 27, 28, 29, 30, 31, 32, 33
34. A data analyzing system comprising:
a user interface for interfacing with a user;
a memory for storing a data base including a plurality of data items; and
a rule generation module for generating a rule based on said data items in said data base, said rule expressing relational features of said data items;
wherein said rule generation module, in response to an input from said user via said interface, selects one conclusion item as a data item for use in a conclusion clause of a rule from said data items stored in said data base;
selects condition items as data items for use in a condition clause of said rule from said data items stored in said data base;
converts said numerical values into symbolic values, when said selected data items have numerical values;
combines one or plural sets of item names and symbolic values thereof in the condition items to create a condition clause;
combines an item name and a symbolic value thereof in the conclusion items to create a conclusion clause;
combines the thus-created condition and conclusion clauses to create plural candidate rules;
calculates a criterion for evaluating said candidate rules;
determines one or plural candidate rules out of said candidate rules having a highest evaluation level; and
outputs to said user via said user interface said one or plural candidate rules having the highest evaluation level.
Dependent claims: 35, 36, 37, 38, 39, 40, 41, 42, 43, 44
Specification