Analysis of data in cause and effect relationships

  • US 5,850,339 A
  • Filed: 10/31/1996
  • Issued: 12/15/1998
  • Est. Priority Date: 10/31/1996
  • Status: Expired due to Fees
First Claim

1. In conjunction with a repeated process wherein a plurality of independent input process variables results in a dependent output variable having either of exactly two outcomes, a method for implementation in a computer for evaluating a data set which comprises a plurality of records each corresponding to a single operation of the process and each record including the respective values of the independent variables and the outcome of the dependent variable for that single operation of the process, wherein each independent variable can be either numeric or categoric, the method determining a combination of a specific number of the independent variables and boundaries defining an included region of values for each of said specific number of independent variables which most likely results in a specific outcome of the dependent variable, the method comprising the steps of:

  • a) for each independent numeric variable

    a1) determining its range of values;

    a2) selecting an initial boundary within the determined range;

    a3) calculating a score where the included region is on each side of the initial boundary, wherein each said score is a measure of the frequency of occurrence of the specific outcome of the dependent variable when said each independent numeric variable has a value in the respective included region;

    a4) selecting the side of the initial boundary which resulted in the higher score to define an initial included region of values for said each independent numeric variable;

    a5) iteratively adjusting the boundary of the included region so as to alter the size of the included region and calculating the score based upon the altered included region for each boundary adjustment; and

    a6) selecting as the final boundary that boundary which provided the highest score;

    b) for each independent categoric variable

    b1) calculating a score for each value of said each independent categoric variable, wherein each said score is a measure of the frequency of occurrence of the specific outcome of the dependent variable when said each independent categoric variable has said each value; and

    b2) selecting that value which provided the highest score;

    c) ranking all the independent variables in order of their scores;

    d) identifying the specific number of the independent variables which have the highest scores; and

    e) providing as an output a list of the identified independent variables and

    e1) the included region identified by the final boundary for each independent numeric variable; and

    e2) the selected value for each independent categoric variable.
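
The steps of claim 1 can be illustrated with a minimal Python sketch. It is one possible reading, not the patented implementation. The function names, the record layout (a list of dicts), the midpoint as the initial boundary, the even sweep of candidate boundaries, and the use of the raw fraction of matching outcomes as the score are all assumptions made for illustration; the claim only requires a score that measures the frequency of the specific outcome.

```python
def hit_rate(records, outcome_key, target, keep):
    """Fraction of the records accepted by `keep` whose outcome equals `target`."""
    kept = [r for r in records if keep(r)]
    if not kept:
        return 0.0  # assumption: an empty included region scores 0
    return sum(1 for r in kept if r[outcome_key] == target) / len(kept)


def best_numeric_region(records, var, outcome_key, target, steps=20):
    """Steps a1-a6: pick a side of a boundary, then search for the best boundary."""
    values = [r[var] for r in records]
    lo, hi = min(values), max(values)                      # a1) range of values

    def score(side, boundary):
        if side == "<=":
            return hit_rate(records, outcome_key, target, lambda r: r[var] <= boundary)
        return hit_rate(records, outcome_key, target, lambda r: r[var] > boundary)

    boundary = (lo + hi) / 2                               # a2) midpoint (assumption)
    side = max(("<=", ">"), key=lambda s: score(s, boundary))  # a3, a4) better side
    best_score, best_boundary = score(side, boundary), boundary
    for i in range(1, steps):                              # a5) adjust the boundary
        candidate = lo + (hi - lo) * i / steps
        s = score(side, candidate)
        if s > best_score:                                 # a6) keep the best boundary
            best_score, best_boundary = s, candidate
    return {"variable": var, "score": best_score, "region": (side, best_boundary)}


def best_categoric_value(records, var, outcome_key, target):
    """Steps b1-b2: score every value and keep the highest-scoring one."""
    best = {"variable": var, "score": -1.0, "value": None}
    for value in sorted({r[var] for r in records}, key=str):
        s = hit_rate(records, outcome_key, target, lambda r: r[var] == value)
        if s > best["score"]:
            best = {"variable": var, "score": s, "value": value}
    return best


def find_best_variables(records, numeric, categoric, outcome_key, target, top_k):
    """Steps c-e: rank all variables by score and return the top `top_k`."""
    results = [best_numeric_region(records, v, outcome_key, target) for v in numeric]
    results += [best_categoric_value(records, v, outcome_key, target) for v in categoric]
    results.sort(key=lambda r: r["score"], reverse=True)   # c) rank by score
    return results[:top_k]                                 # d, e) top variables + regions
```

Each returned entry carries the variable name, its score, and either a `region` pair (side and final boundary) for a numeric variable or a `value` for a categoric one, mirroring the output described in step e).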
