Automated hypothesis testing
First Claim
1. A non-transitory computer readable medium including computer executable instructions for:
providing on a display a hierarchal map of a decision process for choosing a hypothesis test, to be executed using one or more data sets, from a plurality of hypothesis tests;
providing on the display a description, understandable by a user unfamiliar with statistical analysis, of a plurality of data types;
providing on the display a description, understandable by a user unfamiliar with statistical analysis, of a plurality of statistical parameters of interest;
providing on the display a description, understandable by a user unfamiliar with statistical analysis, of a plurality of sample sizes;
providing on the display a description, understandable by a user unfamiliar with statistical analysis, of a relationship between a pair of data sets;
receiving from an input device an indication of the data type of the one or more data sets;
receiving from the input device an indication of which of the plurality of statistical parameters of interest is to be tested;
receiving from the input device an indication of the sample size of the one or more data sets;
receiving from the input device an indication of which test of the plurality of tests is to be executed;
receiving from the input device an indication of the relationship of the pair of data sets;
selecting a test to execute based on one or more received indications;
providing on the display a description of the selected test;
providing on the display a description, understandable by a user unfamiliar with statistical analysis, of a difference of interest;
receiving from the input device an indication of the difference of interest;
receiving from the input device an indication of a location of the data sets in a memory;
providing on the display a plurality of hypotheses based on the data sets;
receiving from the input device an indication of which of the plurality of hypotheses to test;
providing on the display a description, understandable by a user unfamiliar with statistical analysis, of a significance level;
receiving from the input device an indication of the significance level for the selected test;
executing the selected test on the one or more data sets based on indications received;
providing on the display an explanation, understandable by a user unfamiliar with statistical analysis, of the results of executing the test;
providing on the display a summary, understandable by a user unfamiliar with statistical analysis, of the results of executing the test; and
providing on the display one or more graphs, understandable by a user unfamiliar with statistical analysis, of the results of executing the test.
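The selecting step in the claim (choosing a test from the received indications of data type, parameter of interest, sample size, and relationship between data sets) can be illustrated with a small decision function. This is a minimal sketch under stated assumptions, not the patented decision map: the `Indications` record, the `select_test` function, the test names, and the sample-size threshold of 30 are all hypothetical, and the threshold follows a common textbook convention that the claim itself does not state.

```python
from dataclasses import dataclass

# Hypothetical cutoff between "small" and "large" samples (textbook convention,
# not taken from the claim).
LARGE_SAMPLE = 30


@dataclass(frozen=True)
class Indications:
    """User answers gathered by the receiving steps (hypothetical structure)."""
    data_type: str                     # "continuous" or "categorical"
    parameter: str                     # "mean", "proportion", or "variance"
    n_datasets: int                    # 1 or 2
    sample_size: int                   # observations per data set
    relationship: str = "independent"  # "paired" or "independent" (two data sets only)


def select_test(ind: Indications) -> str:
    """Walk a simple decision tree and return the name of a hypothesis test."""
    if ind.n_datasets not in (1, 2):
        raise ValueError("this sketch handles one or two data sets only")

    # Categorical data or a proportion of interest leads to proportion tests.
    if ind.data_type == "categorical" or ind.parameter == "proportion":
        return "one-proportion z-test" if ind.n_datasets == 1 else "two-proportion z-test"

    # Variance as the parameter of interest.
    if ind.parameter == "variance":
        if ind.n_datasets == 1:
            return "chi-square test for one variance"
        return "F-test for two variances"

    # Remaining case: continuous data with the mean as the parameter of interest.
    if ind.n_datasets == 1:
        if ind.sample_size >= LARGE_SAMPLE:
            return "one-sample z-test"
        return "one-sample t-test"
    if ind.relationship == "paired":
        return "paired t-test"
    return "two-sample t-test"


if __name__ == "__main__":
    answers = Indications("continuous", "mean", 2, 12, "paired")
    print(select_test(answers))
```

Keeping the answers in one immutable record means the same object can later drive the description of the selected test and the wording of the results.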
Abstract
A method of automatically applying a hypothesis test to a data set. The method reduces errors made in failing to appreciate predicate assumptions of various statistical tests, and elicits a series of indications from the user regarding characteristics of interest embodied by the data set to select an appropriate statistical test. The system also reduces errors in constructing competing null and alternative hypothesis statements by generating a characterization of the data and defining null and alternative hypotheses according to the indications, selected statistical test, and conventions adopted with respect to the tests. The system also establishes a significance level, calculates the test statistic, and generates an output. The output of the system provides a plain interpretation of the quantitative results in the terms indicated by the user to reduce errors in interpretation of the conclusion.
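The flow summarized in the abstract (generate null and alternative hypotheses, apply a significance level, calculate a test statistic, and report a plain interpretation) can be sketched for the single case of a one-sample test on a mean. Everything named here is an assumption made for illustration: `build_hypotheses`, `z_test_p_value`, and `interpret` are hypothetical helpers, the wording of the statements is invented, and the p-value uses a large-sample normal approximation so that the example needs only the Python standard library.

```python
from math import sqrt
from statistics import NormalDist, mean, stdev


def build_hypotheses(name: str, target: float):
    """Return the null statement and the candidate alternatives in plain words."""
    null = f"The mean of {name} equals {target}."
    alternatives = {
        "two-sided": f"The mean of {name} differs from {target}.",
        "greater": f"The mean of {name} is greater than {target}.",
        "less": f"The mean of {name} is less than {target}.",
    }
    return null, alternatives


def z_test_p_value(data, target: float, direction: str) -> float:
    """Approximate p-value for a one-sample test on the mean.

    Assumption: the normal approximation with the sample standard deviation
    stands in for whichever test a real system would select.
    """
    z = (mean(data) - target) / (stdev(data) / sqrt(len(data)))
    cdf = NormalDist().cdf(z)
    if direction == "greater":
        return 1.0 - cdf
    if direction == "less":
        return cdf
    return 2.0 * min(cdf, 1.0 - cdf)


def interpret(p_value: float, alpha: float, alternative: str) -> str:
    """Translate the numeric result into a sentence in the user's own terms."""
    if p_value < alpha:
        return f"The data support the statement: {alternative}"
    return f"The data do not give enough evidence for the statement: {alternative}"


if __name__ == "__main__":
    fill_weights = [12, 13, 11, 12, 14, 13, 12, 11, 13, 12]  # made-up sample
    null, alternatives = build_hypotheses("fill weight", 10)
    p = z_test_p_value(fill_weights, 10, "greater")
    print(null)
    print(interpret(p, 0.05, alternatives["greater"]))
```

Phrasing the conclusion with the alternative statement the user picked, rather than with "reject" or "fail to reject", mirrors the abstract's aim of reducing errors in interpretation.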
13 Citations
10 Claims
Specification