Statistical models for improving the performance of database operations
First Claim
1. A method for an automatic software-driven statistical evaluation of a large amount of data to be assigned to statistical variables in a database contained in at least one cluster, the method comprising:
- developing a statistical model that approximately describes at least one relative frequency of the states of the statistical variables and a statistical dependency between the states of the statistical variables;
determining an approximate relative frequency of the states of the statistical variables and an approximate relative frequency belonging to an at least one pre-determined relative frequency of the states of the statistical variables and an expected value of the states of the statistical variables dependent thereon by using data stored in the database and the statistical model;
and performing a statistical evaluation of at least one of;
(a) customer data in a Web reporting/Web mining area;
(b) customer data in a customer relationship management system;
(c) an environmental database;
(d) a medical database; and
(e) a genome database;
and outputting results of the statistical evaluation.
2 Assignments
0 Petitions
Accused Products
Abstract
A method for performing an automatic software-driven statistical evaluation of a large amount of data to be assigned to statistical variables in a database contained in at least one cluster. The method is characterized by using a statistical model to model an approximate description of a relative frequency of the state or states of the statistical variables and a statistical dependencies between the state or states, and then determining the approximate relative frequency of the state or states of the statistical variables and the approximate relative frequency belonging to a predetermined relative frequency of the state or states of the statistical variables and an expected value of the state or states of the statistical variables dependent thereon.
16 Citations
16 Claims
-
1. A method for an automatic software-driven statistical evaluation of a large amount of data to be assigned to statistical variables in a database contained in at least one cluster, the method comprising:
-
developing a statistical model that approximately describes at least one relative frequency of the states of the statistical variables and a statistical dependency between the states of the statistical variables; determining an approximate relative frequency of the states of the statistical variables and an approximate relative frequency belonging to an at least one pre-determined relative frequency of the states of the statistical variables and an expected value of the states of the statistical variables dependent thereon by using data stored in the database and the statistical model; and performing a statistical evaluation of at least one of; (a) customer data in a Web reporting/Web mining area; (b) customer data in a customer relationship management system; (c) an environmental database; (d) a medical database; and (e) a genome database; and outputting results of the statistical evaluation. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15)
-
-
16. A method for an automatic software-driven statistical evaluation of a large amount of data to be assigned to statistical variables in a database contained in one or several clusters comprising;
-
subdividing the data into many clusters by a distance-based clustering algorithm, wherein the data considered is restricted to the data contained in at least one cluster; determining at least one relative frequency and at least one expected value of states of statistical variables by using a database reporting method or a OLAP method; and performing a statistical evaluation of at least one of; (a) customer data in a Web reporting/Web mining area; (b) customer data in a customer relationship management system; (c) an environmental database; (d) a medical database; and (e) a genome database; and outputting results of the statistical evaluation.
-
Specification