Compressing database workloads
First Claim
1. A computer system for evaluating a workload having a plurality of statements based on a application that uses the workload comprising:
- a database management system including a log that contains an initial workload W containing a plurality of statements;
an evaluation component that determines a distance function for pairs of statements within the workload which returns an estimate of the loss in quality for an application if a first statement is discarded from the workload but a second statement of the pair of statements is retained in a compressed workload; and
a search component that finds a compressed workload W′
by evaluating the distance function for pairs of statements to produce a compressed workload having fewer statements than the initial workload W.
2 Assignments
0 Petitions
Accused Products
Abstract
Relational database applications such as index selection, histogram tuning, approximate query processing, and statistics selection have recognized the importance of leveraging workloads. Often these applications are presented with large workloads, i.e., a set of SQL DML statements, as input. A key factor affecting the scalability of such applications is the size of the workload. The invention concerns workload compression which helps improve the scalability of such applications. The exemplary embodiment is broadly applicable to a variety of workload-driven applications, while allowing for incorporation of application specific knowledge. The process is described in detail in the context of two workload-driven applications: index selection and approximate query processing.
31 Citations
45 Claims
-
1. A computer system for evaluating a workload having a plurality of statements based on a application that uses the workload comprising:
-
a database management system including a log that contains an initial workload W containing a plurality of statements;
an evaluation component that determines a distance function for pairs of statements within the workload which returns an estimate of the loss in quality for an application if a first statement is discarded from the workload but a second statement of the pair of statements is retained in a compressed workload; and
a search component that finds a compressed workload W′
by evaluating the distance function for pairs of statements to produce a compressed workload having fewer statements than the initial workload W. - View Dependent Claims (2, 3, 4, 5, 9, 10, 11)
-
-
6. For use in a computer database system, a method for compressing a workload used by an application and including a plurality of statements comprising:
-
providing a distance function for pairs of statements within a workload which returns an estimate of the loss in quality for an application if a first statement is discarded from the workload but a second statement of the pair of statements is retained in a compressed workload; and
finding a compressed workload W′
by evaluating the distance function for pairs of statements to produce a compressed workload having fewer statements than an initial workload W. - View Dependent Claims (7, 8, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25)
-
-
26. For use in a computer database system, a computer readable medium having instructions stored on said medium for:
-
providing a distance function for pairs of statements within a workload which returns an estimate of the loss in quality for an application if a first statement is discarded from the workload but a second statement of the pair of statements is retained in a compressed workload; and
finding a compressed workload W′
by evaluating the distance function for pairs of statements to produce a compressed workload having fewer statements than an initial workload W. - View Dependent Claims (27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45)
-
Specification