Pruning of columns in synopsis tables
First Claim
Patent Images
1. A computer program product for data management, comprising:
- a computer readable storage medium and program instructions stored on the computer readable storage medium, the program instructions comprising;
program instructions to organize a plurality of columns in one or more data tables, wherein respective columns of the plurality of columns are utilized upon receiving a query that includes a predicate to search a main data table;
program instructions to monitor a usage of the plurality of columns in the one or more data tables using an in-memory structure, wherein the in-memory structure stores, for each column in the plurality of columns, (i) a predicate type and (ii) a count of queries that have used the respective column to search the main data table; and
program instructions to drop one or more columns of the plurality of columns in the one or more data tables, based on a period of time threshold which defines an effectiveness for optimizing data queries, wherein dropping the one or more columns comprises dropping a subset of a plurality of content associated with at least one type of data structure within the one or more data tables.
1 Assignment
0 Petitions
Accused Products
Abstract
Embodiments of the present invention provide systems and methods for data management. Synopsis tables have been found to be more effective for maintaining a high level of system performance while answering analytical queries. Synopsis tables, which contain MAX, MIN, and Bloom filter columns, may be modified by dropping ineffective data content within these columns and regenerating dropped data when beneficial. By automatically modifying data, database queries may be optimized.
-
Citations
12 Claims
-
1. A computer program product for data management, comprising:
-
a computer readable storage medium and program instructions stored on the computer readable storage medium, the program instructions comprising; program instructions to organize a plurality of columns in one or more data tables, wherein respective columns of the plurality of columns are utilized upon receiving a query that includes a predicate to search a main data table; program instructions to monitor a usage of the plurality of columns in the one or more data tables using an in-memory structure, wherein the in-memory structure stores, for each column in the plurality of columns, (i) a predicate type and (ii) a count of queries that have used the respective column to search the main data table; and program instructions to drop one or more columns of the plurality of columns in the one or more data tables, based on a period of time threshold which defines an effectiveness for optimizing data queries, wherein dropping the one or more columns comprises dropping a subset of a plurality of content associated with at least one type of data structure within the one or more data tables. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A computer system for data management, comprising:
-
one or more computer processors; one or more computer readable storage media; and program instructions stored on the one or more computer readable storage media for execution by at least one of the one or more processors, the program instructions comprising; program instructions to organize a plurality of columns in one or more data tables, wherein respective columns of the plurality of columns are utilized upon receiving a query that includes a predicate to search a main data table; program instructions to monitor a usage of the plurality of columns in the one or more data tables using an in-memory structure, wherein the in-memory structure stores, for each column in the plurality of columns, (i) a predicate type and (ii) a count of queries that have used the respective column to search the main data table; and program instructions to drop one or more columns of the plurality of columns in the one or more data tables, based on a period of time threshold which defines an effectiveness for optimizing data queries, wherein dropping the one or more columns comprises dropping a subset of a plurality of content associated with at least one type of data structure within the one or more data tables. - View Dependent Claims (8, 9, 10, 11, 12)
-
Specification