Pruning of columns in synopsis tables
First Claim
Patent Images
1. A method for data management, comprising the steps of:
- organizing, by one or more processors, a plurality of columns in one or more data tables, wherein respective columns of the plurality of columns are utilized upon receiving a query that includes a predicate to search a main data table;
monitoring, by one or more processors, a usage of the plurality of columns in the one or more data tables using an in-memory structure, wherein the in-memory structure stores, for each column in the plurality of columns, (i) a predicate type and (ii) a count of queries that have used the respective column to search the main data table; and
dropping, by one or more processors, one or more columns of the plurality of columns in the one or more data tables, based on a period of time threshold which defines an effectiveness for optimizing data queries, wherein dropping the one or more columns comprises dropping a subset of a plurality of content associated with at least one type of data structure within the one or more data tables.
1 Assignment
0 Petitions
Accused Products
Abstract
Embodiments of the present invention provide systems and methods for data management. Synopsis tables have been found to be more effective for maintaining a high level of system performance while answering analytical queries. Synopsis tables, which contain MAX, MIN, and Bloom filter columns, may be modified by dropping ineffective data content within these columns and regenerating dropped data when beneficial. By automatically modifying data, database queries may be optimized.
55 Citations
6 Claims
-
1. A method for data management, comprising the steps of:
-
organizing, by one or more processors, a plurality of columns in one or more data tables, wherein respective columns of the plurality of columns are utilized upon receiving a query that includes a predicate to search a main data table; monitoring, by one or more processors, a usage of the plurality of columns in the one or more data tables using an in-memory structure, wherein the in-memory structure stores, for each column in the plurality of columns, (i) a predicate type and (ii) a count of queries that have used the respective column to search the main data table; and dropping, by one or more processors, one or more columns of the plurality of columns in the one or more data tables, based on a period of time threshold which defines an effectiveness for optimizing data queries, wherein dropping the one or more columns comprises dropping a subset of a plurality of content associated with at least one type of data structure within the one or more data tables. - View Dependent Claims (2, 3, 4, 5, 6)
-
Specification