Performing cross-tabulation using a columnar database management system
First Claim
1. A columnar database data mining method, comprising:
- receiving a data mining request, the data mining request indicating a cross-tabulation operation, a first table, and a second table, the receiving performed by the columnar database management system, the columnar database management system having a processor and a non-transitory computer-readable memory;
performing, by the columnar database management system, the cross-tabulation operation on the first table and the second table, the cross-tabulation operation comprising a horizontal collapse process and a vertical collapse process,the horizontal collapse process comprising;
determining domains, each domain comprising a set of records of the first table, the set of record matching a condition or value in a column of the first table, the column representing a row dimension for a result table, each row of the result table defining a record resulting from the cross-tabulation operation; and
determining an intersection set of domains;
the vertical collapse process comprising;
moving the domains towards a cross-tabulation resolution level, the moving comprising mapping sets of records in the intersection set of domains to records of the second table; and
determining, by the columnar database management system, a data mining result, the data mining result comprising an aggregate value from a record of the result table.
2 Assignments
0 Petitions
Accused Products
Abstract
Cross-tabulation operation is performed within a columnar database management system. The columnar database management system receives a request to perform a cross-tabulation operation on a set of database tables. The columnar database management system determines values of cross-tabulation operation for each row of the result. The columnar database management system determines a domain for each value of the row dimension corresponding to a row combination. The columnar database management system determines an intersection set of the domains corresponding to values of the row dimensions for the row combination. The columnar database management system determines a value for the result column for the row combination as an aggregate value based on the records of the intersection set.
-
Citations
20 Claims
-
1. A columnar database data mining method, comprising:
-
receiving a data mining request, the data mining request indicating a cross-tabulation operation, a first table, and a second table, the receiving performed by the columnar database management system, the columnar database management system having a processor and a non-transitory computer-readable memory; performing, by the columnar database management system, the cross-tabulation operation on the first table and the second table, the cross-tabulation operation comprising a horizontal collapse process and a vertical collapse process, the horizontal collapse process comprising; determining domains, each domain comprising a set of records of the first table, the set of record matching a condition or value in a column of the first table, the column representing a row dimension for a result table, each row of the result table defining a record resulting from the cross-tabulation operation; and determining an intersection set of domains; the vertical collapse process comprising; moving the domains towards a cross-tabulation resolution level, the moving comprising mapping sets of records in the intersection set of domains to records of the second table; and determining, by the columnar database management system, a data mining result, the data mining result comprising an aggregate value from a record of the result table. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A columnar database management system, comprising:
-
a processor; a non-transitory computer-readable medium; and stored instructions translatable by the processor to perform; receiving a data mining request, the data mining request indicating a cross-tabulation operation, a first table, and a second table; performing the cross-tabulation operation on the first table and the second table, the cross-tabulation operation comprising a horizontal collapse process and a vertical collapse process, the horizontal collapse process comprising; determining domains, each domain comprising a set of records of the first table, the set of record matching a condition or value in a column of the first table, the column representing a row dimension for a result table, each row of the result table defining a record resulting from the cross-tabulation operation; and determining an intersection set of domains; the vertical collapse process comprising; moving the domains towards a cross-tabulation resolution level, the moving comprising mapping sets of records in the intersection set of domains to records of the second table; and determining a data mining result, the data mining result comprising an aggregate value from a record of the result table. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20)
-
Specification