SYSTEM AND METHOD FOR PARALLEL COMPUTATION OF FREQUENCY HISTOGRAMS ON JOINED TABLES
First Claim
Patent Images
1. A method comprising:
- reading data in a table row-by-row from a database system using a coordinator unit;
distributing each read row to separate worker units;
computing a partial frequency histogram for each column in said table using each worker unit in parallel;
merging partial histograms from said worker units; and
sending said merged frequency histograms to said worker units using said coordinator unit.
1 Assignment
0 Petitions
Accused Products
Abstract
According to one embodiment of the present invention, a method for the parallel computation of frequency histograms in joined tables is provided. The method includes reading data in a table row-by-row from a database system using a coordinator unit and distributing each read row to separate worker units. Each worker unit computes a partial frequency histogram for each column in the table in parallel. The partial histograms from the worker units are then merged and the coordinator unit sends the merged frequency histograms to the worker units.
39 Citations
25 Claims
-
1. A method comprising:
-
reading data in a table row-by-row from a database system using a coordinator unit; distributing each read row to separate worker units; computing a partial frequency histogram for each column in said table using each worker unit in parallel; merging partial histograms from said worker units; and sending said merged frequency histograms to said worker units using said coordinator unit. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A method comprising:
-
reading data in a table row-by-row from a database system using a coordinator unit, said table being at a first join depth; distributing each read row to separate worker units; computing a partial frequency histogram for each column in said table using each worker in parallel; merging partial histograms from said worker units; sending said merged frequency histograms to said worker units using said coordinator unit; said coordinator unit reading rows from a table at a second join depth and distributing each said second join depth table to one of said worker units; and said worker units computing frequency histograms for each column in said second join depth table using said merged frequency histograms from said first join depth tables. - View Dependent Claims (14, 15, 16, 17)
-
-
18. A database system comprising:
-
a coordinator unit; a plurality of worker units; a set of tables at a first join depth, each table having columns and rows, wherein said coordinator unit reads data in said tables row-by-row and distributes each row to separate worker units; said worker units operating in parallel to compute a partial frequency histogram for each column in said table received from said coordinator unit; and said coordinator unit merging partial histograms form said worker units and sending said merged frequency histograms to said worker units. - View Dependent Claims (19, 20)
-
-
21. A computer program product for computing frequency histograms, said computer program product comprising:
-
a computer usable medium having computer usable program code embodied therewith, said computer usable program code comprising; computer usable program code configured to; read data in a table row-by-row from a database system using a coordinator unit; distribute each read row to separate worker units; compute a partial frequency histogram for each column in said table using each worker unit in parallel; merge partial histograms from said worker units; and send said merged frequency histograms to said worker units using said coordinator unit. - View Dependent Claims (22, 23, 24, 25)
-
Specification