INFRASTRUCTURE FOR PARALLEL PROGRAMMING OF CLUSTERS OF MACHINES
First Claim
Patent Images
1. A product comprising:
- a machine readable medium;
first operator logic stored on the medium and operable to;
implement a first data processing operation in parallel over multiple processing nodes, the first data processing operation customized with a first user-defined function executed on the multiple processing nodes; and
second operator logic stored on the medium and operable to;
implement a second data processing operation in parallel over the multiple processing nodes, the second data processing operation customized with a second user-defined function executed on the multiple processing nodes;
where the first or second operator logic comprises block operator logic;
where the first or second user-defined function comprises a user-defined block function; and
where the block operator logic is operable to apply the user-defined block function to records of a first vector.
7 Assignments
0 Petitions
Accused Products
Abstract
GridBatch provides an infrastructure framework that hides the complexities and burdens of developing logic and programming application that implement detail parallelized computations from programmers. A programmer may use GridBatch to implement parallelized computational operations that minimize network bandwidth requirements, and efficiently partition and coordinate computational processing in a multiprocessor configuration. GridBatch provides an effective and lightweight approach to rapidly build parallelized applications using economically viable multiprocessor configurations that achieve the highest performance results.
71 Citations
25 Claims
-
1. A product comprising:
-
a machine readable medium; first operator logic stored on the medium and operable to; implement a first data processing operation in parallel over multiple processing nodes, the first data processing operation customized with a first user-defined function executed on the multiple processing nodes; and second operator logic stored on the medium and operable to; implement a second data processing operation in parallel over the multiple processing nodes, the second data processing operation customized with a second user-defined function executed on the multiple processing nodes; where the first or second operator logic comprises block operator logic; where the first or second user-defined function comprises a user-defined block function; and where the block operator logic is operable to apply the user-defined block function to records of a first vector. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A method for processing data in parallel comprising:
-
initiating execution of a first data processing operation in parallel over multiple processing nodes, the first data processing operation customized with a first user-defined function executed on the multiple processing nodes; and initiating execution of a second data processing operation in parallel over the multiple processing nodes, the second data processing operation customized with a second user-defined function executed on the multiple processing nodes; where the first or second data processing operation comprises block operator logic; where the first or second user-defined function comprises a user-defined block function; and where the block operator logic applies the user-defined block function to records of a first vector. - View Dependent Claims (15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25)
-
Specification