Dynamic Data Partitioning For Optimal Resource Utilization In A Parallel Data Processing System
First Claim
1. A computer-implemented method for dynamically distributing data for parallel processing in a computing system, comprising:
- allocating a data buffer to each of a plurality of data partitions, wherein each data buffer stores data to be processed by its corresponding data partition;
distributing data in multiple rounds to the data buffers for processing by the data partitions, wherein in each round the data is distributed based on a determined data processing capacity for each data partition, wherein a greater amount of data is distributed to the data partitions with higher determined processing capacities; and
periodically monitoring usage of each data buffer and re-determining the determined data processing capacity of each data partition based on its corresponding data buffer usage.
0 Assignments
0 Petitions
Accused Products
Abstract
A method, computer program product, and system for dynamically distributing data for parallel processing in a computing system, comprising allocating a data buffer to each of a plurality of data partitions, where each data buffer stores data to be processed by its corresponding data partition, distributing data in multiple rounds to the data buffers for processing by the data partitions, where in each round the data is distributed based on a determined data processing capacity for each data partition, and where a greater amount of data is distributed to the data partitions with higher determined processing capacities, and periodically monitoring usage of each data buffer and re-determining the determined data processing capacity of each data partition based on its corresponding data buffer usage.
97 Citations
9 Claims
-
1. A computer-implemented method for dynamically distributing data for parallel processing in a computing system, comprising:
-
allocating a data buffer to each of a plurality of data partitions, wherein each data buffer stores data to be processed by its corresponding data partition; distributing data in multiple rounds to the data buffers for processing by the data partitions, wherein in each round the data is distributed based on a determined data processing capacity for each data partition, wherein a greater amount of data is distributed to the data partitions with higher determined processing capacities; and periodically monitoring usage of each data buffer and re-determining the determined data processing capacity of each data partition based on its corresponding data buffer usage. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
Specification