Adaptive Parallel Data Processing
First Claim
Patent Images
1. A method comprising:
- receiving a request to process a data file;
splitting the data file into multiple portions;
sending the multiple portions to multiple nodes, each node configured to process a respective portion of the data file;
responsive to an amount of processing of the data file being completed, dynamically splitting at least one of the multiple portions of the data file that have yet to complete processing into multiple sub-portions; and
submitting the multiple sub-portions to one or more of the multiple nodes for processing of the multiple sub-portions.
3 Assignments
0 Petitions
Accused Products
Abstract
Adaptive parallel data processing techniques are described. In one or more embodiments, a request is received to process a data file. The data file is split into multiple portions and sent to multiple nodes, where each node is configured to process a respective portion of the data file. Responsive to an amount of processing of the data file being completed, at least one of the multiple portions of the data file is dynamically split into multiple sub-portions. The sub-portions are submitted to one or more of the multiple nodes for processing of the sub-portions.
26 Citations
20 Claims
-
1. A method comprising:
-
receiving a request to process a data file; splitting the data file into multiple portions; sending the multiple portions to multiple nodes, each node configured to process a respective portion of the data file; responsive to an amount of processing of the data file being completed, dynamically splitting at least one of the multiple portions of the data file that have yet to complete processing into multiple sub-portions; and submitting the multiple sub-portions to one or more of the multiple nodes for processing of the multiple sub-portions. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A system comprising:
a memory and a processor configured to execute instructions in the memory to implement a service manager module, the service manager module configured to; monitor progress of conversion of a document, the document split into multiple portions and submitted to a pool of nodes for conversion of respective portions; responsive to completion of conversion of a predefined amount of the document; split at least one of the multiple portions into a plurality of sub-portions, the at least one of the multiple portions being non-completed; and submit the plurality of sub-portions to the pool of nodes for conversion of each sub-portion. - View Dependent Claims (13, 14, 15, 16)
-
17. One or more computer-readable storage media having instructions that, responsive to execution by a computer, cause the computer to perform operations comprising:
-
monitoring progress of processing of a file that is split into a plurality of portions to be processed by a plurality of nodes; receiving results corresponding to one or more of the plurality of portions; based on the monitoring, identifying that one or more non-completed portions of the plurality of portions of the file remain; splitting the one or more non-completed portions into a plurality of sub-portions; and submitting the plurality of sub-portions to the plurality of nodes for processing. - View Dependent Claims (18, 19, 20)
-
Specification