Methods and systems for parallel distributed computation
Abstract
Methods and systems for performing a shared computing task are provided. The method includes configuring a shared computing task among a plurality of computing nodes each executing an application for performing the shared computing task; generating intermediate data by each of the plurality of computing nodes during a mapping process for performing the shared computing task; capturing the intermediate data for each of the plurality of computing nodes; storing the intermediate data for each of the plurality of computing nodes at a key-value data store at a shared storage device that enables searching for the intermediate data using a key-value pair; and using the key-value pair to obtain the intermediate data when needed for a reduce process of the shared computing task.
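For illustration only, the flow summarized in the abstract can be sketched as a small word-count job. This is a minimal sketch under stated assumptions, not the patented implementation: the in-memory `SharedKeyValueStore` class stands in for the key-value data store at the shared storage device, the `(node_id, partition)` key layout and the byte-sum partitioning rule are hypothetical choices, and the names `map_node`, `reduce_partition`, and `run_job` do not appear in the source.

```python
from collections import Counter, defaultdict


class SharedKeyValueStore:
    """Hypothetical in-memory stand-in for the shared key-value data store."""

    def __init__(self):
        self._data = {}

    def put(self, key, value):
        self._data[key] = value

    def get(self, key):
        return self._data[key]

    def keys(self):
        return list(self._data)


def partition_of(word, num_reducers):
    # Assumed partitioning rule: a deterministic byte sum, so the result
    # does not depend on Python's per-process string hash salting.
    return sum(word.encode("utf-8")) % num_reducers


def map_node(node_id, lines, store, num_reducers):
    """Mapping process on one node: count words and store the counts remotely."""
    partitions = defaultdict(Counter)
    for line in lines:
        for word in line.split():
            partitions[partition_of(word, num_reducers)][word] += 1
    for partition, counts in partitions.items():
        # Intermediate data is captured and transferred to the shared store,
        # keyed by the producing node and the target reduce partition.
        store.put((node_id, partition), dict(counts))


def reduce_partition(partition, store):
    """Reduce process: look up every node's intermediate data for one partition."""
    total = Counter()
    for node_id, part in store.keys():
        if part == partition:
            total.update(store.get((node_id, part)))
    return dict(total)


def run_job(inputs, num_reducers=2):
    """Run all map steps, then all reduce steps, against one shared store."""
    store = SharedKeyValueStore()
    for node_id, lines in inputs.items():
        map_node(node_id, lines, store, num_reducers)
    result = {}
    for partition in range(num_reducers):
        result.update(reduce_partition(partition, store))
    return result


if __name__ == "__main__":
    print(run_job({"node-a": ["a b a"], "node-b": ["b c"]}))
```

In this sketch the reduce step never reads from a mapper's local disk; it retrieves intermediate data for any node from the shared store by key, which is the property the abstract emphasizes.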
20 Claims
1. A machine implemented method, comprising:
configuring a shared computing task among a plurality of computing nodes each computing node executing an application for performing the shared computing task;
generating intermediate data by each of the plurality of computing nodes during a mapping process for performing the shared computing task;
capturing the intermediate data for each of the plurality of computing nodes and transferring the intermediate data of each of the plurality of computing nodes to a shared storage device;
storing the intermediate data for each of the plurality of computing nodes at a key-value data store at the shared storage device that enables searching for the intermediate data of any node at the shared storage device using a key-value pair; and
using the key-value pair to obtain the intermediate data from the shared storage device when needed for a reduce process of the shared computing task;
wherein the mapping process and the reduce process are executed by a processor coupled to a memory device.
Dependent claims: 2, 3, 4, 5, 6, 7
8. A non-transitory, machine readable storage medium having stored thereon instructions for performing a method, comprising machine executable code which when executed by at least one machine, causes the machine to:
configure a shared computing task among a plurality of computing nodes each computing node executes an application for performing the shared computing task;
generate intermediate data by each of the plurality of computing nodes during a mapping process for performing the shared computing task;
capture the intermediate data for each of the plurality of computing nodes and transfer the intermediate data of each of the plurality of computing nodes to a shared storage device;
store the intermediate data for each of the plurality of computing nodes at a key-value data store at the shared storage device that enables searching for the intermediate data of any node at the shared storage device using a key-value pair; and
use the key-value pair to obtain the intermediate data from the shared storage device when needed for a reduce process of the shared computing task;
wherein the mapping process and the reduce process are executed by a processor coupled to a memory device.
Dependent claims: 9, 10, 11, 12, 13, 14
15. A system, comprising:
a memory containing machine readable medium comprising machine executable code having stored thereon instructions; and
a processor module coupled to the memory, the processor module configured to execute the machine executable code to:
configure a shared computing task among a plurality of computing nodes each computing node executes an application for performing the shared computing task;
generate intermediate data by each of the plurality of computing nodes during a mapping process for performing the shared computing task;
capture the intermediate data for each of the plurality of computing nodes and transfer the intermediate data of each of the plurality of computing nodes to a shared storage device;
store the intermediate data for each of the plurality of computing nodes at a key-value data store at the shared storage device that enables searching for the intermediate data of any node at the shared storage device using a key-value pair; and
use the key-value pair to obtain the intermediate data from the shared storage device when needed for a reduce process of the shared computing task;
wherein the mapping process and the reduce process are executed by a processor coupled to a memory device.
Dependent claims: 16, 17, 18, 19, 20
Specification