EFFICIENT IMPLEMENTATIONS FOR MAPREDUCE SYSTEMS
Abstract
In some embodiments, a processor configured to function as at least a first Reducer in a MapReduce system may receive a set of mapped [key, value] pairs output from a Mapper in the MapReduce system, identify within the set of mapped [key, value] pairs one or more [key, value] pairs for whose keys the first Reducer is not responsible, and transfer those [key, value] pairs to one or more other Reducers in the MapReduce system. In some embodiments, a system including at least one processor may receive a data packet including a set of mapped [key, value] pairs corresponding to a plurality of keys handled by a plurality of Reducers in a MapReduce system. For each mapped [key, value] pair, the system may identify the corresponding key and one of the Reducers responsible for that key, and provide the mapped [key, value] pair to the Reducer for processing.
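The first embodiment in the abstract can be sketched roughly as follows. This is only an illustrative sketch, not the patented implementation: the function names `responsible_reducer` and `split_and_forward`, the byte-sum partitioning rule, and the use of integer Reducer IDs are all assumptions introduced here. The source does not specify how responsibility for a key is determined.

```python
from collections import defaultdict


def responsible_reducer(key, num_reducers):
    """Hypothetical partitioning rule (an assumption, not from the source):
    sum the UTF-8 bytes of the key and take the result modulo the Reducer count."""
    return sum(key.encode("utf-8")) % num_reducers


def split_and_forward(reducer_id, pairs, num_reducers):
    """Separate the pairs this Reducer is responsible for from those it must transfer.

    Returns (kept, forwarded), where `forwarded` maps each other Reducer's ID
    to the list of [key, value] pairs that would be transferred to it.
    """
    kept = []
    forwarded = defaultdict(list)
    for key, value in pairs:
        owner = responsible_reducer(key, num_reducers)
        if owner == reducer_id:
            kept.append((key, value))              # this Reducer handles the key itself
        else:
            forwarded[owner].append((key, value))  # identified for transfer to another Reducer
    return kept, dict(forwarded)


# Reducer 0 of 2 receives three mapped pairs from a Mapper.
mapped = [("a", 1), ("b", 2), ("c", 3)]
kept, forwarded = split_and_forward(0, mapped, 2)
```

In this sketch the actual network transfer is left out; `forwarded` simply groups the identified pairs by destination so that each group could be sent to its Reducer in one step.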
22 Claims
1. Apparatus comprising:
a processor configured to function as at least a first Reducer in a MapReduce system; and
a processor-readable storage medium storing processor-executable instructions that, when executed by the processor, cause the processor to perform a method comprising:
receiving a set of mapped [key, value] pairs output from a Mapper in the MapReduce system;
identifying, within the set of mapped [key, value] pairs, one or more [key, value] pairs for whose keys the first Reducer is not responsible; and
transferring the one or more identified [key, value] pairs to one or more other Reducers in the MapReduce system.
Dependent claims: 2, 3, 4, 5.
6. A method comprising:
receiving, at a processor configured to function as at least a first Reducer in a MapReduce system, a set of mapped [key, value] pairs output from a Mapper in the MapReduce system;
identifying, by the processor configured to function as at least the first Reducer, within the set of mapped [key, value] pairs, one or more [key, value] pairs for whose keys the first Reducer is not responsible; and
transferring the one or more identified [key, value] pairs to one or more other Reducers in the MapReduce system.
Dependent claims: 7, 8, 9, 10.
11. Apparatus comprising:
at least one processor; and
at least one processor-readable storage medium storing processor-executable instructions that, when executed by the at least one processor, cause the at least one processor to perform a method comprising:
receiving a data packet including a set of mapped [key, value] pairs corresponding to a plurality of keys handled by a plurality of Reducers in a MapReduce system; and
for each mapped [key, value] pair in the set of mapped [key, value] pairs:
identifying a key corresponding to the respective mapped [key, value] pair;
identifying a Reducer of the plurality of Reducers responsible for the identified key; and
providing the respective mapped [key, value] pair to the identified Reducer for processing.
Dependent claims: 12, 13, 14, 15, 16.
17. A method comprising:
receiving, at at least one processor, a data packet including a set of mapped [key, value] pairs corresponding to a plurality of keys handled by a plurality of Reducers in a MapReduce system; and
for each mapped [key, value] pair in the set of mapped [key, value] pairs:
identifying, via the at least one processor, a key corresponding to the respective mapped [key, value] pair;
identifying, via the at least one processor, a Reducer of the plurality of Reducers responsible for the identified key; and
providing the respective mapped [key, value] pair to the identified Reducer for processing.
Dependent claims: 18, 19, 20, 21, 22.
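The per-pair steps recited in claims 11 and 17 (identify the key, identify the responsible Reducer, provide the pair to that Reducer) might be sketched as below. This is a minimal illustration under stated assumptions: the packet is modeled as a list of (key, value) tuples, the key-to-Reducer assignment as a plain lookup table, and each Reducer as an in-memory inbox list. The names `dispatch_packet`, `key_to_reducer`, and `reducer_inboxes` are hypothetical and do not come from the claims.

```python
def dispatch_packet(packet, key_to_reducer, reducer_inboxes):
    """Route every mapped pair in one packet to the Reducer responsible for its key.

    packet:          list of (key, value) tuples (assumed packet representation)
    key_to_reducer:  dict mapping each key to a Reducer ID (assumed lookup table)
    reducer_inboxes: dict mapping each Reducer ID to a list that receives pairs
    """
    for pair in packet:
        key = pair[0]                             # identify the key of this pair
        reducer_id = key_to_reducer[key]          # identify the responsible Reducer
        reducer_inboxes[reducer_id].append(pair)  # provide the pair to that Reducer
    return reducer_inboxes


# One packet carrying pairs for keys handled by two different Reducers.
packet = [("x", 1), ("y", 2), ("x", 3)]
table = {"x": "R0", "y": "R1"}
inboxes = dispatch_packet(packet, table, {"R0": [], "R1": []})
```

A lookup table is used here only because it keeps the example short; a hash-based or range-based assignment would fit the same three steps.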
Specification