×

DISTRIBUTED COMPUTING SYSTEM FOR LARGE-SCALE DATA HANDLING

  • US 20100162230A1
  • Filed: 12/24/2008
  • Published: 06/24/2010
  • Est. Priority Date: 12/24/2008
  • Status: Abandoned Application
First Claim
Patent Images

1. a system for processing data on a distributed computing environment, the system comprising:

  • a input data storage module containing input data from a weblog;

    a map module in communication with the input data storage module to receive a split of the input data and configured to execute mapper code for manipulating the input data to generate mapped data.a reduce module in communication with the map module to receive the map module to receive the mapped data, the reduce module being configured to execute reducer code for analyzing the mapped data and generate result data.a result data storage module in communication with the reduce module to receive the result data from the reduce module.a master module for coordinating the selection, set-up, and data flow of the map module and the reduce module, the master module loading the mapper code onto the mapper module and the reducer code onto the reducer module; and

    a central storage module containing a mapper executable file and a reducer executable file, wherein the mapper code accesses the central storage module and loads the mapper executable file onto the mapper module and the reducer code loads the reducer executable file onto the reducer module.

View all claims
  • 3 Assignments
Timeline View
Assignment View
    ×
    ×