×

Shuffle embedded distributed storage system supporting virtual merge and method thereof

  • US 10,135,926 B2
  • Filed: 10/21/2015
  • Issued: 11/20/2018
  • Est. Priority Date: 06/09/2015
  • Status: Active Grant
First Claim
Patent Images

1. A shuffle embedded distributed storage system supporting virtual merge, the system comprising:

  • a distributed shared storage configured to store a virtual merged file;

    a plurality of map servers connected to the distributed shared storage via a network, and configured to perform a map function and record a map result data computed as a result of the map function in an aligned format in the distributed shared storage by means of a map result file; and

    a plurality of reduce servers connected to the distributed shared storage and the map servers via the network for performing a reduce function on the map result files received from the virtual merged file of the distributed shared storage,wherein the virtual merged file comprises a list of the map result files generated by the plurality of map servers, and an identifier of one of the plurality of reduce servers to which the virtual merged file is to be transmitted, wherein the map result files are registered in the virtual merged file,wherein one or more of the plurality of map servers transmits an identifier of the virtual merged file to one or more of the plurality of the reduce servers, andwherein, in response to receiving a request for data reading from a selected one of the plurality of reduce servers, the distributed shared storage searches the virtual merged file having an identifier that is identical to the selected reduce server, and reads and aligns data of the map result files included in the searched virtual merged file consecutively, and transmits the aligned data to the selected reduce server without merging the map result files.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×