×

Memory optimized data shuffle

  • US 10,642,520 B1
  • Filed: 04/18/2017
  • Issued: 05/05/2020
  • Est. Priority Date: 04/18/2017
  • Status: Active Grant
First Claim
Patent Images

1. A method comprising:

  • maintaining a first data shuffle memory pool at a data shuffle writer node and a second data shuffle memory pool at a data shuffle reader node, wherein the data shuffle writer node and the data shuffle reader node are part of a set of multiple nodes of a distributed data processing system;

    performing an in-memory compression on at least a portion of a data set from the first data shuffle memory pool;

    performing a data shuffle operation on the at least a portion of the compressed data from the first shuffle memory pool, wherein the data shuffle operation maps different parts of the compressed data for transmission to different nodes of the distributed data processing system; and

    transmitting, in response to the data shuffle operation, the at least a portion of the compressed data from the first data shuffle memory pool to the second data shuffle memory pool in a peer-to-peer manner;

    wherein the distributed data processing system is implemented via one or more processing devices.

View all claims
  • 7 Assignments
Timeline View
Assignment View
    ×
    ×