×

High performance hadoop with new generation instances

  • US 10,606,478 B2
  • Filed: 10/22/2015
  • Issued: 03/31/2020
  • Est. Priority Date: 10/22/2014
  • Status: Active Grant
First Claim
Patent Images

1. A distributed computing system comprising a plurality of computational clusters, each computational cluster utilized in a MapReduce model and comprising a plurality of compute optimized instances, each instance comprising local instance data storage and in communication with reserved disk storage, wherein processing hierarchy is configured to use local instance data storage unless there is insufficient space on the local instance data storage, thereby providing priority to local instance data storage before providing priority to reserved disk storage,wherein intermediate data files within the MapReduce model are stored at least in part on the compute optimized instances comprising a list of directories residing on the local instance data storage and a list of directories residing on the reserved disk storage, wherein the directories on the reserved disk storage are accessed when a processing request cannot be handled by the local instance data storage;

  • andwherein the distributed computer system is configured to auto-scale;

    upon adding an instance to a cluster, mounting a reserved disk storage associated with the instance is delayed until disk utilization on a cluster exceeds a predetermined threshold; and

    upon terminating an instance from a cluster, terminating any reserved disk storage associated with the instance.

View all claims
  • 4 Assignments
Timeline View
Assignment View
    ×
    ×