×

Scalable distributed computations utilizing multiple distinct clouds

  • US 10,348,810 B1
  • Filed: 08/22/2017
  • Issued: 07/09/2019
  • Est. Priority Date: 04/06/2015
  • Status: Active Grant
First Claim
Patent Images

1. A method comprising:

  • initiating distributed computations across a plurality of data processing clusters associated with respective data zones; and

    combining local processing results of the distributed computations from respective ones of the data processing clusters;

    the data processing clusters being configured to perform respective portions of the distributed computations by processing data local to their respective data zones utilizing at least one local data structure configured to support at least one computational framework;

    a first one of data processing clusters being implemented in a first cloud of a first type provided by a first cloud service provider;

    at least a second one of the data processing clusters being implemented in a second cloud of a second type different than the first type, provided by a second cloud service provider different than the first cloud service provider;

    wherein the plurality of data processing clusters associated with the respective data zones are organized in accordance with a global computation graph for performance of the distributed computations and wherein the global computation graph comprises a plurality of nodes corresponding to respective ones of the data processing clusters and further wherein the plurality of nodes are arranged in multiple levels each including at least one of the nodes;

    wherein a global data structure is organized in levels with different levels of the global data structure corresponding to respective ones of the levels of the global computation graph and wherein a given one of the levels of the global data structure comprises local processing results generated by nodes of the corresponding level in the global computation graph;

    wherein the local processing results of the distributed computations from respective ones of the data processing clusters are combined utilizing the global data structure configured based at least in part on the at least one local data structure in order to produce global processing results of the distributed computations; and

    wherein the method is performed by at least one processing device comprising a processor coupled to a memory.

View all claims
  • 7 Assignments
Timeline View
Assignment View
    ×
    ×