
Data-driven automation mechanism for analytics workload distribution

  • US 10,374,968 B1
  • Filed: 12/30/2016
  • Issued: 08/06/2019
  • Est. Priority Date: 12/30/2016
  • Status: Active Grant
First Claim

1. A method comprising:

    receiving a data processing request in a first workload distribution node configured to communicate with a plurality of distributed data processing clusters over at least one network;

    identifying particular ones of the plurality of distributed data processing clusters that are suitable for handling at least a portion of the data processing request;

    separating the data processing request into a plurality of data tasks;

    providing each of the data tasks to one or more of the identified distributed data processing clusters;

    receiving for each of the data tasks an indication from one or more of the distributed data processing clusters of its ability to perform the data task;

    assigning the data tasks to one or more of the distributed data processing clusters responsive to the received indications;

    receiving results of performance of the data tasks from the one or more assigned distributed data processing clusters; and

    aggregating the results into a response that is returned to a source of the data processing request;

    wherein the source of the data processing request comprises another workload distribution node and further wherein the data processing request comprises a given data task of a higher-level data processing request separated into a plurality of data tasks by the other workload distribution node for handling by the first workload distribution node and one or more additional workload distribution nodes;

    wherein the first workload distribution node comprises an analytics workload distribution node and the given data task of the higher-level data processing request comprises a request to process at least a portion of an analytics workload using at least a subset of the plurality of distributed data processing clusters;

    wherein the data tasks are assigned and the corresponding results are aggregated in a manner that ensures satisfaction of one or more privacy policies of the one or more distributed data processing clusters;

    wherein the method is performed by at least one processing device comprising a processor coupled to a memory; and

    wherein said at least one processing device implements the first workload distribution node.
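
The steps recited in claim 1 can be illustrated with a minimal sketch. It is not taken from the patent: the names `Cluster`, `WorkloadNode`, `can_perform`, `perform`, `handle`, and `handle_higher_level` are hypothetical, a request is assumed to be a list of dataset names, each data task is assumed to be a sum over one dataset, and the privacy policy is assumed to mean that a cluster returns only a local aggregate and never raw records. The sketch also assigns each task to every cluster that indicates it can perform it, which is only one reading of "one or more".

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Cluster:
    """Hypothetical stand-in for one distributed data processing cluster."""
    name: str
    data: Dict[str, List[int]]  # dataset name -> locally held values

    def can_perform(self, task: str) -> bool:
        # Indication of ability: the cluster holds the dataset the task needs.
        return task in self.data

    def perform(self, task: str) -> int:
        # Assumed privacy policy: return a local aggregate, never raw records.
        return sum(self.data[task])


class WorkloadNode:
    """Hypothetical workload distribution node in front of some clusters."""

    def __init__(self, clusters: List[Cluster]):
        self.clusters = clusters

    def handle(self, request: List[str]) -> int:
        # Identify clusters suitable for at least a portion of the request.
        suitable = [c for c in self.clusters
                    if any(c.can_perform(t) for t in request)]
        # Separate the request into data tasks (assumed: one per dataset).
        tasks = list(request)
        results = []
        for task in tasks:
            # Provide the task and collect each cluster's ability indication.
            able = [c for c in suitable if c.can_perform(task)]
            # Assign the task responsive to the indications; gather results.
            results.extend(c.perform(task) for c in able)
        # Aggregate the results into a single response for the source.
        return sum(results)


def handle_higher_level(request: Dict[str, List[str]],
                        nodes: Dict[str, WorkloadNode]) -> int:
    # The "other" node: split a higher-level request into one data task per
    # downstream node, then aggregate the responses those nodes return.
    return sum(nodes[region].handle(task) for region, task in request.items())


emea = WorkloadNode([
    Cluster("a", {"sales": [1, 2, 3]}),
    Cluster("b", {"sales": [4], "returns": [10]}),
])
apac = WorkloadNode([Cluster("c", {"sales": [5, 5]})])

print(emea.handle(["sales", "returns"]))  # 20
print(handle_higher_level(
    {"emea": ["sales", "returns"], "apac": ["sales"]},
    {"emea": emea, "apac": apac},
))  # 30
```

In this sketch the upstream function only ever sees one number per downstream node, and each node only ever sees one number per cluster, which is how the assumed privacy policy is preserved through both levels of aggregation.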
