
Beacon-based distributed data processing platform

  • US 10,277,668 B1
  • Filed: 12/29/2015
  • Issued: 04/30/2019
  • Est. Priority Date: 04/06/2015
  • Status: Active Grant
First Claim

1. A method comprising:

  • initiating a first application in a first one of a plurality of distributed processing nodes, wherein the plurality of distributed processing nodes comprise a plurality of clusters associated with respective data zones, each of the clusters being configured to perform processing operations utilizing local data resources locally accessible within its corresponding data zone, wherein at least a subset of the plurality of clusters comprise respective Yet Another Resource Negotiator (YARN) clusters;

    determining a plurality of data resources to be utilized by at least a portion of the first application, the plurality of data resources comprising at least one data resource not locally accessible within a first data zone associated with the first distributed processing node;

    responsive to initiation of the first application, identifying a plurality of beacon entities to be contacted to determine additional ones of the plurality of distributed processing nodes having associated data zones which include the at least one data resource not locally accessible within the first data zone associated with the first distributed processing node, wherein the data zones are associated with respective ones of the beacon entities and further wherein the beacon entities comprise at least one of:

    (i) respective participants in a beacon network, and (ii) respective geographically-distributed regional data centers each configured to perform analytics processing utilizing locally accessible data resources of a corresponding data zone;

    for each of at least a subset of the identified beacon entities, initiating an additional application in an additional one of the plurality of distributed processing nodes, wherein each of the beacon entities is configured to perform processing operations associated with at least one of the first and one or more additional applications utilizing data resources locally accessible within a corresponding data zone;

    aggregating processing results from the first and one or more additional processing nodes; and

    providing the aggregated processing results to a client;

    wherein the processing results from the first and one or more additional processing nodes preserve at least one policy of those nodes in the respective local data resources wherein the policy relates to at least one of privacy, security, governance, risk and compliance;

    wherein the policy specifies:

    that processing results generated by a given one of the additional applications initiated in a given one of the additional processing nodes utilizing data resources locally accessible within a given data zone may be transmitted to the first processing node; and

    that the data resources locally accessible within the given data zone associated with the given additional processing node utilized to obtain the transmitted processing results are not permitted to be transmitted to the first processing node; and

    wherein the method is implemented by at least one processing device comprising a processor coupled to a memory.
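
The flow recited in claim 1 can be illustrated with a minimal Python sketch. It is not taken from the patent: the names `DataZone`, `Beacon`, `LocalResult`, `run_local_app` and `run_first_app` are hypothetical, the "processing operation" is assumed to be a simple count and sum, and in-memory dictionaries stand in for YARN clusters and their local data resources. The sketch only shows the shape of the method: the first node finds which needed resources are not local, asks beacon entities which zones hold them, starts an additional application in each such zone, and aggregates summaries while the raw data stays where it is.

```python
from dataclasses import dataclass, field


@dataclass
class DataZone:
    """Hypothetical data zone; its raw resources are never sent elsewhere."""
    name: str
    resources: dict[str, list[float]] = field(default_factory=dict)


@dataclass
class Beacon:
    """Hypothetical beacon entity that reports which resources its zone holds."""
    zone: DataZone

    def locate(self, wanted: set[str]) -> set[str]:
        # Only resource names are disclosed, not resource contents.
        return wanted & set(self.zone.resources)


@dataclass
class LocalResult:
    """Summary that is allowed to cross zone boundaries (assumed policy)."""
    zone: str
    count: int
    total: float


def run_local_app(zone: DataZone, names: set[str]) -> LocalResult:
    # Runs inside the zone, so it may read the raw values directly.
    values = [v for n in sorted(names) for v in zone.resources[n]]
    return LocalResult(zone.name, len(values), sum(values))


def run_first_app(first_zone: DataZone, beacons: list[Beacon],
                  needed: set[str]) -> dict:
    # Split the needed resources into local and non-local ones.
    local = needed & set(first_zone.resources)
    missing = needed - local
    results = [run_local_app(first_zone, local)]

    # Contact beacons; start an additional application wherever data is found.
    for beacon in beacons:
        found = beacon.locate(missing)
        if found:
            results.append(run_local_app(beacon.zone, found))
            missing -= found

    # Aggregate the per-zone summaries into the result given to the client.
    count = sum(r.count for r in results)
    total = sum(r.total for r in results)
    return {
        "zones": [r.zone for r in results],
        "count": count,
        "mean": total / count if count else None,
        "unresolved": sorted(missing),
    }
```

Calling `run_first_app(zone_a, [Beacon(zone_b), Beacon(zone_c)], {"a", "b", "c"})` returns a dictionary built only from `LocalResult` summaries, which mirrors the policy limitation that processing results may be transmitted to the first node while the underlying data resources may not.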
