×

Usage reporting from a cloud-hosted, distributed system

  • US 9,317,395 B2
  • Filed: 12/15/2011
  • Issued: 04/19/2016
  • Est. Priority Date: 11/14/2011
  • Status: Active Grant
First Claim
Patent Images

1. At a tracker service in a cluster computing environment, a method of collecting usage data, the method comprising:

  • receiving a request from a first cluster deployment comprising a cluster deployment creator service, the request requesting an interval over which a plurality of cluster deployments, including the first cluster deployment as well as at least a second cluster deployment that is created by the first cluster deployment, are to report usage information in usage reports, the usage information defining at least (i) one or more first characteristics of at least one first job that is assigned to the first cluster deployment and that is being run on a plurality of first worker nodes in the first cluster deployment, and (ii) one or more second characteristics of at least one second job that is assigned to the second cluster deployment and that is being run on a plurality of second worker nodes in the second cluster deployment;

    in response to the request from the first cluster deployment, providing the interval to the first cluster deployment;

    receiving a plurality of usage reports according to the interval, including receiving at least;

    (i) a plurality of first usage reports from a plurality of first aggregator instances running in the first cluster deployment, each of the plurality of first usage reports including a first deployment identifier identifying the first cluster deployment and providing usage information for the at least one first job that is being run on the plurality of first worker nodes, and(ii) a plurality of second usage reports from a plurality of second aggregator instances running in the second cluster deployment, each of the plurality of second usage reports including a second deployment identifier identifying the second cluster deployment and providing usage information for the at least one second job that is run on the plurality of second worker nodes; and

    identifying duplicate data in the plurality of usage reports, including;

    based on the first deployment identifier, identifying first duplicate data among the plurality of first usage reports regarding the at least one first job, the first duplicate data having been sent by each of at least two of the plurality of first aggregator instances; and

    based on the second deployment identifier, identifying second duplicate data among the plurality of second usage reports regarding the at least one second job, the second duplicate data having been sent by each of at least two of the plurality of second aggregator instances.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×