Usage reporting from a cloud-hosted, distributed system
First Claim
Patent Images
1. At a tracker service in a cluster computing environment, a method of collecting usage data, the method comprising:
- receiving a request from a first cluster deployment comprising a cluster deployment creator service, the request requesting an interval over which a plurality of cluster deployments, including the first cluster deployment as well as at least a second cluster deployment that is created by the first cluster deployment, are to report usage information in usage reports, the usage information defining at least (i) one or more first characteristics of at least one first job that is assigned to the first cluster deployment and that is being run on a plurality of first worker nodes in the first cluster deployment, and (ii) one or more second characteristics of at least one second job that is assigned to the second cluster deployment and that is being run on a plurality of second worker nodes in the second cluster deployment;
in response to the request from the first cluster deployment, providing the interval to the first cluster deployment;
receiving a plurality of usage reports according to the interval, including receiving at least;
(i) a plurality of first usage reports from a plurality of first aggregator instances running in the first cluster deployment, each of the plurality of first usage reports including a first deployment identifier identifying the first cluster deployment and providing usage information for the at least one first job that is being run on the plurality of first worker nodes, and(ii) a plurality of second usage reports from a plurality of second aggregator instances running in the second cluster deployment, each of the plurality of second usage reports including a second deployment identifier identifying the second cluster deployment and providing usage information for the at least one second job that is run on the plurality of second worker nodes; and
identifying duplicate data in the plurality of usage reports, including;
based on the first deployment identifier, identifying first duplicate data among the plurality of first usage reports regarding the at least one first job, the first duplicate data having been sent by each of at least two of the plurality of first aggregator instances; and
based on the second deployment identifier, identifying second duplicate data among the plurality of second usage reports regarding the at least one second job, the second duplicate data having been sent by each of at least two of the plurality of second aggregator instances.
2 Assignments
0 Petitions
Accused Products
Abstract
Collecting usage data in a cluster computing environment. A method includes at a tracker service receiving a request from an at least partially cloud based deployment for an interval for the deployment to report usage information in usage reports. The usage information includes information defining how software in a deployment is used. In response to the request from the deployment, the method further includes, the tracker service providing an interval to the deployment. The method further includes at the tracker service, receiving usage reports from the deployment according to the provided interval.
-
Citations
17 Claims
-
1. At a tracker service in a cluster computing environment, a method of collecting usage data, the method comprising:
-
receiving a request from a first cluster deployment comprising a cluster deployment creator service, the request requesting an interval over which a plurality of cluster deployments, including the first cluster deployment as well as at least a second cluster deployment that is created by the first cluster deployment, are to report usage information in usage reports, the usage information defining at least (i) one or more first characteristics of at least one first job that is assigned to the first cluster deployment and that is being run on a plurality of first worker nodes in the first cluster deployment, and (ii) one or more second characteristics of at least one second job that is assigned to the second cluster deployment and that is being run on a plurality of second worker nodes in the second cluster deployment; in response to the request from the first cluster deployment, providing the interval to the first cluster deployment; receiving a plurality of usage reports according to the interval, including receiving at least; (i) a plurality of first usage reports from a plurality of first aggregator instances running in the first cluster deployment, each of the plurality of first usage reports including a first deployment identifier identifying the first cluster deployment and providing usage information for the at least one first job that is being run on the plurality of first worker nodes, and (ii) a plurality of second usage reports from a plurality of second aggregator instances running in the second cluster deployment, each of the plurality of second usage reports including a second deployment identifier identifying the second cluster deployment and providing usage information for the at least one second job that is run on the plurality of second worker nodes; and identifying duplicate data in the plurality of usage reports, including; based on the first deployment identifier, identifying first duplicate data among the plurality of first usage reports regarding the at least one first job, the first duplicate data having been sent by each of at least two of the plurality of first aggregator instances; and based on the second deployment identifier, identifying second duplicate data among the plurality of second usage reports regarding the at least one second job, the second duplicate data having been sent by each of at least two of the plurality of second aggregator instances. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A computer program product comprising one or more hardware storage devices having stored thereon computer executable instructions that, when executed by one or more processors of a computer system, cause the computer system to collect usage data in a cluster computing environment, including the following:
-
receiving a request from a first cluster deployment comprising a cluster deployment creator service, the request requesting an interval over which a plurality of cluster deployments, including the first cluster deployment as well as at least a second cluster deployment that is created by the first cluster deployment, are to report usage information in usage reports, the usage information defining at least (i) one or more first characteristics of at least one first job that is assigned to the first cluster deployment and that is being run on a plurality of first worker nodes in the first cluster deployment, and (ii) one or more second characteristics of at least one second job that is assigned to the second cluster deployment and that is being run on a plurality of second worker nodes in the second cluster deployment; in response to the request from the first cluster deployment, providing the interval to the first cluster deployment; receiving a plurality of usage reports according to the interval, including receiving at least; (i) a plurality of first usage reports from a plurality of first aggregator instances running in the first cluster deployment each of the plurality of first usage reports including a first deployment identifier identifying the first cluster deployment and providing usage information for the at least one first job that is being run on the plurality of first worker nodes, and (ii) a plurality of second usage reports from a plurality of second aggregator instances running in the second cluster deployment, each of the plurality of second usage reports including a second deployment identifier identifying the second cluster deployment and providing usage information for the at least one second job that is run on the plurality of second worker nodes; and identifying, duplicate data in the plurality of usage reports, including; based on the first deployment identifier, identifying first duplicate data among the plurality of first usage reports regarding the at least one first job, the first duplicate data having been sent by each of at least two of the plurality of first aggregator instances; and based on the second deployment identifier, identifying second duplicate data among the plurality of second usage reports regarding the at least one second job, the second duplicate data having been sent by each of at least two of the plurality of second aggregator instances. - View Dependent Claims (15)
-
-
16. A computer system, comprising:
-
one or more hardware processors; and one or more hardware storage devices having stored thereon computer executable instructions representing a tracker service, and wherein the tracker services is configured to perform a least the following; receive a request a first cluster deployment comprising a cluster deployment creator service, the request requesting an interval over which a plurality of cluster deployments, including the first cluster deployment as well as at least a second cluster deployment that is created by the first cluster deployment, are to report usage information in usage reports, the usage information defining at least (i) one or more first characteristics of at least one first job that is assigned to the first cluster deployment and that is being run on a plurality of first worker nodes in the first cluster deployment, and (ii) one or more second characteristics of at least one second job that is assigned to the second cluster deployment and that is being run on a plurality of second worker nodes in the second cluster deployment; in response to the request from the first cluster deployment, provide the interval to the first cluster deployment; receive a plurality of usage reports according to the interval, including receiving at least; (i) a plurality of first usage reports from a plurality of first aggregator instances running in the first cluster deployment, each of the plurality of first usage reports including a first deployment identifier identifying the first cluster deployment and providing usage information for the at least one first job that is being run on the plurality of first worker nodes, and (ii) a plurality of second usage reports from a plurality of second aggregator instances running in the second cluster deployment, each of the plurality of second usage reports including a second deployment identifier identifying the second cluster deployment and providing usage information for the at least one second job that is run on the plurality of second worker nodes; and identify duplicate data in the plurality of usage reports, including; based on the first deployment identifier, identifying first duplicate data among the plurality of first usage reports regarding the at least one first job, the first duplicate data having been sent by each of at least two of the plurality of first aggregator instances; and based on the second deployment identifier, identifying second duplicate data among the plurality of second usage reports regarding the at least one second job, the second duplicate data having been sent by each of at least two of the plurality of second aggregator instances. - View Dependent Claims (17)
-
Specification