Analytics platform for scalable distributed computations
Abstract
An analytics platform is configured to receive a request to execute a distributed analytics workflow specification comprising multiple distributed computations to be performed across a plurality of data processing clusters associated with respective data zones. For each of a plurality of analytic services to be executed in accordance with the distributed analytics workflow specification, the platform determines if execution of the analytic service is authorized for the request, and for each of a plurality of datasets to be operated on by one or more of the analytic services in accordance with the distributed analytics workflow specification, the platform determines if execution of the one or more analytic services on the dataset is authorized by its corresponding data zone for the request. Responsive to each of the analytic services and each of the datasets being authorized for the request, the platform initiates execution of the distributed computations of the distributed analytics workflow specification across the data processing clusters.
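The authorization gate described in the abstract can be sketched as follows. This is a minimal illustration only, not the patented implementation: the names `Dataset`, `DataZone`, `Workflow`, `execute_workflow`, and `service_acl`, and the set-based policy model, are all assumptions introduced for this example. The sketch shows the two checks (per analytic service, then per dataset by its own data zone) and that computations are initiated only when every check passes.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Dataset:
    name: str
    zone: str  # name of the data zone that holds this dataset


@dataclass
class DataZone:
    name: str
    # Hypothetical policy model: (requester, service, dataset) triples
    # that this zone permits.
    allowed: set = field(default_factory=set)

    def authorizes(self, requester, service, dataset):
        return (requester, service, dataset) in self.allowed


@dataclass
class Workflow:
    # Maps each analytic service name to the datasets it operates on.
    steps: dict


def execute_workflow(requester, workflow, service_acl, zones):
    """Return the (service, dataset, zone) computations to launch.

    Raises PermissionError if any service or dataset check fails,
    in which case nothing is launched.
    """
    # Check 1: each analytic service must be authorized for the request.
    for service in workflow.steps:
        if requester not in service_acl.get(service, set()):
            raise PermissionError(f"service {service!r} not authorized")

    # Check 2: each dataset must be authorized by its own data zone.
    for service, datasets in workflow.steps.items():
        for ds in datasets:
            if not zones[ds.zone].authorizes(requester, service, ds.name):
                raise PermissionError(
                    f"zone {ds.zone!r} denies {service!r} on {ds.name!r}"
                )

    # All checks passed: one computation per (service, dataset) pair,
    # to be run on the cluster associated with the dataset's zone.
    return [
        (service, ds.name, ds.zone)
        for service, datasets in workflow.steps.items()
        for ds in datasets
    ]
```

In this sketch the gate is all-or-nothing: a single denied service or dataset blocks the whole workflow, which mirrors the "responsive to each ... being authorized" condition. Whether a real platform would instead allow partial execution is not stated in the source.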
20 Claims
1. A method comprising:
receiving a request to execute a distributed analytics workflow specification comprising multiple distributed computations to be performed across a plurality of data processing clusters associated with respective data zones;
for each of a plurality of analytic services to be executed in accordance with the distributed analytics workflow specification, determining if execution of the analytic service is authorized for the request;
for each of a plurality of datasets to be operated on by one or more of the analytic services in accordance with the distributed analytics workflow specification, determining if execution of the one or more analytic services on the dataset is authorized by its corresponding data zone for the request; and
responsive to each of the analytic services and each of the datasets being authorized for the request, initiating execution of the distributed computations of the distributed analytics workflow specification across the data processing clusters;
wherein the method is performed by an analytics platform comprising at least one processing device comprising a processor coupled to a memory.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14
15. A computer program product comprising a non-transitory processor-readable storage medium having stored therein program code of one or more software programs, wherein the program code when executed by at least one processing device causes said at least one processing device:
to receive a request to execute a distributed analytics workflow specification comprising multiple distributed computations to be performed across a plurality of data processing clusters associated with respective data zones;
for each of a plurality of analytic services to be executed in accordance with the distributed analytics workflow specification, to determine if execution of the analytic service is authorized for the request;
for each of a plurality of datasets to be operated on by one or more of the analytic services in accordance with the distributed analytics workflow specification, to determine if execution of the one or more analytic services on the dataset is authorized by its corresponding data zone for the request; and
responsive to each of the analytic services and each of the datasets being authorized for the request, to initiate execution of the distributed computations of the distributed analytics workflow specification across the data processing clusters.
Dependent claims: 16, 17
18. An apparatus comprising:
at least one processing device having a processor coupled to a memory;
wherein said at least one processing device is configured:
to receive a request to execute a distributed analytics workflow specification comprising multiple distributed computations to be performed across a plurality of data processing clusters associated with respective data zones;
for each of a plurality of analytic services to be executed in accordance with the distributed analytics workflow specification, to determine if execution of the analytic service is authorized for the request;
for each of a plurality of datasets to be operated on by one or more of the analytic services in accordance with the distributed analytics workflow specification, to determine if execution of the one or more analytic services on the dataset is authorized by its corresponding data zone for the request; and
responsive to each of the analytic services and each of the datasets being authorized for the request, to initiate execution of the distributed computations of the distributed analytics workflow specification across the data processing clusters.
Dependent claims: 19, 20
Specification