Data analytics computing resource provisioning based on computed cost and time parameters for proposed computing resource configurations
First Claim
1. A method comprising:
- obtaining, via one or more processing elements associated with a computing system, a work package defining a data analytic plan for analyzing a given data set associated with a given data problem, the work package being generated based on an automated data analytics lifecycle, wherein the work package is associated with a priority level corresponding to a requester, and wherein the computing system is configured to dynamically change a set of parameters associated with the work package;
analyzing, via the one or more processing elements, the data analytic plan and the given data set;
mapping, via the one or more processing elements, one or more algorithms and one or more data set characteristics associated with the given data set onto one or more tiers of the computing system to generate one or more computing resource configuration options for implementing the data analytic plan;
computing, via the one or more processing elements, based on at least a portion of results of the analysis and the mapping, at least one of a cost parameter and a time parameter for each of the one or more computing resource configuration options, wherein the cost parameter comprises a computing power metric and the time parameter comprises a computing time metric;
restricting one or more operations of the one or more computing resource configuration options based on at least one security policy and at least one privacy policy, wherein the at least one security policy and the at least one privacy policy each specify one or more data access rules;
presenting, via the one or more processing elements, at least one of the cost parameter and the time parameter for each of the one or more computing resource configuration options;
selecting, via the one or more processing elements, one of the one or more computing resource configuration options based on at least one of the cost parameter and the time parameter, and the priority level of the work package; and
provisioning, via the one or more processing elements, a computing resource infrastructure based on the selected computing resource configuration option.
9 Assignments
0 Petitions
Accused Products
Abstract
A work package is obtained defining a data analytic plan for analyzing a given data set associated with a given data problem. The work package is generated in accordance with an automated data analytics lifecycle. The data analytic plan and the given data set are analyzed. Based on at least a portion of results of the analysis, at least one of a cost parameter and a time parameter is computed for one or more computing resource configurations proposed for implementing the data analytic plan. One of the one or more computing resource configurations is selected based on at least one of the cost parameter and the time parameter. A computing resource infrastructure is caused to be provisioned in accordance with the selected computing resource configuration. The above steps are performed on one or more processing elements associated with a computing system.
43 Citations
20 Claims
-
1. A method comprising:
-
obtaining, via one or more processing elements associated with a computing system, a work package defining a data analytic plan for analyzing a given data set associated with a given data problem, the work package being generated based on an automated data analytics lifecycle, wherein the work package is associated with a priority level corresponding to a requester, and wherein the computing system is configured to dynamically change a set of parameters associated with the work package; analyzing, via the one or more processing elements, the data analytic plan and the given data set; mapping, via the one or more processing elements, one or more algorithms and one or more data set characteristics associated with the given data set onto one or more tiers of the computing system to generate one or more computing resource configuration options for implementing the data analytic plan; computing, via the one or more processing elements, based on at least a portion of results of the analysis and the mapping, at least one of a cost parameter and a time parameter for each of the one or more computing resource configuration options, wherein the cost parameter comprises a computing power metric and the time parameter comprises a computing time metric; restricting one or more operations of the one or more computing resource configuration options based on at least one security policy and at least one privacy policy, wherein the at least one security policy and the at least one privacy policy each specify one or more data access rules; presenting, via the one or more processing elements, at least one of the cost parameter and the time parameter for each of the one or more computing resource configuration options; selecting, via the one or more processing elements, one of the one or more computing resource configuration options based on at least one of the cost parameter and the time parameter, and the priority level of the work package; and provisioning, via the one or more processing elements, a computing resource infrastructure based on the selected computing resource configuration option. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A computer program product comprising a non-transitory processor-readable storage medium having encoded therein executable code of one or more software programs, wherein the one or more software programs when executed by one or more processing elements of a computing system implement the steps of:
-
obtaining a work package defining a data analytic plan for analyzing a given data set associated with a given data problem, the work package being generated based on an automated data analytics lifecycle, wherein the work package is associated with a priority level corresponding to a requester, and wherein the computing system is configured to dynamically change a set of parameters associated with the work package; analyzing the data analytic plan and the given data set; mapping one or more algorithms and one or more data set characteristics associated with the given data set onto one or more tiers of the computing system to generate one or more computing resource configuration options for implementing the data analytic plan; computing, based on at least a portion of results of the analysis and the mapping, at least one of a cost parameter and a time parameter for each of the one or more computing resource configuration options, wherein the cost parameter comprises a computing power metric and the time parameter comprises a computing time metric; restricting one or more operations of the one or more computing resource configuration options based on at least one security policy and at least one privacy policy, wherein the at least one security policy and the at least one privacy policy each specify one or more data access rules; presenting at least one of the cost parameter and the time parameter for each of the one or more computing resource configuration options; selecting one of the one or more computing resource configuration options based on at least one of the cost parameter and the time parameter, and the priority level of the work package; and provisioning a computing resource infrastructure based on the selected computing resource configuration option.
-
-
12. An apparatus comprising:
-
a memory; and at least one processor operatively coupled to the memory and configured to; obtain a work package defining a data analytic plan for analyzing a given data set associated with a given data problem, the work package being generated based on an automated data analytics lifecycle, wherein the work package is associated with a priority level corresponding to a requester, and wherein the computing system is configured to dynamically change a set of parameters associated with the work package; analyze the data analytic plan and the given data set; map one or more algorithms and one or more data set characteristics associated with the given data set onto one or more tiers of the computing system to generate one or more computing resource configuration options for implementing the data analytic plan; compute, based on at least a portion of results of the analysis and the mapping, at least one of a cost parameter and a time parameter for each of the one or more computing resource configuration options, wherein the cost parameter comprises a computing power metric and the time parameter comprises a computing time metric; restrict one or more operations of the one or more computing resource configuration options based on at least one security policy and at least one privacy policy, wherein the at least one security policy and the at least one privacy policy each specify one or more data access rules; present at least one of the cost parameter and the time parameter for each of the one or more computing resource configuration options; select one of the one or more computing resource configuration options based on at least one of the cost parameter and the time parameter, and the priority level of the work package; and provision a computing resource infrastructure based on the selected computing resource configuration option. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20)
-
Specification