DETERMINING STORAGE TIERS FOR PLACEMENT OF DATA SETS DURING EXECUTION OF TASKS IN A WORKFLOW
First Claim
1. A computer program product for assigning tasks to storage tiers to store data sets processed by the tasks, wherein the computer program product comprises a computer readable storage medium having program instructions embodied therewith, the program instructions executable by a processor to cause operations, the operations comprising:
- processing a representation of a workflow execution pattern of tasks for a job indicating a dependency of the tasks and data sets operated on by the tasks;
determining an assignment of the data sets for the tasks to a plurality of the storage tiers based on the dependency of the tasks indicated in the workflow execution pattern, wherein a higher performing storage tier includes faster access storage devices than a relatively lower performing storage tier; and
scheduling a moving of a subject data set of the data sets operated on by a subject task of the tasks that is subject to an event to an assigned storage tier indicated in the assignment for the subject task subject, wherein the moving of the subject data set is scheduled to be performed in response to the event with respect to the subject task.
1 Assignment
0 Petitions
Accused Products
Abstract
Provided are a computer program product, system, and method for determining storage tiers for placement of data sets during execution of tasks in a workflow. A representation of a workflow execution pattern of tasks for a job indicates a dependency of the tasks and data sets operated on by the tasks. A determination is made of an assignment of the data sets for the tasks to a plurality of the storage tiers based on the dependency of the tasks indicated in the workflow execution pattern. A moving is scheduled of a subject data set of the data sets operated on by a subject task of the tasks that is subject to an event to an assigned storage tier indicated in the assignment for the subject task subject. The moving of the data set is scheduled to be performed in response to the event with respect to the subject task.
-
Citations
23 Claims
-
1. A computer program product for assigning tasks to storage tiers to store data sets processed by the tasks, wherein the computer program product comprises a computer readable storage medium having program instructions embodied therewith, the program instructions executable by a processor to cause operations, the operations comprising:
-
processing a representation of a workflow execution pattern of tasks for a job indicating a dependency of the tasks and data sets operated on by the tasks; determining an assignment of the data sets for the tasks to a plurality of the storage tiers based on the dependency of the tasks indicated in the workflow execution pattern, wherein a higher performing storage tier includes faster access storage devices than a relatively lower performing storage tier; and scheduling a moving of a subject data set of the data sets operated on by a subject task of the tasks that is subject to an event to an assigned storage tier indicated in the assignment for the subject task subject, wherein the moving of the subject data set is scheduled to be performed in response to the event with respect to the subject task. - View Dependent Claims (2, 3, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
4-5. -5. (canceled)
-
14. A system coupled to a plurality of storage tiers, comprising:
-
a plurality of computational nodes; and a computer readable storage medium having program instructions that when executed by the computational nodes perform operations, the operations comprising; processing a representation of a workflow execution pattern of tasks for a job indicating a dependency of the tasks and data sets operated on by the tasks; determining an assignment of the data sets for the tasks to a plurality of the storage tiers based on the dependency of the tasks indicated in the workflow execution pattern, wherein a higher performing storage tier includes faster access storage devices than a relatively lower performing storage tier; and scheduling a moving of a subject data set of the data sets operated on by a subject task of the tasks that is subject to an event to an assigned storage tier indicated in the assignment for the subject task subject, wherein the moving of the subject data set is scheduled to be performed in response to the event with respect to the subject task. - View Dependent Claims (17, 18)
-
-
15-16. -16. (canceled)
-
19. A method tasks to storage tiers to store data sets processed by the tasks, comprising:
-
processing a representation of a workflow execution pattern of tasks for a job indicating a dependency of the tasks and data sets operated on by the tasks; determining an assignment of the data sets for the tasks to a plurality of the storage tiers based on the dependency of the tasks indicated in the workflow execution pattern, wherein a higher performing storage tier includes faster access storage devices than a relatively lower performing storage tier; and scheduling a moving of a subject data set of the data sets operated on by a subject task of the tasks that is subject to an event to an assigned storage tier indicated in the assignment for the subject task subject, wherein the moving of the subject data set is scheduled to be performed in response to the event with respect to the subject task. - View Dependent Claims (22, 23)
-
-
20-21. -21. (canceled)
Specification