Generation of job flow objects in federated areas from data structure

  • US 10,380,185 B2
  • Filed: 12/18/2018
  • Issued: 08/13/2019
  • Est. Priority Date: 02/05/2016
  • Status: Active Grant
First Claim

1. An apparatus comprising a processor and a storage to store instructions that, when executed by the processor, cause the processor to perform operations comprising:

  • receive, by the processor and from a requesting device, a request to provide, within a federated area specified in the request, a set of objects that enable a performance of a job flow to perform multiple tasks of an analysis that is based on data and multiple formulae incorporated into a spreadsheet data structure specified in the request, wherein:

    the set of objects comprises at least one data object comprising data required as an input to the job flow, at least one task routine comprising executable instructions to perform a task of the multiple tasks when executed, and a job flow definition that specifies an order of performance of the multiple tasks;

    the data incorporated into the spreadsheet data structure is organized into at least one data table within the spreadsheet data structure;

    each data table is divisible into multiple data subparts that each comprise at least one row or at least one column of the data table;

    the multiple formulae are organized into at least one formula table within the spreadsheet data structure; and

    each formula of the multiple formulae specifies a task of the multiple tasks, and incorporates at least one indication of data required as input to perform the task and at least one indication of data that is generated as output when the task is performed;

    correlate each indication of data required as input to a subpart of a data table of the at least one data table;

    correlate each indication of data generated as output to a subpart of a data table of the at least one data table;

    among the multiple formulae, correlate the indications of data required as input to each formula to the indications of data generated as output to identify data dependencies among the multiple tasks, and to identify an error in dependency among at least two tasks of the multiple tasks;

    identify at least one pair of tasks of the multiple tasks that are able to be performed in parallel due to a lack of data dependencies therebetween;

    determine an order of performance of the multiple tasks based on the identification of the data dependencies and the at least one pair of tasks;

    for each data table of the at least one data table that includes at least one subpart that is correlated to an indication of data required as input, and that is not correlated to an indication of data generated as output, generate, within the specified federated area, a data object of the at least one data object that includes the data within the at least one subpart;

    generate, within the specified federated area, a job flow definition that specifies the order of performance of the multiple tasks, wherein:

    each task of the multiple tasks is specified with a unique flow task identifier of multiple flow task identifiers; and

    the job flow definition includes an indication of each identified pair of tasks able to be performed in parallel;

    for each task routine of the at least one task routine that is to perform a task of the multiple tasks when executed, generate, within the specified federated area, a corresponding macro data structure of multiple macro data structures, wherein each macro data structure comprises:

    the flow task identifier of the task performed by the corresponding task routine;

    a task routine identifier that uniquely identifies the corresponding task routine;

    indications of characteristics of at least one input interface for each input that is required to perform the task; and

    indications of characteristics of at least one output interface for each output that is generated when the task is performed; and

    in response to the identification of an error in dependency among at least two tasks of the multiple tasks, augment the macro data structure that corresponds to one task of the at least two tasks to include an indication of the error in dependency to enable an inclusion of a corresponding visual error indication in a visual representation of the corresponding task routine.
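
The dependency analysis recited in the claim can be illustrated with a minimal Python sketch. It is not the patented implementation. The `Formula` class, the function names, the subpart naming such as "raw!A", and the dictionary layout of the macro records are all hypothetical, introduced only for illustration. The sketch assumes that an "error in dependency" means a circular dependency, and it flags every task blocked by the cycle rather than a single task; the claim itself does not fix either choice.

```python
from dataclasses import dataclass
from itertools import combinations


@dataclass
class Formula:
    """Hypothetical stand-in for one row of a formula table."""
    task_id: str   # flow task identifier
    inputs: set    # data subparts the task reads, e.g. "raw!A"
    outputs: set   # data subparts the task writes


def build_dependencies(formulae):
    """Map each task to the set of other tasks whose outputs it reads."""
    producer = {name: f.task_id for f in formulae for name in f.outputs}
    # Simplification: a task that reads its own output is not treated
    # as depending on itself.
    return {
        f.task_id: {producer[n] for n in f.inputs if n in producer} - {f.task_id}
        for f in formulae
    }


def plan_stages(formulae):
    """Return (stages, errored): ordered stages, plus tasks blocked by a cycle."""
    remaining = build_dependencies(formulae)
    stages = []
    while remaining:
        ready = sorted(t for t, deps in remaining.items() if not deps)
        if not ready:
            # Every remaining task waits on another one, so no valid order exists.
            return stages, sorted(remaining)
        stages.append(ready)
        for task in ready:
            del remaining[task]
        for deps in remaining.values():
            deps.difference_update(ready)
    return stages, []


def parallel_pairs(stages):
    """Pairs of tasks sharing a stage; no data dependency links them."""
    return [pair for stage in stages for pair in combinations(stage, 2)]


def source_subparts(formulae):
    """Subparts required as input but never generated as output."""
    read = set().union(*(f.inputs for f in formulae))
    written = set().union(*(f.outputs for f in formulae))
    return sorted(read - written)


def build_macros(formulae, errored):
    """One hypothetical macro record per task, with an error flag."""
    return [
        {
            "flow_task_id": f.task_id,
            "inputs": sorted(f.inputs),
            "outputs": sorted(f.outputs),
            "dependency_error": f.task_id in errored,
        }
        for f in formulae
    ]


formulae = [
    Formula("clean", {"raw!A"}, {"work!A"}),
    Formula("scale", {"raw!B"}, {"work!B"}),
    Formula("merge", {"work!A", "work!B"}, {"out!A"}),
]
stages, errored = plan_stages(formulae)
print(stages)                     # [['clean', 'scale'], ['merge']]
print(parallel_pairs(stages))     # [('clean', 'scale')]
print(source_subparts(formulae))  # ['raw!A', 'raw!B']
```

Grouping tasks into stages yields both an order of performance and a set of parallel pairs in one pass. It reports only pairs that share a stage, which is a subset of all independent pairs but is enough to find at least one pair when any exists in the same stage.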
