
Techniques for configuring and validating a data pipeline deployment

  • US 10,534,595 B1
  • Filed: 05/11/2018
  • Issued: 01/14/2020
  • Est. Priority Date: 06/30/2017
  • Status: Active Grant
First Claim

1. A method, comprising:

  • receiving a template that defines a plurality of job definitions;

    wherein each particular job definition of the plurality of job definitions corresponds to a particular data processing job, and wherein each particular job definition comprises:

    a code identifier that identifies code for processing the particular data processing job;

    a plurality of dataset dependency identifiers that identify a plurality of input datasets for the particular data processing job;

    a plurality of configuration parameters for processing the particular data processing job;

    for each particular job definition of the plurality of job definitions:

    based on the template, causing to be displayed a user interface for receiving a plurality of configuration parameter values for the plurality of configuration parameters for the particular job definition;

    receiving the plurality of configuration parameter values for the particular job definition;

    executing the corresponding particular data processing job for the particular job definition by executing the code for processing the particular data processing job, by using the input datasets for the particular data processing job and the plurality of configuration parameter values;

    in response to a command to perform a validation of a target data processing job that corresponds to a target job definition of the plurality of job definitions, executing the target data processing job for the target job definition by executing the code for processing the target data processing job, by using the input datasets for the target data processing job and the plurality of configuration parameter values for the target data processing job;

    applying one or more validation criteria to the target data processing job to generate a validation value that indicates a metric of accuracy of the plurality of configuration parameter values for the target data processing job;

    wherein the method is performed using one or more processors.
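
The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation and does not define the claim's scope. Every name in it (`JobDefinition`, `CODE_REGISTRY`, `filter_rows`, `run_job`, `validate_job`, the dataset names, and the `threshold` and `limit` parameters) is hypothetical. The user interface step is stood in for by a plain dictionary of configuration parameter values, and the validation value is assumed, for illustration only, to be the fraction of validation criteria that the job output satisfies.

```python
from dataclasses import dataclass
from typing import Any, Callable, Dict, List

Rows = List[int]
JobCode = Callable[[Dict[str, Rows], Dict[str, Any]], Rows]
Criterion = Callable[[Rows], bool]


@dataclass
class JobDefinition:
    """One job definition from the template (hypothetical structure)."""
    code_id: str                     # identifies the code that processes the job
    dataset_dependencies: List[str]  # identifiers of the input datasets
    config_parameters: List[str]     # names of the parameters the user must supply


def filter_rows(inputs: Dict[str, Rows], values: Dict[str, Any]) -> Rows:
    """Hypothetical job code: merge inputs, keep large values, cap the count."""
    merged = [v for rows in inputs.values() for v in rows]
    kept = [v for v in merged if v >= values["threshold"]]
    return kept[: values["limit"]]


# Maps a code identifier to the code it identifies (an assumption of this sketch).
CODE_REGISTRY: Dict[str, JobCode] = {"filter_rows": filter_rows}


def run_job(job: JobDefinition, datasets: Dict[str, Rows],
            values: Dict[str, Any]) -> Rows:
    """Execute the job's code on its input datasets and parameter values."""
    missing = [p for p in job.config_parameters if p not in values]
    if missing:
        raise ValueError(f"missing configuration parameter values: {missing}")
    inputs = {name: datasets[name] for name in job.dataset_dependencies}
    return CODE_REGISTRY[job.code_id](inputs, values)


def validate_job(job: JobDefinition, datasets: Dict[str, Rows],
                 values: Dict[str, Any], criteria: List[Criterion]) -> float:
    """Re-run the target job and score its output against the criteria.

    Returns the fraction of criteria satisfied (an assumed metric).
    """
    if not criteria:
        raise ValueError("at least one validation criterion is required")
    output = run_job(job, datasets, values)
    passed = sum(1 for criterion in criteria if criterion(output))
    return passed / len(criteria)


# A template with a single job definition, plus values a user might enter.
template = {
    "filter_orders": JobDefinition(
        code_id="filter_rows",
        dataset_dependencies=["orders_a", "orders_b"],
        config_parameters=["threshold", "limit"],
    )
}
datasets = {"orders_a": [5, 12, 30], "orders_b": [8, 40]}
values = {"threshold": 10, "limit": 2}
criteria = [
    lambda out: len(out) <= 2,
    lambda out: all(v >= 10 for v in out),
]

result = run_job(template["filter_orders"], datasets, values)
score = validate_job(template["filter_orders"], datasets, values, criteria)
print(result, score)
```

In this sketch a poorly chosen configuration, such as a `threshold` of 0, lets small values through and lowers the score, which is one simple way a validation value could reflect the quality of the supplied parameter values.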
