×

Data analytics lifecycle processes

  • US 9,262,493 B1
  • Filed: 12/27/2012
  • Issued: 02/16/2016
  • Est. Priority Date: 12/27/2012
  • Status: Active Grant
First Claim
Patent Images

1. A method comprising:

  • defining a data analytic plan for analyzing a given data set associated with a given data problem associated with a data analytics lifecycle;

    obtaining a test data set and a training data set from the given data set associated with the given data problem;

    executing at least one model to confirm an adequacy of the at least one model for the data analytic plan by fitting the at least one model on the training data set and evaluating the at least one model fitted on the training data set against the test data set, wherein the evaluation comprises assessing a validity of the at least one model and a validity of results of the execution of the at least one model on the test data set;

    refining the at least one model based on the assessment;

    conditioning at least a portion of raw data in the given data set to generate conditioned data;

    creating an analytics environment in which the executing, evaluating and conditioning steps are performed, the analytics environment comprising parameters including at least a capacity and a bandwidth of the analytics environment; and

    dynamically changing the parameters in response to the refining step to include parameters to perform additional executing and evaluating steps on refinements of the at least one model;

    wherein the step of dynamically changing the parameters is performed such that the data analytics lifecycle is configured to continue from a point in the lifecycle where the parameters were changed;

    wherein the execution of the at least one model is performed prior to implementation of the data analytic plan in a destination environment;

    wherein the training data set is used to train the at least one model and the test data set is used to determine the accuracy of the at least one model fitted on the training data set; and

    wherein the defining, obtaining, executing, refining, conditioning, creating and dynamically changing steps are performed on one or more processing elements associated with a computing system and automate at least part of the data analytics lifecycle.

View all claims
  • 9 Assignments
Timeline View
Assignment View
    ×
    ×