×

Cloud process for rapid data investigation and data integrity analysis

  • US 10,367,888 B2
  • Filed: 09/20/2017
  • Issued: 07/30/2019
  • Est. Priority Date: 10/03/2014
  • Status: Active Grant
First Claim
Patent Images

1. A method comprising:

  • receiving summary statistics computed by at least executing one or more analytical processes on a dataset stored in parts across a set of memory based compute nodes, each compute node finding partial statistics of a data part stored on the respective compute node, the partial statistics representative of a respective data part;

    storing the summary statistics in a random access memory associated with a server computer, the random access memory being accessible by at least one of the compute nodes, the summary statistics being a combination of the partial statistics and representative of a full dataset;

    identifying, for pre-model building data understanding, outlier data by comparing subsets of data in the dataset, the identified outlier data accessible to a predictive model;

    generating a graphical representation of at least some summary statistics stored in the random access memory; and

    formatting the graphical representation of at least some summary statistics for transmission to and display by one or more client computers.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×