×

Behaviorally consistent cluster-wide data wrangling based on locally processed sampled data

  • US 9,785,698 B2
  • Filed: 12/31/2014
  • Issued: 10/10/2017
  • Est. Priority Date: 12/31/2014
  • Status: Active Grant
First Claim
Patent Images

1. A method comprising:

  • selecting, at a local client device, a first plurality of raw data from a second plurality of raw data, the second plurality of raw data being stored remote from the local client device and accessible by a remote device;

    receiving the first plurality of raw data at the local client device;

    selecting, from a library of data wrangling operations at the local client device, a plurality of data wrangling operations to perform on the first plurality of raw data;

    instantiating, at the local client device, a first data wrangling module operating in a first mode selected from a first plurality of modes, the first mode being selected based on computing resources available to the local client device;

    applying, using at least one hardware processor of the local client device, the plurality of data wrangling operations to the first plurality of raw data using the first mode of the first data wrangling module to obtain a first plurality of structured data;

    causing presentation, via a user interface of the local client device, of the first plurality of structured data;

    receiving, via the user interface of the local client device, after the causing of the presentation of the first plurality of structured data, an input indicating approval of the first plurality of structured data; and

    sending, in response to receiving the input, the selection of the plurality of data wrangling operations to the remote device, the remote device being configured to;

    instantiate a second data wrangling module operating in a second mode selected from a plurality of modes; and

    apply the selected plurality of data wrangling operations to the second plurality of raw data using the second mode of the second data wrangling module to obtain a second plurality of structured data, the second plurality of structured data having an expected organization based on the first plurality of structured data.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×