×

Step editor for data preparation

  • US 10,642,815 B2
  • Filed: 10/14/2015
  • Issued: 05/05/2020
  • Est. Priority Date: 10/14/2015
  • Status: Active Grant
First Claim
Patent Images

1. A system, comprising:

  • one or more processors configured to;

    display, via a user interface, a first set of sequenced data preparation operations to be sequentially performed on a set of data, the first set of sequenced data preparation operations displayed via the user interface comprising at least one of;

    clustering, joining, appending, sorting, uppercase, lowercase, filtering, deduplicating, grouping, adding or removing columns, adding or removing rows, pivoting, and/or depivoting;

    receive, via the user interface, an indication of a user input, the user input received via the user interface comprising an indication to mute a data preparation operation in the first set of sequenced data preparation operations displayed via the user interface;

    based at least in part on the user input comprising the indication to mute the data preparation operation in the first set of sequenced data preparation operations displayed via the user interface, determine a second set of sequenced data preparation operations, wherein the second set of sequenced data preparation operations comprises a version of the first set of sequenced data preparation operations in which the muted data preparation operation is excluded;

    generate a signature based at least in part on the second set of sequenced data preparation operations comprising the version of the first set of sequenced data preparation operations in which the muted data preparation operation is excluded;

    use the generated signature to determine that there exists a cached result matching to at least a portion of the second set of sequenced data preparation operations;

    based at least in part on determining that there exists the cached result matching to the at least portion of the second set of sequenced data preparation operations, retrieve a data traversal program representing the cached result; and

    provide, via the user interface, output based at least in part on the cached result represented by the retrieved data traversal program; and

    a memory coupled to the one or more processors and configured to provide the one or more processors with instructions.

View all claims
  • 6 Assignments
Timeline View
Assignment View
    ×
    ×