×

Methods and systems to operate on group-by sets with high cardinality

  • US 9,633,104 B2
  • Filed: 05/05/2014
  • Issued: 04/25/2017
  • Est. Priority Date: 05/03/2013
  • Status: Active Grant
First Claim
Patent Images

1. A computer-program product tangibly embodied in a non-transitory machine-readable storage medium, the storage medium having instructions stored thereon, and the instructions being operable to cause a data-processing apparatus to perform operations including:

  • accessing a data set that includes multiple entries, each of the entries including data corresponding to multiple variables;

    grouping the multiple entries into group-by subsets, wherein the group-by subsets are formed on two or more group-by variables, and wherein the group-by subsets include multiple disjoint subsets of the data set, multiple intersecting subsets of the data set, or multiple subsets of the data set which are formed on different combinations of group-by variables;

    displaying an interface that facilitates defining a subset of the data set by referencing one or more of the group-by subsets;

    receiving an input at the interface, the input defining a subset of the data set by referencing at least one of the group-by subsets;

    generating a statistical summary of the defined subset;

    determining cardinality data for each of the group-by subsets, wherein cardinality data represents a number of entries in a group-by subset;

    generating at least one summary of data in each of the group-by subsets, wherein each of the summaries includes the cardinality data determined for the group-by subset;

    initializing objects for the group-by subsets, wherein each of the objects include the cardinality data and the at least one summary, and wherein each of the objects includes values of the group-by variables used in forming the group-by subset; and

    generating multiple statistical summaries of the data set using the objects.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×