Visualization suggestion application programming interface
First Claim
1. A method comprising:
- accessing a dataset and a user selection of at least one column of the dataset by at least one hardware processor, the dataset including at least one online analytical processing (OLAP) cube with each column of the cube classified as a measure or classified as a dimension;
determining, by the at least one hardware processor, which type of analysis to perform on each of the unselected columns of the dataset based on;
the classification of the at least one column selected by a user; and
the unselected column being classified as a dimension and a cardinality of the dimension satisfying specified criteria;
analyzing the dataset, by the at least one hardware processor, to generate a score for each unselected column of the dataset based on a degree of dependency between each of the unselected columns and the at least one selected column;
iteratively displaying a ranking of the unselected columns according to the scores, and accessing a user selection of one more column by the at least one hardware processor until a threshold number of columns has been selected;
accessing the selected columns of the dataset by the at least one hardware processor; and
selecting, by the at least one hardware processor, a specified number of visualization configurations compatible with the selected columns from a set of visualization configurations and providing the compatible visualization configurations to a user, wherein the selecting is based at least in part on a quantity of the selected columns and based at least in part on at least one constraint of a visualization configuration of the specified number of visualization configurations.
1 Assignment
0 Petitions
Accused Products
Abstract
A dataset and some user selected columns of the dataset are received by a statistical analysis module for analysis. The statistical analysis module generates a score for each unselected column of the dataset based on statistical analysis of the unselected columns and all or a subset of the selected columns. A ranking of the unselected columns is presented to the user for selection of one additional column of the dataset, after which the remaining unselected columns are re-ranked according to their associated scores and once again displayed to the user. The user may continue selecting from among the ranked columns until a threshold number of columns has been selected, at which point the user may deselect a selected column in order to continue selecting additional columns. A visualization suggestion application program interface then matches the selected columns with compatible visualization configurations and presents some of these visualizations to the user.
-
Citations
17 Claims
-
1. A method comprising:
-
accessing a dataset and a user selection of at least one column of the dataset by at least one hardware processor, the dataset including at least one online analytical processing (OLAP) cube with each column of the cube classified as a measure or classified as a dimension; determining, by the at least one hardware processor, which type of analysis to perform on each of the unselected columns of the dataset based on; the classification of the at least one column selected by a user; and the unselected column being classified as a dimension and a cardinality of the dimension satisfying specified criteria; analyzing the dataset, by the at least one hardware processor, to generate a score for each unselected column of the dataset based on a degree of dependency between each of the unselected columns and the at least one selected column; iteratively displaying a ranking of the unselected columns according to the scores, and accessing a user selection of one more column by the at least one hardware processor until a threshold number of columns has been selected; accessing the selected columns of the dataset by the at least one hardware processor; and selecting, by the at least one hardware processor, a specified number of visualization configurations compatible with the selected columns from a set of visualization configurations and providing the compatible visualization configurations to a user, wherein the selecting is based at least in part on a quantity of the selected columns and based at least in part on at least one constraint of a visualization configuration of the specified number of visualization configurations. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A system comprising:
-
at least one processors; a memory coupled to the at least one processor, the memory including instructions which, when executed by the at least one processor cause the system to perform operations comprising; accessing a dataset and a user selection of at least one column of the dataset the dataset including at least one online analytical processing (OLAP) cube with each column of the cube classified as a measure or classified as a dimension; determining, by the at least one processor, which type of analysis to perform on each of the unselected columns of the dataset based on; the classification of the at least one column selected by a user; and the unselected column being classified as a dimension and a cardinality of the dimension satisfying specified criteria; analyzing the dataset, by the at least one processor, to generate a score for each unselected column of the dataset based on a degree of dependency between each of the unselected columns and the at least one selected column; iteratively display a ranking of the unselected columns according to the scores, and accessing a user selection of one more column by the at least one processor until a threshold number of columns has been selected; accessing the selected columns of the dataset; selecting a specified number of visualization configurations compatible with the selected columns from a set of visualizations, wherein the selecting is based at least in part on a quantity of the selected columns and based at least in part on at least one constraint of a visualization configuration of the specified number of visualization configurations; and providing the compatible visualization configurations to a user. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. A non-transitory machine-readable storage medium including instructions that, when executed on at least one processor of a machine, cause the machine to perform the operations comprising:
-
accessing a dataset and a user selection of at least one column of the dataset by at least one hardware processor, the dataset including at least one online analytical processing (OLAP) cube with each column of the cube classified as a measure or classified as a dimension; determining, by the at least one hardware processor, which type of analysis to perform on each of the unselected columns of the dataset based on; the classification of the at least one column selected by a user; and the unselected column being classified as a dimension and a cardinality of the dimension satisfying specified criteria; analyzing the dataset, by the at least one hardware processor, to generate a score for each unselected column of the dataset based on a degree of dependency between each of the unselected columns and the at least one selected column; iteratively displaying a ranking of the unselected columns according to the scores, and accessing a user selection of one more column by the at least one hardware processor until a threshold number of columns has been selected; accessing the selected columns of the dataset by the at least one hardware processor; and selecting, by the at least one hardware processor, a specified number of visualization configurations compatible with the selected columns from a set of visualization configurations and providing the compatible visualization configurations to a user, wherein the selecting is based at least in part on a quantity of the selected columns and based at least in part on at least one constraint of a visualization configuration of the specified number of visualization configurations. - View Dependent Claims (14, 15, 16, 17)
-
Specification