Systems, methods, and apparatuses for populating a table having null values using a predictive query interface
First Claim
1. A method in a host organization, the method comprising:
- receiving a tabular dataset from a user as input, the tabular dataset having data values organized as columns and rows;
identifying a plurality of null values within the tabular dataset, the null values being dispersed across multiple rows and multiple columns of the tabular dataset;
generating indices from the tabular dataset of columns and rows, the indices representing probabilistic relationships between the rows and the columns of the tabular dataset;
displaying the tabular dataset as output to the user, the displayed output including the data values depicted as known values and the null values depicted as unknown values;
receiving input from the user to populate at least a portion of the unknown values within the displayed tabular dataset with predicted values;
querying the indices for the predicted values;
receiving a confidence indicator for every one of the plurality of null values within the tabular dataset responsive to querying the indices for the predicted values, the confidence indicator based on a comparison of known results corresponding to known and non-null values within the dataset with the predicted values; and
displaying the predicted values as updated output to the user, wherein displaying the predicted values as updated output to the user comprises displaying selected ones of the predicted values that correspond to the confidence indicator being in excess of a default minimum confidence threshold or a user specified minimum confidence threshold when present.
1 Assignment
0 Petitions
Accused Products
Abstract
Disclosed herein are systems and methods for populating a table having null values using a predictive query interface including means for receiving a tabular dataset from a user as input, the tabular dataset having data values organized as columns and rows; identifying a plurality of null values within the tabular dataset, the null values being dispersed across multiple rows and multiple columns of the tabular dataset; generating indices from the tabular dataset of columns and rows, the indices representing probabilistic relationships between the rows and the columns of the tabular dataset; displaying the tabular dataset as output to the user, the displayed output including the data values depicted as known values and the null values depicted as unknown values; receiving input from the user to populate at least a portion of the unknown values within the displayed tabular dataset with predicted values; querying the indices for the predicted values; and displaying the predicted values as updated output to the user. Other related embodiments are further disclosed.
-
Citations
26 Claims
-
1. A method in a host organization, the method comprising:
-
receiving a tabular dataset from a user as input, the tabular dataset having data values organized as columns and rows; identifying a plurality of null values within the tabular dataset, the null values being dispersed across multiple rows and multiple columns of the tabular dataset; generating indices from the tabular dataset of columns and rows, the indices representing probabilistic relationships between the rows and the columns of the tabular dataset; displaying the tabular dataset as output to the user, the displayed output including the data values depicted as known values and the null values depicted as unknown values; receiving input from the user to populate at least a portion of the unknown values within the displayed tabular dataset with predicted values; querying the indices for the predicted values; receiving a confidence indicator for every one of the plurality of null values within the tabular dataset responsive to querying the indices for the predicted values, the confidence indicator based on a comparison of known results corresponding to known and non-null values within the dataset with the predicted values; and displaying the predicted values as updated output to the user, wherein displaying the predicted values as updated output to the user comprises displaying selected ones of the predicted values that correspond to the confidence indicator being in excess of a default minimum confidence threshold or a user specified minimum confidence threshold when present. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. A non-transitory computer readable storage medium having instructions stored thereon that, when executed by a processor in a host organization, the instructions cause the host organization to perform operations comprising:
-
receiving a tabular dataset from a user as input, the tabular dataset having data values organized as columns and rows; identifying a plurality of null values within the tabular dataset, the null values being dispersed across multiple rows and multiple columns of the tabular dataset; generating indices from the tabular dataset of columns and rows, the indices representing probabilistic relationships between the rows and the columns of the tabular dataset; displaying the tabular dataset as output to the user, the displayed output including the data values depicted as known values and the null values depicted as unknown values; receiving input from the user to populate at least a portion of the unknown values within the displayed tabular dataset with predicted values; querying the indices for the predicted values; receiving a confidence indicator for every one of the plurality of null values within the tabular dataset responsive to querying the indices for the predicted values, the confidence indicator based on a comparison of known results corresponding to known and non-null values within the dataset with the predicted values; and displaying the predicted values as updated output to the user, wherein displaying the predicted values as updated output to the user comprises displaying selected ones of the predicted values that correspond to the confidence indicator being in excess of a default minimum confidence threshold or a user specified minimum confidence threshold when present. - View Dependent Claims (21, 22)
-
-
23. A system to operate within a host organization, the system comprising:
-
a processor to execute instructions stored in memory of the system; a request interface to receive a tabular dataset from a user as input, the tabular dataset having data values organized as columns and rows; an analysis engine to identify a plurality of null values within the tabular dataset, the null values being dispersed across multiple rows and multiple columns of the tabular dataset; the analysis engine to further generate indices from the tabular dataset of columns and rows, the indices representing probabilistic relationships between the rows and the columns of the tabular dataset; the request interface to return the tabular dataset as display output to the user, the display output including the data values depicted as known values and the null values depicted as unknown values; the request interface to receive input from the user to populate at least a portion of the unknown values within the displayed tabular dataset with predicted values; a query interface to query the indices for the predicted values; the query interface to receive a confidence indicator for every one of the plurality of null values within the tabular dataset responsive to the query for the predicted values, the confidence indicator based on a comparison of known results corresponding to known and non-null values within the dataset with the predicted values; and the request interface to further return the predicted values as updated display output to the user, wherein displaying the predicted values as updated output to the user comprises displaying selected ones of the predicted values that correspond to the confidence indicator being in excess of a default minimum confidence threshold or a user specified minimum confidence threshold when present. - View Dependent Claims (24, 25, 26)
-
Specification