Systems and methods for managing statistical expressions
Abstract
Using natural language-like user inputs to provide statistics on a subset of data is described. In one embodiment, a user input that includes at least one word or phrase representing a rule is received. The rule includes an identification of a subset of data and a statistical expression to be performed on the subset of data. The subset of data includes at least part of the data elements of a data set. Each data element includes information on an individual or group. Instructions are provided for translating the rule into an executable format. The executable format includes a translated identification of the subset and a translated statistical expression. The subset of the data is accessed using the translated identification of the subset. The translated statistical expression is executed to obtain statistics on data elements of the subset of data. The statistics on the data elements are provided.
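The flow described in the abstract (receive a rule, translate it into an executable format, access the subset, execute the statistical expression) can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration and does not come from the patent: the rule grammar "<statistic> of <variable> for <subset>", the OPERATORS and SUBSETS mappings, and the sample PEOPLE data set.

```python
import re
from statistics import mean, median

# Hypothetical mapping for English: word -> operator (a statistical function).
OPERATORS = {"average": mean, "median": median, "count": len, "maximum": max}

# Hypothetical data set; each data element holds information on an individual.
PEOPLE = [
    {"name": "Ana", "age": 34, "income": 52000},
    {"name": "Ben", "age": 67, "income": 31000},
    {"name": "Cho", "age": 71, "income": 45000},
]

# Hypothetical stored subset definitions: subset name -> membership test.
SUBSETS = {
    "seniors": lambda person: person["age"] >= 65,
    "everyone": lambda person: True,
}

# Assumed rule grammar: "<statistic> of <variable> for <subset>".
RULE = re.compile(r"^(?P<op>\w+) of (?P<var>\w+) for (?P<subset>\w+)$")


def translate(rule):
    """Correlate words in the rule to an operator, a variable, and a subset."""
    match = RULE.match(rule.strip().lower())
    if match is None:
        raise ValueError(f"rule not understood: {rule!r}")
    op, var, subset = match.group("op", "var", "subset")
    if op not in OPERATORS or subset not in SUBSETS:
        raise ValueError(f"unknown statistic or subset in rule: {rule!r}")
    return OPERATORS[op], var, SUBSETS[subset]


def evaluate(rule, data=PEOPLE):
    """Access the subset, then execute the translated statistical expression."""
    operator, variable, in_subset = translate(rule)
    values = [row[variable] for row in data if in_subset(row)]
    return operator(values)


print(evaluate("average of income for seniors"))
```

In this sketch the translated form is simply a tuple of Python callables and a field name; a translator that emits a query language or other machine-readable format would follow the same three steps.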
19 Claims
1. A system comprising:
a processor configured to execute instructions stored on a non-transitory computer-readable medium, the instructions including:
an interface engine configured for providing a user interface, the user interface including:
a definitions section that defines subsets of data by (i) displaying a subset assignment operator, (ii) receiving a word or phrase identifying a defined subset, (iii) appending the word or phrase identifying the subset to the subset assignment operator, (iv) appending a selection preposition to the word or phrase identifying the subset, wherein the selection preposition is a word or phrase indicative of selecting a data set from a plurality of data sets of a data source, (v) displaying a list including the plurality of data sets, (vi) receiving input selecting the data set from the list, (vii) appending a word or phrase identifying the selected data set to the selection preposition, (viii) displaying a parameter assignment operator adjacent to the words or phrases identifying the defined subset and the selected data set, (ix) receiving input identifying a parameter for the selected data set, (x) appending a word or phrase identifying the parameter of the data set to the parameter assignment operator, wherein the parameter is usable for specifying the defined subset, (xi) receiving input identifying a value or range of values for the parameter usable for identifying data elements from the data set to be included in the defined subset, and (xii) storing a definition of the subset based on the input received to the user interface, the definition comprising the word or phrase identifying the defined subset, the word or phrase identifying the selected dataset, and the value or range of values for the parameter;
a composite collections section configured for receiving input identifying a composite subset of data, wherein the composite subset of data comprises at least two subsets of data defined via the definitions section;
an attribute section configured for receiving input defining a statistical expression to perform on the defined subset of data; and
a natural language input field configured for receiving a rule via a natural language input, the rule identifying the statistical expression and the defined subset of data; and
an attribute engine configured for (i) identifying a translator application configured for translating the rule; (ii) providing translation instructions to the translator application, wherein the translation instructions translate the rule into an executable programming language or a machine-readable language by correlating words or phrases in the natural language input to at least one variable and at least one operator based on a mapping for a language of the natural language input, (iii) accessing the defined subset of data, (iv) executing the executable programming language or the machine-readable language to obtain statistics on data elements of the defined subset of data in response to receiving the executable programming language or a machine-readable language, and (v) outputting the statistics, wherein the statistics describe an attribute of the defined subset of data.
Dependent claims: 2, 3, 4, 5, 6
7. A method comprising:
defining subsets of data by (i) displaying a subset assignment operator, (ii) receiving a word or phrase identifying a defined subset, (iii) appending the word or phrase identifying the subset to the subset assignment operator, (iv) appending a selection preposition to the word or phrase identifying the subset, wherein the selection preposition is a word or phrase indicative of selecting a data set from a plurality of data sets of a data source, (v) displaying a list including the plurality of data sets, (vi) receiving input selecting the data set from the list, (vii) appending a word or phrase identifying the selected data set to the selection preposition, (viii) displaying a parameter assignment operator adjacent to the words or phrases identifying the defined subset and the selected data set, (ix) receiving input identifying a parameter for the selected data set, (x) appending a word or phrase identifying the parameter of the data set to the parameter assignment operator, wherein the parameter is usable for specifying the defined subset, (xi) receiving input identifying a value or range of values for the parameter usable for identifying data elements from the data set to be included in the defined subset, and (xii) storing a definition of the subset based on the input received to the user interface, the definition comprising the word or phrase identifying the defined subset, the word or phrase identifying the selected dataset, and the value or range of values for the parameter;
receiving input identifying a composite subset of data, wherein the composite subset of data comprises at least two defined subsets of data;
receiving input defining a statistical expression to perform on the defined subset of data;
receiving natural language input identifying the statistical expression and the defined subset of data;
identifying a translator application configured for translating the rule;
providing translation instructions to the translator application, wherein the translation instructions translate the rule into an executable programming language or a machine-readable language by correlating words or phrases in the natural language input to at least one variable and at least one operator based on a mapping for a language of the natural language input;
accessing the defined subset of data;
executing the executable programming language or the machine-readable language to obtain statistics on data elements of the defined subset of data in response to receiving the executable programming language or a machine-readable language; and
outputting the statistics, wherein the statistics describe an attribute of the defined subset of data.
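The subset-definition steps (i) through (xii) and the composite subset recited in claim 7 can be pictured with a short sketch. The concrete tokens are assumptions chosen for illustration only: "define" stands in for the subset assignment operator, "from" for the selection preposition, and "where ... =" for the parameter assignment operator. The SubsetDefinition class, the composite helper, and the SOURCE data are likewise hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SubsetDefinition:
    """Stored definition of a subset, as in step (xii)."""
    name: str        # word or phrase identifying the defined subset
    data_set: str    # word or phrase identifying the selected data set
    parameter: str   # parameter of the data set used to specify the subset
    low: float       # lower bound of the range of values (inclusive)
    high: float      # upper bound of the range of values (inclusive)

    def phrase(self):
        """Text the interface would have assembled by appending each piece."""
        return (f"define {self.name} from {self.data_set} "
                f"where {self.parameter} = {self.low}..{self.high}")

    def select(self, source):
        """Return the data elements whose parameter falls in the range."""
        rows = source[self.data_set]
        return [r for r in rows if self.low <= r[self.parameter] <= self.high]


def composite(source, *definitions):
    """Composite subset: union of at least two defined subsets, no repeats."""
    if len(definitions) < 2:
        raise ValueError("a composite subset needs at least two subsets")
    combined = []
    for definition in definitions:
        for row in definition.select(source):
            if row not in combined:
                combined.append(row)
    return combined


# Hypothetical data source holding one data set.
SOURCE = {"customers": [{"id": 1, "age": 25},
                        {"id": 2, "age": 40},
                        {"id": 3, "age": 70}]}

young = SubsetDefinition("young", "customers", "age", 18, 30)
seniors = SubsetDefinition("seniors", "customers", "age", 65, 120)

print(young.phrase())
print([row["id"] for row in composite(SOURCE, young, seniors)])
```

Treating the composite as a union is one possible reading; the claim only requires that the composite comprise at least two defined subsets.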
8. A computer-readable medium on which program code configured for being executed by a processor is stored, the program code comprising:
program code for defining subsets of data by (i) displaying a subset assignment operator, (ii) receiving a word or phrase identifying a defined subset, (iii) appending the word or phrase identifying the subset to the subset assignment operator, (iv) appending a selection preposition to the word or phrase identifying the subset, wherein the selection preposition is a word or phrase indicative of selecting a data set from a plurality of data sets of a data source, (v) displaying a list including the plurality of data sets, (vi) receiving input selecting the data set from the list, (vii) appending a word or phrase identifying the selected data set to the selection preposition, (viii) displaying a parameter assignment operator adjacent to the words or phrases identifying the defined subset and the selected data set, (ix) receiving input identifying a parameter for the selected data set, (x) appending a word or phrase identifying the parameter of the data set to the parameter assignment operator, wherein the parameter is usable for specifying the defined subset, (xi) receiving input identifying a value or range of values for the parameter usable for identifying data elements from the data set to be included in the defined subset, and (xii) storing a definition of the subset based on the input received to the user interface, the definition comprising the word or phrase identifying the defined subset, the word or phrase identifying the selected dataset, and the value or range of values for the parameter;
program code for receiving input identifying a composite subset of data, wherein the composite subset of data comprises at least two defined subsets of data;
program code for receiving input defining a statistical expression to perform on the defined subset of data;
program code for receiving a rule via a natural language input identifying the statistical expression and the defined subset of data;
program code for identifying a translator application configured for translating the rule;
program code for providing translation instructions to the translator application, wherein the translation instructions translate the rule into an executable programming language or a machine-readable language by correlating words or phrases in the natural language input to at least one variable and at least one operator based on a mapping for a language of the natural language input;
program code for accessing the defined subset of data;
program code for executing the executable programming language or the machine-readable language to obtain statistics on data elements of the defined subset of data in response to receiving the executable programming language or a machine-readable language; and
program code for outputting the statistics, wherein the statistics describe an attribute of the defined subset of data.
9. A system comprising:
a processor configured to execute instructions stored on a non-transitory computer-readable medium, the instructions including:
an interface engine configured for providing a user interface, the user interface including:
a definitions section that defines subsets of data by (i) displaying a subset assignment operator, (ii) receiving a word or phrase identifying a defined subset, (iii) appending the word or phrase identifying the subset to the subset assignment operator, (iv) appending a selection preposition to the word or phrase identifying the subset, wherein the selection preposition is a word or phrase indicative of selecting a data set from a plurality of data sets of a data source, (v) displaying a list including the plurality of data sets, (vi) receiving input selecting the data set from the list, (vii) appending a word or phrase identifying the selected data set to the selection preposition, and (viii) storing a definition of the subset based on the input received to the user interface, the definition comprising the word or phrase identifying the defined subset, and the word or phrase identifying the selected dataset; and
a natural language input field configured for receiving a rule via a natural language input, the rule identifying the statistical expression and the defined subset of data; and
an attribute engine configured for (i) providing translation instructions that translate the rule into an executable programming language or a machine-readable language by correlating words or phrases in the natural language input to at least one variable and at least one operator based on a mapping for a language of the natural language input, (ii) accessing the defined subset of data, (iii) executing the executable programming language or the machine-readable language to obtain statistics on data elements of the defined subset of data, and (iv) outputting the statistics, wherein the statistics describe an attribute of the defined subset of data.
Dependent claims: 10, 11, 12
13. A system comprising:
a processor configured to execute instructions stored on a non-transitory computer-readable medium, the instructions including:
an interface engine configured for providing a user interface, the user interface including a natural language input field configured for receiving a rule via a natural language input, the rule identifying a statistical expression to perform on a defined subset of data and the defined subset of data; and
an attribute engine configured for (i) providing translation instructions that translate the rule into an executable programming language or a machine-readable language by correlating words or phrases in the natural language input to at least one variable and at least one operator based on a mapping for a language of the natural language input, (ii) accessing the defined subset of data, (iii) executing the executable programming language or the machine-readable language to obtain statistics on data elements of the defined subset of data by executing an aggregate arithmetic or quadratic function combining a plurality of statistical functions applied to respective subsets of data, wherein each of the subsets of data is identified by applying a respective filter or transformation function to entries in a data set having a respective specified value, and (iv) outputting the statistics, wherein the statistics describe an attribute of the defined subset of data.
Dependent claims: 14, 15, 16, 17, 18, 19
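The aggregate computation recited in claim 13 (several statistical functions, each applied to its own filtered subset, combined by an arithmetic or quadratic function) might look like the following sketch. The ORDERS data, the helper names, and the choice of a difference of means and its square are assumptions for illustration; in particular, reading "quadratic function" as the square of a combined statistic is only one possible interpretation.

```python
from statistics import mean

# Hypothetical data set of entries, each with a region and an amount.
ORDERS = [
    {"region": "east", "amount": 120.0},
    {"region": "east", "amount": 80.0},
    {"region": "west", "amount": 50.0},
    {"region": "west", "amount": 150.0},
    {"region": "west", "amount": 70.0},
]


def subset(data, field, value):
    """Filter function: keep entries whose field has the specified value."""
    return [row for row in data if row[field] == value]


def stat(func, data, field):
    """Apply a statistical function to one field of a subset."""
    return func([row[field] for row in data])


def mean_gap(data):
    """Arithmetic aggregate: difference of two subset means."""
    east = stat(mean, subset(data, "region", "east"), "amount")
    west = stat(mean, subset(data, "region", "west"), "amount")
    return east - west


def squared_gap(data):
    """Quadratic aggregate: square of the difference of the subset means."""
    return mean_gap(data) ** 2


print(mean_gap(ORDERS), squared_gap(ORDERS))
```

Each subset is produced by its own filter call, so swapping in a different filter or a different statistical function per subset requires no change to the combining step.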
Specification