Computer systems and methods for automatic generation of models for a dataset
First Claim
Patent Images
1. A method of automatically generating statistical models for a dataset, comprising:
- at a computer system having one or more processors and memory storing programs executed by the one or more processors;
receiving a user description of a graphical view of a dataset, wherein the user description specifies a relationship between a first field and a second field of the dataset;
rendering a graphical visualization of the relationship between the first field and the second field of the dataset using data values associated with the first and second fields in the graphical view of the dataset;
receiving a user instruction to render a statistical model on top of the graphical visualization of the relationship between the first field and the second field of the dataset using data values associated with the first and second fields in the graphical view of the dataset;
in response to the user instruction;
determining a set of data properties for each of the first and second fields; and
determining whether there is a mathematical formula describing the relationship between the first field and the second field of the dataset in accordance with the respective set of data properties; and
when there is a mathematical formula describing the relationship between the first field and the second field of the dataset;
generating one or more statistical models of the dataset in accordance with the mathematical formula and the data values associated with the first and second fields in the dataset; and
rendering at least one graphical representation of the one or more statistical models on top of the graphical visualization of the relationship between the first field and the second field of the dataset in the graphical view of the dataset.
1 Assignment
0 Petitions
Accused Products
Abstract
A method of automatically generating models from a dataset includes multiple steps. First, a description of a view of a dataset is provided. The description includes multiple fields associated with the dataset. Next, a set of properties is determined for each of the multiple fields. Finally, the description is automatically translated into one or more models based on the respective properties of the multiple fields and a set of predefined heuristics.
-
Citations
23 Claims
-
1. A method of automatically generating statistical models for a dataset, comprising:
at a computer system having one or more processors and memory storing programs executed by the one or more processors; receiving a user description of a graphical view of a dataset, wherein the user description specifies a relationship between a first field and a second field of the dataset; rendering a graphical visualization of the relationship between the first field and the second field of the dataset using data values associated with the first and second fields in the graphical view of the dataset; receiving a user instruction to render a statistical model on top of the graphical visualization of the relationship between the first field and the second field of the dataset using data values associated with the first and second fields in the graphical view of the dataset; in response to the user instruction; determining a set of data properties for each of the first and second fields; and determining whether there is a mathematical formula describing the relationship between the first field and the second field of the dataset in accordance with the respective set of data properties; and when there is a mathematical formula describing the relationship between the first field and the second field of the dataset; generating one or more statistical models of the dataset in accordance with the mathematical formula and the data values associated with the first and second fields in the dataset; and rendering at least one graphical representation of the one or more statistical models on top of the graphical visualization of the relationship between the first field and the second field of the dataset in the graphical view of the dataset. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
12. A computer system, comprising:
-
memory; one or more processors; and at least one program stored in the memory, wherein the at least one program, when executed by the one or more processors, causes the computer system to; receive a user description of a graphical view of a dataset, wherein the user description specifies a relationship between a first field and a second field of the dataset; render a graphical visualization of the relationship between the first field and the second field of the dataset using data values associated with the first and second fields in the graphical view of the dataset; receive a user instruction to render a statistical model on top of the graphical visualization of the relationship between the first field and the second field of the dataset using data values associated with the first and second fields in the graphical view of the dataset; in response to the user instruction; determine a set of data properties for each of the first and second fields; and determine whether there is a mathematical formula describing the relationship between the first field and the second field of the dataset in accordance with the respective set of data properties; and when there is a mathematical formula describing the relationship between the first field and the second field of the dataset; generate one or more statistical models of the dataset in accordance with the mathematical formula and the data values associated with the first and second fields in the dataset; and render at least one graphical representation of the one or more statistical models on top of the graphical visualization of the relationship between the first field and the second field of the dataset in the graphical view of the dataset. - View Dependent Claims (13, 14, 15, 16, 17)
-
-
18. A non-transitory computer readable storage medium storing a computer program product that includes one or more programs configured for execution by a computer system that includes one or more processors and memory, the one or more programs comprising instructions that, if executed by the computer system, cause the computer system to:
-
receive a user description of a graphical view of a dataset, wherein the user description specifies a relationship between a first field and a second field of the dataset; render a graphical visualization of the relationship between the first field and the second field of the dataset using data values associated with the first and second fields in the graphical view of the dataset; receive a user instruction to render a statistical model on top of the graphical visualization of the relationship between the first field and the second field of the dataset using data values associated with the first and second fields in the graphical view of the dataset; in response to the user instruction; determine a set of data properties for each of the first and second fields; and determine whether there is a mathematical formula describing the relationship between the first field and the second field of the dataset in accordance with the respective set of data properties; and when there is a mathematical formula describing the relationship between the first field and the second field of the dataset; generate one or more statistical models of the dataset in accordance with the mathematical formula and the data values associated with the first and second fields in the dataset; and render at least one graphical representation of the one or more statistical models on top of the graphical visualization of the relationship between the first field and the second field of the dataset in the graphical view of the dataset. - View Dependent Claims (19, 20, 21, 22, 23)
-
Specification