Predictive modeling and data analysis in a secure shared system
Abstract
A system and method enables users to selectively expose and optionally monetize their data resources, for example on a web site. Data assets such as datasets and models can be exposed by the proprietor on a public gallery for use by others. Fees may be charged, for example, per new model, or per prediction using a model. Users may selectively expose public datasets or public models while keeping their raw data private.
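As a rough illustration of the selective exposure and per-prediction fee described in the abstract, the following Python sketch keeps each model private until its owner publishes it, then credits a fee to the owner whenever another user requests a prediction. Every name here (`Gallery`, `Model`, `publish`, `fee_per_prediction`, `balances`) is hypothetical and chosen for illustration only; nothing in it is taken from the patent's specification, and it is not the patented implementation.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class Model:
    """A model held by one user; predict_fn stands in for the real model internals."""
    owner: str
    name: str
    predict_fn: Callable[[dict], object]
    public: bool = False               # private by default
    fee_per_prediction: float = 0.0    # hypothetical pricing field


@dataclass
class Gallery:
    """Hypothetical registry separating private resources from advertised ones."""
    models: Dict[str, Model] = field(default_factory=dict)
    balances: Dict[str, float] = field(default_factory=dict)

    def store(self, model: Model) -> None:
        # Stored resources stay private until the owner publishes them.
        self.models[model.name] = model

    def publish(self, owner: str, name: str, fee: float) -> None:
        model = self.models[name]
        if model.owner != owner:
            raise PermissionError("only the owner may publish a resource")
        model.public = True
        model.fee_per_prediction = fee

    def visible_to(self, user: str) -> List[str]:
        # A user sees their own resources plus anything advertised publicly.
        return sorted(n for n, m in self.models.items() if m.public or m.owner == user)

    def predict(self, user: str, name: str, inputs: dict) -> object:
        model = self.models[name]
        if not model.public and model.owner != user:
            raise PermissionError("model is private")
        if model.owner != user:
            # Credit the per-prediction fee to the model's owner.
            self.balances[model.owner] = (
                self.balances.get(model.owner, 0.0) + model.fee_per_prediction
            )
        # Only the result is returned; the model internals are never exposed.
        return model.predict_fn(inputs)


gallery = Gallery()
gallery.store(Model("alice", "churn", lambda row: row["calls"] > 3))
gallery.publish("alice", "churn", fee=0.25)
result = gallery.predict("bob", "churn", {"calls": 5})
```

The caller only ever receives the prediction result, which mirrors the idea of exposing a model for use while keeping the underlying data and model private.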
39 Citations
26 Claims
1. A processor-implemented system comprising:
(a) a source server for managing access to a data source;
(b) a dataset server for creating and managing access to a dataset created from the data source;
(c) a model server for creating and managing access to a model based on the dataset;
(d) a prediction server for creating and managing access to a prediction that results from utilizing the model;
(e) a user interface component that implements a corresponding private dashboard for each one of plural users of the system, wherein each private dashboard stores resources for the corresponding user without exposing the stored resources to any other user of the system; and
(f) a public gallery component that, in cooperation with the user interface, implements a public gallery to enable a first user to selectively advertise a dataset for use by other users of the system to build a model of the dataset, wherein the public gallery component displays a summary of the advertised dataset;
wherein the summary comprises a screen display including, for each field in the dataset, a field name and a histogram of data in the field; and
wherein the public gallery component is configurable to advertise a model for use by other users of the system, by displaying a summary of the advertised model.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11
12. A processor-implemented system comprising:
a dataset server for creating and managing access to a dataset wherein the dataset comprises a plurality of records, each record comprising a plurality of fields;
a model server for creating and managing access to a model based on the dataset, wherein the model is a structured representation of the dataset with predictive power;
a prediction server for creating a prediction from input data by utilizing the model;
a user interface component that implements a corresponding private dashboard for at least one of plural users of the system; and
a public gallery component that implements a public gallery that enables a first user to selectively advertise a dataset in the public gallery for use by other users of the system, wherein the public gallery component displays a summary of the advertised dataset;
wherein the summary of the dataset comprises a screen display including, for each field in the dataset, a field name and a histogram of data in the field; and
wherein the public gallery component further enables the first user to selectively advertise a model owned by the first user for potential use by other users of the system.
Dependent claims: 13, 14, 15, 16, 17, 18, 19
20. A non-transitory, machine readable storage medium having stored thereon a series of instructions for causing one or more processors to perform operations comprising:
accessing a data source to acquire raw data comprising a plurality of records of plural fields;
processing the acquired raw data to form a corresponding dataset;
responsive to input from a first user who owns the dataset, publishing the dataset to a public gallery;
wherein publishing the dataset comprises a screen display including, for each field in the dataset, a field name and a histogram of data in the field;
collecting a fee from a second user to utilize the published dataset;
paying substantially the collected fee to the first user;
cloning the dataset to form a copy; and
providing the copy for exclusive use by the second user;
creating a model of a dataset;
responsive to input from a first user who owns the model, publishing a black-box version of the model to a public gallery, the black-box version enabling use of the model by a second user to make predictions.
Dependent claims: 21, 22
23. A computer-implemented method comprising:
storing a prediction model in a private workspace associated with a first user of a computing system, so that the prediction model is not visible to any other user of the system;
responsive to an input from the first user, displaying a summary of the prediction model to other users of the system in a public gallery wherein the summary includes a tree visualization of the prediction model and metadata describing a dataset underlying the prediction model;
receiving input data from a second user of the system;
applying the prediction model to the received input data to form a prediction result; and
providing the prediction result to the second user without disclosing the prediction model.
Dependent claims: 24, 25, 26
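The dataset summary recited in claims 1, 12, and 20 (a field name and a histogram of data for each field) could be computed along the lines of the Python sketch below. This is a minimal illustration under stated assumptions, not the method of the patent: the function name `summarize_dataset`, the equal-width binning, the default of four bins, and the treatment of non-numeric fields as category counts are all choices made here for the example.

```python
from collections import Counter
from typing import Dict, Sequence


def summarize_dataset(records: Sequence[dict], bins: int = 4) -> Dict[str, Dict[str, int]]:
    """Return {field name: {bin label: count}} for every field in the records.

    Assumption: numeric fields get equal-width bins; any other field is
    summarized by counting each distinct value.
    """
    summary: Dict[str, Dict[str, int]] = {}
    fields = list(records[0].keys()) if records else []
    for name in fields:
        values = [r[name] for r in records if r.get(name) is not None]
        numeric = bool(values) and all(
            isinstance(v, (int, float)) and not isinstance(v, bool) for v in values
        )
        if numeric:
            lo, hi = min(values), max(values)
            width = (hi - lo) / bins or 1.0  # avoid zero width when all values match
            counts = [0] * bins
            for v in values:
                # The maximum value falls into the last bin rather than past it.
                idx = min(int((v - lo) / width), bins - 1)
                counts[idx] += 1
            summary[name] = {
                f"{lo + i * width:g}-{lo + (i + 1) * width:g}": counts[i]
                for i in range(bins)
            }
        else:
            summary[name] = dict(Counter(str(v) for v in values))
    return summary


records = [
    {"age": 10, "plan": "a"},
    {"age": 20, "plan": "b"},
    {"age": 30, "plan": "a"},
    {"age": 50, "plan": "a"},
]
summary = summarize_dataset(records)
```

A gallery page could render each entry of `summary` as a labeled bar chart, so that other users can judge the shape of the data without seeing the raw records.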
Specification