SYSTEMS AND METHODS OF DATA ANALYTICS
First Claim
1. A method of performing data analytics, the method being performed on a data analytics system comprising a non-transitory computer-readable storage medium and a processor attached thereto, the method comprising:
- storing, in the computer-readable storage, one or more applications, each application being associated with an algorithm, each application being further associated with canonical data indicative of a class of data to be accepted by the algorithm associated with the application;
storing, in the computer-readable storage, one or more questions, each question being associated with an application;
storing a user dataset associated with a domain and a data category;
selecting a question from the one or more questions, the selected question being selected based at least in part on the domain and the data category of the user dataset;
matching the user dataset based on the canonical data of the application associated with the selected question, the matching being performed by the processor, the matching comprising comparing one or more fields of the user dataset with the class of data indicated by the canonical data, the matching thereby producing a canonicalized dataset;
executing the algorithm associated with the application, wherein the canonicalized dataset is provided as input to the algorithm; and
presenting output from the algorithm to the user.
4 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods of data analytics, which in various embodiments enable business analysts to apply certain machine learning and analytics algorithms in a self-service manner by binding them to generic business questions that they can be used to answer in particular domains. The general approach may be to define the application of an algorithm to solve specific problems (questions) for particular combinations of a business domain and a data category. At design time, the algorithm may be linked to canonical data within a data category and programmed to run with this canonical data set. At runtime, given a dataset and its category, and a business domain, a user may choose from the corresponding questions and the system may run the algorithm bound to that question.
21 Citations
17 Claims
-
1. A method of performing data analytics, the method being performed on a data analytics system comprising a non-transitory computer-readable storage medium and a processor attached thereto, the method comprising:
-
storing, in the computer-readable storage, one or more applications, each application being associated with an algorithm, each application being further associated with canonical data indicative of a class of data to be accepted by the algorithm associated with the application; storing, in the computer-readable storage, one or more questions, each question being associated with an application; storing a user dataset associated with a domain and a data category; selecting a question from the one or more questions, the selected question being selected based at least in part on the domain and the data category of the user dataset; matching the user dataset based on the canonical data of the application associated with the selected question, the matching being performed by the processor, the matching comprising comparing one or more fields of the user dataset with the class of data indicated by the canonical data, the matching thereby producing a canonicalized dataset; executing the algorithm associated with the application, wherein the canonicalized dataset is provided as input to the algorithm; and presenting output from the algorithm to the user. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15)
-
-
16. A computer system, comprising:
-
an application store comprising computer-readable storage, having stored therein one or more applications, each application being associated with an algorithm, each application being further associated with canonical data indicative of a class of data to be accepted by the algorithm associated with the application; a question store comprising computer-readable storage, having stored therein one or more questions, each question being associated with an application; a user dataset store comprising computer-readable storage, having stored therein a user dataset associated with a domain and a data category; a question selection module configured to select a question from the one or more questions, the selected question being selected based at least in part on the domain and the data category of the user dataset; a dataset matching module configured to match the user dataset based on the canonical data of the application associated with the selected question, the matching being performed by the processor, the matching comprising comparing one or more fields of the user dataset with the class of data indicated by the canonical data, the matching thereby producing a canonicalized dataset; and an application execution module configured to execute the algorithm associated with the application, wherein the canonicalized dataset is provided as input to the algorithm.
-
-
17. A method of performing data analytics, the method being performed using a computer processor, the method comprising:
-
receiving a user dataset; selecting a question that may be answered with respect to the user dataset from a plurality of questions, the selection being based on stored attributes of the plurality of questions and further being based on attributes of the user dataset; reconfiguring the user dataset to conform with one or more inputs associated with an algorithm, the algorithm being identified by the computer processor as being configured to respond to the selected question; and executing the algorithm based on the reconfigured user dataset.
-
Specification