CONSOLIDATOR PLATFORM TO IMPLEMENT COLLABORATIVE DATASETS VIA DISTRIBUTED COMPUTER NETWORKS
Abstract
Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets, whereby a collaborative data layer and associated logic facilitate, for example, efficient access to, and implementation of, collaborative datasets. In some examples, a system may include a data ingestion controller configured to format datasets to form a first and a second atomized dataset, the second atomized dataset including the first atomized dataset and one or more other atomized datasets. The system may include a dataset query engine configured to identify a portion of a dataset relevant to a query, and to retrieve query results from at least one of different data repositories.
18 Claims
1. A system comprising:
- a dataset query engine configured to receive data representing a query, the dataset being associated with an identifier, and to identify datasets relevant to the query, the datasets being disposed in disparate data repositories, the dataset query engine further configured to determine a level of authorization associated with the identifier to access each of the datasets, to generate one or more queries based on the query to access the disparate data repositories, and to retrieve data representing query results from the accessed disparate data repositories.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12
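For illustration only, the flow recited in claim 1 (identify relevant datasets across repositories, check the identifier's authorization per dataset, issue one query per repository, collect results) can be sketched as follows. This is a minimal sketch, not the patented implementation: the `Repository` and `DatasetQueryEngine` classes, the `"none"`/`"read"` levels, and the rule that a dataset is "relevant" when its rows contain the queried field are all assumptions introduced here.

```python
from dataclasses import dataclass, field

# Hypothetical authorization levels; the claim does not name or order them.
LEVELS = {"none": 0, "read": 1}


@dataclass
class Repository:
    """Stand-in for one of the disparate data repositories."""
    name: str
    datasets: dict  # dataset name -> list of row dicts

    def run(self, dataset, field_name, value):
        # One generated query against this repository.
        return [r for r in self.datasets[dataset] if r.get(field_name) == value]


@dataclass
class DatasetQueryEngine:
    repositories: list
    grants: dict = field(default_factory=dict)  # (identifier, dataset) -> level

    def relevant(self, field_name):
        # Assumed relevance rule: the dataset's first row has the queried field.
        for repo in self.repositories:
            for name, rows in repo.datasets.items():
                if rows and field_name in rows[0]:
                    yield repo, name

    def authorization(self, identifier, dataset):
        return self.grants.get((identifier, dataset), "none")

    def query(self, identifier, field_name, value):
        results = []
        for repo, dataset in self.relevant(field_name):
            level = self.authorization(identifier, dataset)
            if LEVELS[level] < LEVELS["read"]:
                continue  # identifier may not access this dataset
            results.extend(repo.run(dataset, field_name, value))
        return results


repo_a = Repository("a", {"cities": [{"city": "Austin", "pop": 950000}]})
repo_b = Repository("b", {
    "offices": [{"city": "Austin", "staff": 40}],
    "payroll": [{"city": "Austin", "total": 1}],
})
engine = DatasetQueryEngine(
    [repo_a, repo_b],
    {("user-1", "cities"): "read", ("user-1", "offices"): "read"},
)
results = engine.query("user-1", "city", "Austin")
```

In this sketch the `payroll` dataset matches the query but is skipped because `user-1` holds no grant for it, so only rows from `cities` and `offices` are returned.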
13. A system comprising:
- a data ingestion controller configured to receive a data file including a dataset, and to format the dataset to form an atomized dataset including atomized data points each including data representing at least two objects and an association between the two objects, the data ingestion controller is further configured to form another atomized dataset including the atomized dataset and other atomized datasets; and
- a dataset query engine configured to receive data representing a query being associated with an identifier, the dataset query engine further configured to identify a subset of the another atomized dataset relevant to the query, wherein portions of the another atomized dataset are disposed in different data repositories, the dataset query engine also configured to generate a plurality of sub-queries each of which is configured to access at least one of the different data repositories, and to retrieve data representing query results from the at least one of the different data repositories.
Dependent claims: 14, 15, 16, 17, 18
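As a rough illustration of claim 13, the sketch below formats tabular rows into atomized data points, combines two atomized datasets into a larger one, spreads the result over several repositories, and answers a query with one sub-query per repository. It is a sketch under stated assumptions, not the claimed implementation: modeling a data point as a subject/association/object triple, the `atomize`, `consolidate`, `partition`, and `sub_query` helpers, and the round-robin placement are all hypothetical choices made here.

```python
from typing import NamedTuple


class Atom(NamedTuple):
    """Assumed shape of an atomized data point: two objects and their association."""
    subject: str
    association: str
    obj: str


def atomize(dataset_id, rows):
    # Each cell becomes one data point linking the row to its value.
    atoms = []
    for i, row in enumerate(rows):
        subject = f"{dataset_id}/row/{i}"
        for column, value in row.items():
            atoms.append(Atom(subject, column, str(value)))
    return atoms


def consolidate(*atomized_datasets):
    # "Another atomized dataset" that includes the given atomized datasets.
    merged = []
    for atoms in atomized_datasets:
        merged.extend(atoms)
    return merged


def partition(atoms, n):
    # Hypothetical placement: round-robin over n repositories.
    repos = [[] for _ in range(n)]
    for i, atom in enumerate(atoms):
        repos[i % n].append(atom)
    return repos


def sub_query(repo, association, obj):
    # One sub-query, scoped to a single repository.
    return [a for a in repo if a.association == association and a.obj == obj]


def query(repos, association, obj):
    results = []
    for repo in repos:
        results.extend(sub_query(repo, association, obj))
    return results


cities = atomize("cities", [{"city": "Austin", "state": "TX"}])
offices = atomize("offices", [{"city": "Austin"}, {"city": "Dallas"}])
combined = consolidate(cities, offices)
repos = partition(combined, 2)
hits = query(repos, "city", "Austin")
```

Here `hits` contains one data point from each source dataset, which shows how a query against the combined atomized dataset can draw on data that originated in separate files.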
Specification