Loading collaborative datasets into data stores for queries via distributed computer networks
First Claim
1. A method comprising:
receiving an atomized dataset to load into a graph-based data store, the atomized dataset including a data arrangement in which data is stored as an atomized data point with one or more other atomized data points of one or more data types as a consolidated dataset, the atomized data point being implemented as a triple, the data arrangement representing at least a portion of a graph, the atomized data point being a representation for a relationship between two data units, and the consolidated dataset having a plurality of atomized data points of the one or more data types also having links that, when parsed, identify one or more relationships between the plurality of atomized data points and the one or more data types including a resource associated with each of the atomized and the other atomized data points and a data type associated with the resource;
converting the atomized dataset, after being received, from a first data format to a second data format, the second data format being a collaborative data format configured to be used to form a portion of the graph;
determining resource requirements data to describe a capability to operate a database configured to access graph-based data to identify at least one resource requirement;
selecting a data store type based on the at least one resource requirement;
performing a load operation of the atomized dataset as a function of the data store;
receiving a query to access the atomized dataset;
classifying at least a portion of the query directed to the dataset to determine a classification type, whereby the classification type is associated with a type of query for a query portion associated with a specific entity; and
applying the portion of the query to at least one of a number of data stores, a subset of which includes one or more types of triplestore-based graph databases.
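For illustration only, and not as part of the claim language, the atomized data point recited above can be pictured as a subject-predicate-object triple, and the converting step as a change of representation. The Python sketch below assumes a hypothetical tabular first format (a list of dictionary rows) and a hypothetical N-Triples-like second format. The names `Triple`, `rows_to_triples`, and `to_ntriples_line`, and the `example.org` namespace, are invented for the example and do not appear in the patent.

```python
from typing import Dict, List, NamedTuple


class Triple(NamedTuple):
    """One atomized data point: a relationship between two data units."""
    subject: str
    predicate: str
    obj: str


BASE = "http://example.org/"  # hypothetical namespace, assumed for the sketch


def rows_to_triples(rows: List[Dict[str, str]], key: str) -> List[Triple]:
    """Convert tabular rows (assumed first format) into triples."""
    triples = []
    for row in rows:
        subject = f"{BASE}resource/{row[key]}"
        for column, value in row.items():
            if column == key:
                continue  # the key column only names the subject
            triples.append(Triple(subject, f"{BASE}property/{column}", str(value)))
    return triples


def to_ntriples_line(t: Triple) -> str:
    """Serialize one triple as an N-Triples-style line with a plain literal object."""
    escaped = t.obj.replace("\\", "\\\\").replace('"', '\\"')
    return f'<{t.subject}> <{t.predicate}> "{escaped}" .'
```

Each non-key cell of a row becomes one triple whose subject is the row's resource, so a row with two non-key columns yields two atomized data points that share a subject and are thereby linked in the graph.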
Abstract
Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets, whereby a collaborative data layer and associated logic facilitate, for example, efficient access to, and implementation of, collaborative datasets. In some examples, a system may include an atomized workflow loader configured to receive an atomized dataset to load into a data store, and to determine resource requirements data to describe at least one resource requirement. The atomized workflow loader may be further configured to select a data store type based on a resource requirement, and perform a load operation of the atomized dataset as a function of the data store type.
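The loader behavior summarized in the abstract (determine resource requirements, select a data store type, then load as a function of that type) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the `ResourceRequirements` fields, the size threshold, and the store type names are all hypothetical, and an in-memory set stands in for a real graph database.

```python
from dataclasses import dataclass, field
from typing import Iterable, Set, Tuple

TripleT = Tuple[str, str, str]


@dataclass(frozen=True)
class ResourceRequirements:
    """Hypothetical resource requirements data describing the dataset's needs."""
    triple_count: int
    needs_inference: bool


@dataclass
class InMemoryStore:
    """Stand-in for a data store; a real system would wrap a graph database."""
    store_type: str
    triples: Set[TripleT] = field(default_factory=set)

    def load(self, triples: Iterable[TripleT]) -> int:
        """Load triples and return how many were newly added."""
        before = len(self.triples)
        self.triples.update(triples)
        return len(self.triples) - before


def determine_requirements(triples, needs_inference=False) -> ResourceRequirements:
    return ResourceRequirements(len(triples), needs_inference)


def select_store_type(req: ResourceRequirements, small_limit: int = 1000) -> str:
    """Pick a store type from the requirements (rules assumed for illustration)."""
    if req.needs_inference:
        return "inference-triplestore"
    if req.triple_count <= small_limit:
        return "in-memory-triplestore"
    return "disk-backed-triplestore"


def load_atomized_dataset(triples, needs_inference=False) -> InMemoryStore:
    """Determine requirements, select a store type, and perform the load."""
    req = determine_requirements(triples, needs_inference)
    store = InMemoryStore(select_store_type(req))
    store.load(triples)
    return store
```

The point of the sketch is the ordering: the store type is chosen from the requirements before any data is written, so the load operation can depend on the selected type.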
197 Citations
16 Claims
1. A method comprising:
receiving an atomized dataset to load into a graph-based data store, the atomized dataset including a data arrangement in which data is stored as an atomized data point with one or more other atomized data points of one or more data types as a consolidated dataset, the atomized data point being implemented as a triple, the data arrangement representing at least a portion of a graph, the atomized data point being a representation for a relationship between two data units, and the consolidated dataset having a plurality of atomized data points of the one or more data types also having links that, when parsed, identify one or more relationships between the plurality of atomized data points and the one or more data types including a resource associated with each of the atomized and the other atomized data points and a data type associated with the resource;
converting the atomized dataset, after being received, from a first data format to a second data format, the second data format being a collaborative data format configured to be used to form a portion of the graph;
determining resource requirements data to describe a capability to operate a database configured to access graph-based data to identify at least one resource requirement;
selecting a data store type based on the at least one resource requirement;
performing a load operation of the atomized dataset as a function of the data store;
receiving a query to access the atomized dataset;
classifying at least a portion of the query directed to the dataset to determine a classification type, whereby the classification type is associated with a type of query for a query portion associated with a specific entity; and
applying the portion of the query to at least one of a number of data stores, a subset of which includes one or more types of triplestore-based graph databases.
Dependent claims: 2, 3, 4, 5, 6, 7, 8
9. A system comprising:
a processor and a memory to store one or more executable instructions, the processor configured to execute instructions to implement an atomized workflow loader configured to receive an atomized dataset to load into a graph-based data store, the atomized dataset including a data arrangement in which data is stored as an atomized data point with one or more other atomized data points of one or more data types as a consolidated dataset, the atomized data point being implemented as a triple, the data arrangement representing at least a portion of a graph, the atomized data point being a representation for a relationship between two data units, and the consolidated dataset having a plurality of atomized data points of the one or more data types also having links that, when parsed, identify one or more relationships between the plurality of atomized data points and the one or more data types including a resource associated with each of the atomized and the other atomized data points and a data type associated with the resource, to convert the atomized dataset, after being received, from a first data format to a second data format, the second data format being a collaborative data format configured to be used to form a portion of the graph, to determine resource requirements data to describe a capability to operate a database configured to access graph-based data to identify at least one resource requirement, the atomized workflow loader further configured to select a data store type based on the at least one resource requirement, perform a load operation of the atomized dataset as a function of the data store type, receive a query to access the atomized dataset, classify at least a portion of the query directed to the dataset to determine a classification type, whereby the classification type is associated with a type of query for a query portion associated with a specific entity, and apply the portion of the query to at least one of a number of different data stores, a subset of which includes one or more types of triplestore-based graph databases.
Dependent claims: 10, 11, 12, 13, 14, 15, 16
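The classify-and-apply portion of the claims can be illustrated with a small routing sketch. The claims do not specify how a query portion is classified, so the keyword heuristics below are purely an assumption, as are the classification names (`aggregate`, `traversal`, `lookup`) and the store names in `ROUTES`. The example queries use SPARQL syntax only because triplestores commonly accept it.

```python
import re
from typing import Tuple

# Hypothetical mapping from classification type to a target data store.
ROUTES = {
    "aggregate": "columnar-store",
    "traversal": "triplestore-a",
    "lookup": "triplestore-b",
}

# Aggregate functions or GROUP BY suggest an analytic query portion.
_AGGREGATE = re.compile(r"\b(COUNT|SUM|AVG|MIN|MAX)\s*\(|\bGROUP\s+BY\b", re.IGNORECASE)
# A predicate immediately followed by + or * suggests a SPARQL property path.
_PROPERTY_PATH = re.compile(r"[\w>][+*]\s")


def classify_query_portion(text: str) -> str:
    """Return a classification type for a query portion (heuristic, assumed)."""
    if _AGGREGATE.search(text):
        return "aggregate"
    if _PROPERTY_PATH.search(text):
        return "traversal"
    return "lookup"


def route(text: str) -> Tuple[str, str]:
    """Classify the query portion and name the data store it would be applied to."""
    classification = classify_query_portion(text)
    return classification, ROUTES[classification]
```

A production classifier would more plausibly parse the query than pattern-match it; the sketch only shows the shape of the step, in which a classification type is derived first and then used to select among several data stores.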
Specification