PLATFORM MANAGEMENT OF INTEGRATED ACCESS OF PUBLIC AND PRIVATELY-ACCESSIBLE DATASETS UTILIZING FEDERATED QUERY GENERATION AND QUERY SCHEMA REWRITING OPTIMIZATION
Abstract
Various techniques are described for platform management of integrated access of public and privately-accessible datasets utilizing federated query generation and query schema rewriting optimization, including receiving at a dataset access platform a query formatted according to a first data schema, generating a copy of the query, saving the query and the copy to a datastore, parsing the copy of the query in the first schema using an inference engine, determining whether the query comprises data associated with an access control condition associated with accessing the dataset, the access control condition being configured to indicate whether the query is permitted to access the dataset, and rewriting, using a proxy server, the copy of the query in a second schema by converting the copy of the query into a triple associated with the query and another triple associated with the access control condition.
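The rewriting step described above can be illustrated with a minimal sketch. It assumes, purely for illustration, that the first schema is a single-column SQL-like query and that the second schema is a list of subject-predicate-object triples. The `ex:` prefix, the predicate names, and the function `rewrite_as_triples` are hypothetical and do not come from the source.

```python
import re
from typing import List, NamedTuple


class Triple(NamedTuple):
    subject: str
    predicate: str
    obj: str


# Hypothetical vocabulary prefix; the source does not name one.
EX = "ex:"

# Accepts only "SELECT <column> FROM <dataset>" to keep the sketch small.
_QUERY_SHAPE = re.compile(
    r"\s*SELECT\s+(\w+)\s+FROM\s+(\w+)\s*;?\s*", re.IGNORECASE
)


def rewrite_as_triples(query_copy: str, access_data: str) -> List[Triple]:
    """Convert a copy of a query into one triple for the query itself and
    another triple for the data tied to the access control condition."""
    match = _QUERY_SHAPE.fullmatch(query_copy)
    if match is None:
        raise ValueError("unsupported query shape for this sketch")
    attribute, dataset = match.groups()
    query_triple = Triple(EX + dataset, EX + "hasAttribute", EX + attribute)
    access_triple = Triple(EX + dataset, EX + "accessedWith", access_data)
    return [query_triple, access_triple]
```

Calling `rewrite_as_triples("SELECT price FROM sales", "token-123")` returns two triples that share the subject `ex:sales`, one describing the requested attribute and one carrying the access data.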
20 Claims
1. A method, comprising:
receiving at a dataset access platform a query formatted using a first schema, the query comprising data associated with a request to access a dataset;
generating a copy of the query;
saving the query and the copy to a datastore, the query being identified as a master;
parsing the copy of the query in the first schema, the parsing being performed by an inference engine to infer the first schema and an attribute associated with the query, and to generate a graph having one or more data links between the dataset and another dataset accessible by the dataset access platform;
determining, during the parsing, whether the query comprises other data associated with an access control condition associated with accessing the dataset, the access control condition being configured to indicate whether the query is configured to provide authentication data to access the dataset; and
rewriting, using a proxy server, the copy of the query in a second schema, if the access control condition indicates the query is configured to authenticate access to the dataset, the rewriting comprising converting, using a framework, the copy of the query into a triple associated with the query and another triple associated with the other data, the triple and the another triple being included in a rewritten query directed to one or more endpoints associated with the dataset access platform.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15
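The copy-and-master bookkeeping and the access control gate recited in claim 1 might be sketched as follows. This is an illustrative reading, not the patented implementation: the `Query` and `Datastore` classes, the `credentials` field standing in for the "other data", and the function `handle_query` are all hypothetical, and the proxy-server rewrite is reduced to a placeholder dictionary.

```python
import copy
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Query:
    text: str
    # Hypothetical stand-in for the "other data" tied to the access
    # control condition; None means no authentication data was supplied.
    credentials: Optional[str] = None
    is_master: bool = False


@dataclass
class Datastore:
    records: List[Query] = field(default_factory=list)

    def save(self, query: Query) -> None:
        self.records.append(query)


def handle_query(incoming: Query, store: Datastore) -> Optional[Dict[str, str]]:
    # Generate a copy, mark the original as the master, and save both.
    working_copy = copy.deepcopy(incoming)
    incoming.is_master = True
    working_copy.is_master = False
    store.save(incoming)
    store.save(working_copy)

    # While examining the copy, check whether authentication data is present.
    if working_copy.credentials is None:
        # The claim recites rewriting only when the condition holds.
        return None

    # Placeholder for the rewrite into a second schema; only the copy is
    # read here, so the saved master keeps the query as received.
    return {"query": working_copy.text, "auth": working_copy.credentials}
```

Working on the copy leaves the master record unchanged, which is one plausible reason for saving both versions.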
16. A system, comprising:
a database configured to store data associated with a request to access a dataset and other data associated with an access control condition associated with accessing the dataset; and
an application platform comprising logic configured
to receive a query formatted using a first schema, the query comprising the data,
to generate a copy of the query,
to save the query and the copy to a datastore, the query being identified as a master,
to parse the copy of the query in the first schema, the parsing being performed by an inference engine, the parsing being configured to identify the dataset, to infer an attribute associated with the query, and to generate one or more data links between the dataset and another dataset accessible by the dataset access platform,
to determine, during the parsing, whether the query comprises the other data, the access control condition being configured to indicate whether the query is configured to provide authentication data to access the dataset, and
to rewrite, using a proxy server, the copy of the query in a second schema, if the access control condition indicates the query is configured to authenticate access to the dataset, the rewriting comprising converting, using a framework, the copy of the query into a triple associated with the query and another triple associated with the other data, the triple and the another triple being included in a rewritten query directed to one or more endpoints associated with the dataset access platform.
Dependent claims: 17, 18, 19
20. A non-transitory computer readable medium having one or more computer program instructions configured to perform a method, the method comprising:
receiving at a dataset access platform a query formatted using a first schema, the query comprising data associated with a request to access a dataset;
generating a copy of the query;
saving the query and the copy to a datastore, the query being identified as a master;
parsing the copy of the query in the first schema, the parsing being performed by an inference engine, the parsing being configured to identify the dataset, to infer an attribute associated with the query, and to generate one or more data links between the dataset and another dataset accessible by the dataset access platform;
determining, during the parsing, whether the query comprises other data associated with an access control condition associated with accessing the dataset, the access control condition being configured to indicate whether the query is configured to provide authentication data to access the dataset; and
rewriting, using a proxy server, the copy of the query in a second schema, if the access control condition indicates the query is configured to authenticate access to the dataset, the rewriting comprising converting, using a framework, the copy of the query into a triple associated with the query and another triple associated with the other data, the triple and the another triple being included in a rewritten query directed to one or more endpoints associated with the dataset access platform.
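The claims recite generating data links between the queried dataset and another dataset but do not say how the inference engine finds them. One simple possibility, offered only as an assumption, is to link two datasets whenever they share a column name. The function `link_datasets` and the sample dataset names below are hypothetical.

```python
from itertools import combinations
from typing import Dict, List, Set, Tuple


def link_datasets(
    columns_by_dataset: Dict[str, Set[str]]
) -> List[Tuple[str, str, str]]:
    """Return (dataset, shared_column, other_dataset) edges for every pair
    of datasets that have at least one column name in common."""
    links: List[Tuple[str, str, str]] = []
    # Sort names so the output order is deterministic.
    for left, right in combinations(sorted(columns_by_dataset), 2):
        shared = columns_by_dataset[left] & columns_by_dataset[right]
        for column in sorted(shared):
            links.append((left, column, right))
    return links


catalog = {
    "sales": {"region", "price"},
    "census": {"region", "population"},
    "weather": {"station"},
}
print(link_datasets(catalog))
```

Each returned tuple can be read as an edge in a graph whose nodes are datasets. Matching on column names alone is crude; a production system would presumably use richer inference than this.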
Specification