System for linking diverse data systems
First Claim
Patent Images
1. A method comprising:
- receiving a dataset at a communication interface, the dataset including dataset context information comprising metadata for the dataset;
determining by a processor a core model correlating to the dataset based on the dataset context information, the core model comprising a first dataset type node, a first database node, and a first relationship edge existing between the first dataset type node and the first database node, the first relationship edge establishing a relationship property for storage of the first dataset type in the first database node, the first database node corresponding to a first database of a plurality of databases in a diverse data system;
determining by the processor that a first portion of the dataset correlates to the first dataset type node;
referencing, by the processor, the first relationship edge between the first dataset type node and the first database node to determine the first database is a destination for storage of the first portion of the dataset;
transmitting by the communication interface the first portion of the dataset to the first database for storage;
instantiating, by linked data model (LDM) control circuitry, a first LDM instance in an LDM, the first LDM instance comprising;
definitional structure that represents the core model;
a representation of the first portion of the dataset as an instance of the first dataset type node of the core model; and
a representation of the first database as an instance of the first database node of the core model;
receiving by the communication interface a query for at least the first portion of the dataset from a querying entity;
referencing, by the LDM control circuitry, the first LDM instance to determine the first database as the database in which the first portion of the dataset is stored;
transmitting by the communication interface the first portion of the dataset to the querying entity;
determining by the processor a semantic query response to the query by referencing, via the LDM control circuitry, a domain knowledge graph of the LDM, the domain knowledge graph including the first LDM instance as an LDM instance node related to a second LDM instance node in the domain knowledge graph; and
transmitting by the communication interface an identification of the second LDM instance node to the querying entity.
1 Assignment
0 Petitions
Accused Products
Abstract
A system creates an abstraction layer surrounding a diverse data system including multiple different databases. Data is received from data sources and ingested into the various databases according to a core model. New instances of the core model are created and added to a larger linked data model (LDM) when new data sources are added to the system. The LDM captures the linkages between different linked data objects and links across different databases. Accordingly, applications are able to access or explore the linked data stored in different databases without prior knowledge of the linking relationships.
18 Citations
15 Claims
-
1. A method comprising:
-
receiving a dataset at a communication interface, the dataset including dataset context information comprising metadata for the dataset; determining by a processor a core model correlating to the dataset based on the dataset context information, the core model comprising a first dataset type node, a first database node, and a first relationship edge existing between the first dataset type node and the first database node, the first relationship edge establishing a relationship property for storage of the first dataset type in the first database node, the first database node corresponding to a first database of a plurality of databases in a diverse data system; determining by the processor that a first portion of the dataset correlates to the first dataset type node; referencing, by the processor, the first relationship edge between the first dataset type node and the first database node to determine the first database is a destination for storage of the first portion of the dataset; transmitting by the communication interface the first portion of the dataset to the first database for storage; instantiating, by linked data model (LDM) control circuitry, a first LDM instance in an LDM, the first LDM instance comprising; definitional structure that represents the core model; a representation of the first portion of the dataset as an instance of the first dataset type node of the core model; and a representation of the first database as an instance of the first database node of the core model; receiving by the communication interface a query for at least the first portion of the dataset from a querying entity; referencing, by the LDM control circuitry, the first LDM instance to determine the first database as the database in which the first portion of the dataset is stored; transmitting by the communication interface the first portion of the dataset to the querying entity; determining by the processor a semantic query response to the query by referencing, via the LDM control circuitry, a domain knowledge graph of the LDM, the domain knowledge graph including the first LDM instance as an LDM instance node related to a second LDM instance node in the domain knowledge graph; and transmitting by the communication interface an identification of the second LDM instance node to the querying entity. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A system comprising:
-
a communication interface configured to receive a dataset including dataset context information comprising metadata associated with the dataset; a first core model including a first dataset type node, a first database node, and a first relationship edge existing between the first dataset type node and the first database node, the first relationship edge establishing a relationship property for storage of the first dataset type in the first database node, the first database node corresponding to a first database of a plurality of databases in a diverse data system; data ingestion circuitry in communication with the communication interface and configured to; determine the core model as correlating to the dataset based on the dataset context information; determine that a first portion of the dataset correlates to the first dataset type node; and reference the relationship edge between the first dataset node and the first database node in the core model to determine the first database is a destination for storage of the first portion of the dataset; the communication interface further configured to transmit the first portion of the dataset to the first database for storage; linked data model (LDM) maintenance circuitry in communication with the data ingestion circuitry and configured to; instantiate a first LDM instance in an LDM, the first LDM instance having, at least partially, a same structure as the core model and including a representation of the first portion of the dataset as an instance of the first dataset type node of the core model and a representation of the first database as an instance of the first database node of the core model; and maintain a domain knowledge graph of the LDM including a domain knowledge graph including a first node corresponding to the first LDM instance; and data exploration circuitry in communication with the communication interface and the LDM maintenance circuitry, the data exploration circuitry configured to; receive, from a user, a selection of the first node corresponding to the first LDM instance; communicate with the LDM maintenance circuitry to reference the first LDM instance to determine the first portion of the data set, the first database, and a second portion of the data set as associated with the first node; and provide to the user, the first portion of the dataset and a semantic response comprising an indication of an availability of a second portion of the dataset. - View Dependent Claims (10, 11, 12, 13, 14, 15)
-
Specification