×

DATA VIRTUALIZATION ACROSS HETEROGENEOUS FORMATS

  • US 20160055184A1
  • Filed: 08/25/2014
  • Published: 02/25/2016
  • Est. Priority Date: 08/25/2014
  • Status: Active Grant
First Claim
Patent Images

1. A method for virtualizing data across heterogeneous formats, the method comprising:

  • receiving, as input, a plurality of heterogeneous data sources;

    generating, for each of the plurality of heterogeneous data sources, a local schema graph comprising a set of attribute nodes and a set of type nodes, wherein an attribute node corresponds to a schema element in the heterogeneous data source comprising a domain with at least one value and is annotated with the value in the local schema graph, and wherein a type node corresponds to a schema element in the heterogeneous data source whose domain is defined recursively through at least one of one or more attribute nodes and one or more other type nodes; and

    generating a global schema graph based on each local schema graph that has been generated, wherein the global schema graph comprises each of the local schema graphs and at least one edge between at least one of two or more attributes nodes and two or more type nodes from different local schema graphs, and wherein the edge indicates a relationship between the data sources represented by the different local schema graphs comprising the two or more attributes nodes based on a computed similarity between at least one value associated with each of the two or more attributes nodes.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×