Systems and methods for generating schemas that represent multiple data sources
First Claim
1. A method for generating a negotiated schema for multiple data sources, the method comprising:
- interrogating multiple data sources via a computing device communicatively couplable with each of the multiple data sources, each of the multiple data sources including different, unknown types of data structures that define how data is stored in each data source, the interrogation of at least one of the data sources including;
comparing the unknown data structure of the data source to a plurality of schema templates,determining a fuzzy template match between the unknown data structure and one or more of the plurality of schema templates based on the comparison, andselecting a schema representing the unknown data structure that is a fuzzy template match between the unknown data structure and a schema template of one or more of the plurality of schema templates;
generating a negotiated schema by;
comparing the selected schemas of the multiple data sources to one another, the selected schemas including at least one schema representing the unknown data structure that is the fuzzy template match; and
interrelating data points of the multiple data sources that correspond to one another;
interrelating the negotiated schema with the selected schema for each of the multiple data sources based upon the interrelation of the data points; and
storing the negotiated schema in a storage media by way of the computing device.
5 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods generating schemas that represent multiple data sources are provided herein. According to some embodiments, methods may include determining a schema for each of the multiple data sources via a computing device communicatively couplable with each of the multiple data sources, each of the multiple data sources including one or more data structures that define how data is stored in the data source, generating a negotiated schema by comparing the schemas of the multiple data sources to one another and interrelating data points of the multiple data sources based upon the schemas, interrelating the negotiated schema with the schema for each of the multiple data sources based upon the interrelation of the data points, and storing the negotiated schema in a storage media by way of the computing device.
-
Citations
20 Claims
-
1. A method for generating a negotiated schema for multiple data sources, the method comprising:
-
interrogating multiple data sources via a computing device communicatively couplable with each of the multiple data sources, each of the multiple data sources including different, unknown types of data structures that define how data is stored in each data source, the interrogation of at least one of the data sources including; comparing the unknown data structure of the data source to a plurality of schema templates, determining a fuzzy template match between the unknown data structure and one or more of the plurality of schema templates based on the comparison, and selecting a schema representing the unknown data structure that is a fuzzy template match between the unknown data structure and a schema template of one or more of the plurality of schema templates; generating a negotiated schema by; comparing the selected schemas of the multiple data sources to one another, the selected schemas including at least one schema representing the unknown data structure that is the fuzzy template match; and interrelating data points of the multiple data sources that correspond to one another; interrelating the negotiated schema with the selected schema for each of the multiple data sources based upon the interrelation of the data points; and storing the negotiated schema in a storage media by way of the computing device. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system for generating a negotiated schema for multiple data sources, the system comprising:
-
a memory for storing executable instructions; and a processor for executing the instructions, the instructions including; an interrogation module that interrogates multiple data sources, each of the multiple data sources including different, unknown types of data structures that define how data is stored in each data source, the interrogation of at least one of the data sources including; comparing the unknown data structure of the data source to a plurality of schema templates, determining a fuzzy template match between the unknown data structure and each of the plurality of schema templates based on the comparison, and selecting a schema representing the unknown data structure that is a fuzzy template match between the unknown data structure and a schema template of one or more of the plurality of schema templates; and a schema generator that generates a negotiated schema by; applying heuristic logic to determine correspondence between data points of the schemas of the multiple data sources to at least one of one another and one or more pre-existing schemas; interrelating data points of the multiple data sources that correspond to one another; interrelating the negotiated schema with selected schema for each of the multiple data sources based upon the interrelation of the data points; and storing the negotiated schema in a storage media. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. A non-transitory computer readable storage media having a program embodied thereon, the program being executable by a processor to perform a method for generating a negotiated schema for multiple data sources, the method comprising:
-
interrogating multiple data sources via a computing device communicatively couplable with each of the multiple data sources, each of the multiple data sources including one or more different, unknown types of data structures that define how data is stored in each data source, the interrogation of each of the data sources including; comparing the unknown data structure of the data source to a plurality of schema templates, identifying a plurality of schema templates that partially correspond to the unknown data structure of the data source, and using heuristic logic to select a schema for representing the unknown data structure, the selected schema being a closest partial match between the unknown data structure and the plurality of identified schema templates; generating a negotiated schema by; establishing linkages between multiple data sources using an adaptive algorithm; and interrelating data points of the multiple data sources that correspond to one another; interrelating the negotiated schema with the selected schema for each of the multiple data sources based upon the interrelation of the data points; and storing the negotiated schema in a storage media by way of the computing device.
-
Specification