Knowledge catalysts

  • US 10,127,292 B2
  • Filed: 11/25/2013
  • Issued: 11/13/2018
  • Est. Priority Date: 12/03/2012
  • Status: Active Grant
First Claim

1. A computer implemented method of integrating data from remote disparate data sources comprising a non-transitory media, comprising programming for:

  • detecting data sets in different formats having a plurality of fields hosted in a plurality of remote heterogeneous databases accessible through infrastructures that are coupled through a distributed network;

    extracting schema data of the plurality of remote heterogeneous databases;

    modeling each position of selected plurality of fields of the plurality of remote heterogeneous databases as a plurality of polynomials, identifying related fields in two or more of the plurality of remote heterogeneous databases by automatically hypothesizing data links based on column features that identify the number of distinct data items in each column in the plurality of remote heterogeneous databases and fuzzy logic matching that compares divergence of the plurality of polynomials; and

    linking the related fields automatically in the two or more of the plurality of remote heterogeneous databases through a virtual warehouse.
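
The claim does not spell out how the polynomials are built or how their divergence is scored, so the following is only an illustrative sketch of one possible reading, not the patented implementation. It assumes numeric columns, models each column by a least-squares polynomial fitted to its sorted values, and combines a distinct-count feature with a polynomial-divergence score to hypothesize links. All names (`column_features`, `fit_polynomial`, `divergence`, `link_score`, `hypothesize_links`), the sample tables, and the 0.8 threshold are hypothetical.

```python
from itertools import combinations


def column_features(values):
    """Column feature named in the claim: number of distinct items (plus its ratio)."""
    distinct = len(set(values))
    return {"distinct": distinct, "ratio": distinct / len(values)}


def fit_polynomial(values, degree=3):
    """Least-squares polynomial through the column's sorted values.

    Assumption: one possible reading of "modeling ... as polynomials".
    Needs at least degree + 1 values. Coefficients are lowest power first.
    """
    ys = sorted(float(v) for v in values)
    n = len(ys)
    xs = [i / (n - 1) for i in range(n)]  # positions rescaled to [0, 1]
    m = degree + 1
    # Normal equations A c = b for the coefficient vector c.
    a = [[sum(x ** (i + j) for x in xs) for j in range(m)] for i in range(m)]
    b = [sum(y * x ** i for x, y in zip(xs, ys)) for i in range(m)]
    # Gaussian elimination with partial pivoting, then back substitution.
    for col in range(m):
        pivot = max(range(col, m), key=lambda r: abs(a[r][col]))
        a[col], a[pivot] = a[pivot], a[col]
        b[col], b[pivot] = b[pivot], b[col]
        for row in range(col + 1, m):
            factor = a[row][col] / a[col][col]
            for k in range(col, m):
                a[row][k] -= factor * a[col][k]
            b[row] -= factor * b[col]
    coeffs = [0.0] * m
    for row in reversed(range(m)):
        tail = sum(a[row][k] * coeffs[k] for k in range(row + 1, m))
        coeffs[row] = (b[row] - tail) / a[row][row]
    return coeffs


def evaluate(coeffs, x):
    return sum(c * x ** i for i, c in enumerate(coeffs))


def divergence(p, q, samples=50):
    """Mean squared gap between two polynomials sampled on [0, 1]."""
    grid = [i / (samples - 1) for i in range(samples)]
    return sum((evaluate(p, x) - evaluate(q, x)) ** 2 for x in grid) / samples


def link_score(col_a, col_b):
    """Fuzzy score in (0, 1]: distinct-ratio similarity times polynomial similarity."""
    fa, fb = column_features(col_a), column_features(col_b)
    ratio_similarity = 1.0 - abs(fa["ratio"] - fb["ratio"])
    shape_similarity = 1.0 / (1.0 + divergence(fit_polynomial(col_a), fit_polynomial(col_b)))
    return ratio_similarity * shape_similarity


def hypothesize_links(tables, threshold=0.8):
    """Propose links between columns that live in different tables."""
    columns = [(t, c, v) for t, cols in tables.items() for c, v in cols.items()]
    links = []
    for (ta, ca, va), (tb, cb, vb) in combinations(columns, 2):
        if ta != tb:
            score = link_score(va, vb)
            if score >= threshold:
                links.append((f"{ta}.{ca}", f"{tb}.{cb}", round(score, 3)))
    return links


# Hypothetical sample data standing in for two remote databases.
tables = {
    "crm": {"customer_id": [101, 102, 103, 104, 105, 106],
            "age": [23, 35, 35, 41, 52, 23]},
    "billing": {"cust_no": [103, 101, 106, 102, 105, 104],
                "amount": [1200.0, 80.5, 950.0, 80.5, 4000.0, 310.0]},
}
links = hypothesize_links(tables)
print(links)
```

In this sketch the only proposed link is between `crm.customer_id` and `billing.cust_no`, because the two columns share the same distinct-value ratio and the same fitted curve. A real system would also need handling for non-numeric fields and differing scales, which the claim leaves unspecified.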

  • 2 Assignments