DETERMINING A DEGREE OF SIMILARITY OF A SUBSET OF TABULAR DATA ARRANGEMENTS TO SUBSETS OF GRAPH DATA ARRANGEMENTS AT INGESTION INTO A DATA-DRIVEN COLLABORATIVE DATASET PLATFORM

  • US 20190095472A1
  • Filed: 09/20/2018
  • Published: 03/28/2019
  • Est. Priority Date: 03/09/2017
  • Status: Active Grant
First Claim

1. A method comprising:

    identifying subsets of data as columnar data associated with a data arrangement, the data arrangement being a tabular data arrangement including each of the subsets of data as a column of data;

    generating a similarity matrix of data associated with a subset of data for each column of data, the similarity matrix of data being configured to determine a degree of similarity to other datasets with which to join;

    accessing a plurality of similarity matrices each formed to identify an amount of relevant data associated with a dataset disposed in a graph data arrangement;

    analyzing the similarity matrix of data in view of the plurality of similarity matrices;

    identifying a subset of the plurality of similarity matrices to form a subset of relevant similarity matrices;

    generating links among the column of data and a subset of the other datasets associated with the subset of relevant similarity matrices; and

    forming a subset of the links between the column of data and at least one of the other datasets.
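
The steps of claim 1 can be illustrated with a minimal sketch. The claim does not say how a similarity matrix is computed or compared, so everything concrete here is an assumption made for illustration only: each column's "similarity matrix" is stood in for by a set of normalized values, comparison uses Jaccard similarity, relevance is a fixed threshold, and the final subset of links is the top-scoring candidates. The function names, the `threshold` and `top_k` parameters, and the sample data are all hypothetical and are not taken from the patent.

```python
from typing import Dict, List, Set, Tuple

Signature = Set[str]  # assumed stand-in for a per-column "similarity matrix"


def column_signatures(table: Dict[str, List[str]]) -> Dict[str, Signature]:
    """Treat each column as a subset of data and build one signature per column."""
    return {name: {v.strip().lower() for v in values} for name, values in table.items()}


def jaccard(a: Signature, b: Signature) -> float:
    """Jaccard similarity (an assumed measure): |a & b| / |a | b|."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def candidate_links(
    column_sig: Signature,
    graph_sigs: Dict[str, Signature],
    threshold: float,
) -> List[Tuple[str, float]]:
    """Compare one column against stored dataset signatures and keep the relevant ones."""
    scored = [(dataset_id, jaccard(column_sig, sig)) for dataset_id, sig in graph_sigs.items()]
    relevant = [pair for pair in scored if pair[1] >= threshold]
    return sorted(relevant, key=lambda pair: pair[1], reverse=True)


def form_links(
    table: Dict[str, List[str]],
    graph_sigs: Dict[str, Signature],
    threshold: float = 0.3,
    top_k: int = 1,
) -> Dict[str, List[Tuple[str, float]]]:
    """Generate candidate links per column, then keep only the top_k of them."""
    return {
        name: candidate_links(sig, graph_sigs, threshold)[:top_k]
        for name, sig in column_signatures(table).items()
    }


# Hypothetical ingested table and hypothetical signatures of datasets already in the graph.
table = {
    "zip": ["78701", "78702", "78703"],
    "city": ["Austin", "Austin", "Dallas"],
}
graph_sigs = {
    "us_zip_codes": {"78701", "78702", "78703", "78704"},
    "tx_cities": {"austin", "dallas", "houston"},
    "colors": {"red", "blue"},
}

links = form_links(table, graph_sigs)
print(links)
```

With this sample data, the `zip` column links to the hypothetical `us_zip_codes` dataset and the `city` column to `tx_cities`, while `colors` falls below the threshold for both. A production system would likely replace the exact value sets with compact sketches so that signatures can be stored and compared at scale; that choice is outside what the claim text specifies.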

  • 1 Assignment