MANAGEMENT OF COLLABORATIVE DATASETS VIA DISTRIBUTED COMPUTER NETWORKS
First Claim
1. A method comprising:
- receiving data representing a dataset having a data format into a collaborative dataset consolidation system;
receiving data representing attributes associated with the dataset, the attributes including an account identifier;
identifying a first version of the dataset associated with a first subset of atomized data points;
identifying a subset of data that varies from the first version of the dataset;
converting the subset of data to a second subset of atomized data points having a specific format similar to the first subset;
generating a second version of the dataset to include the first subset of atomized data points and the second subset of atomized data points; and
storing the first subset of atomized data points and the second subset of atomized data points as an atomized dataset in one or more repositories.
1 Assignment
0 Petitions
Accused Products
Abstract
Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets, whereby a collaborative data layer and associated logic facilitate, for example, efficient access to, and implementation of, collaborative datasets. In some examples, a method may include receiving a dataset and dataset attributes and identifying a first version of the dataset. The method may include identifying data that varies from a first version of the dataset, and generating a second version of the dataset to include a first subset and a second subset of atomized data. The method may include storing subsets of atomized data points as an atomized dataset.
88 Citations
14 Claims
-
1. A method comprising:
-
receiving data representing a dataset having a data format into a collaborative dataset consolidation system; receiving data representing attributes associated with the dataset, the attributes including an account identifier; identifying a first version of the dataset associated with a first subset of atomized data points; identifying a subset of data that varies from the first version of the dataset; converting the subset of data to a second subset of atomized data points having a specific format similar to the first subset; generating a second version of the dataset to include the first subset of atomized data points and the second subset of atomized data points; and storing the first subset of atomized data points and the second subset of atomized data points as an atomized dataset in one or more repositories. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
Specification