MANAGEMENT OF COLLABORATIVE DATASETS VIA DISTRIBUTED COMPUTER NETWORKS

US 20170364703A1
Filed: 06/19/2016
Published: 12/21/2017
Est. Priority Date: 06/19/2016
Status: Active Grant

First Claim

Patent Images

1. A method comprising:

receiving data representing a dataset having a data format into a collaborative dataset consolidation system;

receiving data representing attributes associated with the dataset, the attributes including an account identifier;

identifying a first version of the dataset associated with a first subset of atomized data points;

identifying a subset of data that varies from the first version of the dataset;

converting the subset of data to a second subset of atomized data points having a specific format similar to the first subset;

generating a second version of the dataset to include the first subset of atomized data points and the second subset of atomized data points; and

storing the first subset of atomized data points and the second subset of atomized data points as an atomized dataset in one or more repositories.

View all claims

1 Assignment

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets, whereby a collaborative data layer and associated logic facilitate, for example, efficient access to, and implementation of, collaborative datasets. In some examples, a method may include receiving a dataset and dataset attributes and identifying a first version of the dataset. The method may include identifying data that varies from a first version of the dataset, and generating a second version of the dataset to include a first subset and a second subset of atomized data. The method may include storing subsets of atomized data points as an atomized dataset.

88 Citations

View as Search Results

14 Claims

1. A method comprising:
- receiving data representing a dataset having a data format into a collaborative dataset consolidation system;
  
  receiving data representing attributes associated with the dataset, the attributes including an account identifier;
  
  identifying a first version of the dataset associated with a first subset of atomized data points;
  
  identifying a subset of data that varies from the first version of the dataset;
  
  converting the subset of data to a second subset of atomized data points having a specific format similar to the first subset;
  
  generating a second version of the dataset to include the first subset of atomized data points and the second subset of atomized data points; and
  
  storing the first subset of atomized data points and the second subset of atomized data points as an atomized dataset in one or more repositories.
- View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
- - 2. The method of claim 1 wherein each of the atomized data point is data representing an addressable fact.
  - 3. The method of claim 1 wherein storing the first subset of atomized data points and the second subset of atomized data points as the atomized dataset comprises:
    - storing atomized data points as triples.
  - 4. The method of claim 3 wherein at least one triple of the triples are formatted to comply with a Resource Description Framework (“
    - RDF”
      
      ) data model.
  - 5. The method of claim 1 wherein generating the second version of the dataset to include the first subset of atomized data points comprises:
    - generating a data pointer to a memory location at which the first subset of atomized data points is stored.
  - 6. The method of claim 5 wherein storing the first subset of atomized data points comprises:
    - storing the data pointer as the first subset of atomized data points.
  - 7. The method of claim 1 further comprising:
    - identifying that the subset of data is associated with a protected dataset;
      
      determining access to the protected dataset is authorized in association with the account identifier; and
      
      forming the second version of the dataset.
  - 8. The method of claim 1 further comprising:
    - receiving a request in association with another account identifier to access the atomized dataset;
      
      determining access to the subset of data is not authorized; and
      
      denying access to the atomized dataset in association with the another account identifier.
  - 9. The method of claim 1 further comprising managing dataset attributes associated with the atomized dataset.
  - 10. The method of claim 9 wherein managing the dataset attributes comprises:
    - analyzing atomized datasets associated with the collaborative dataset consolidation system; and
      
      identifying a number of queries associated with the atomized dataset.
  - 11. The method of claim 9 wherein managing the dataset attributes comprises:
    - analyzing atomized datasets associated with the collaborative dataset consolidation system; and
      
      identifying a subset of other account identifiers that include descriptive data that correlate to the atomized dataset.
  - 12. The method of claim 11 further comprising:
    - generating a data signal specifying information for at least one of the account identifiers that accessed the descriptive data; and
      
      causing presentation of the information in an activity feed portion of a user interface.
  - 13. The method of claim 9 wherein managing the dataset attributes comprises:
    - analyzing atomized datasets associated with the collaborative dataset consolidation system; and
      
      identifying a subset of other atomized datasets including similar classification types.
  - 14. The method of claim 13 further comprising:
    - generating a data signal specifying information for at least one of the other atomized datasets; and
      
      causing presentation of the information in a recommendation portion of a user interface.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Data.World, Inc.
Original Assignee
Data.World, Inc.
Inventors
Jacob, Bryon Kristen, Griffith, David Lee, Le, Triet Minh, Loyens, Jon, Hurt, Brett A., Keen, Arthur Albert

Granted Patent

US 10,346,429 B2
Time in Patent Office

Days
Field of Search
US Class Current
CPC Class Codes

G06F 16/178   Techniques for file synchro...

G06F 16/21   Design, administration or m...

G06F 16/2365   Ensuring data consistency a...

G06F 16/2455   Query execution

G06F 16/258   Data format conversion from...

G06F 16/273   Asynchronous replication or...

G06F 21/6227   where protection concerns t...

G06F 2221/2141   Access rights, e.g. capabil...

MANAGEMENT OF COLLABORATIVE DATASETS VIA DISTRIBUTED COMPUTER NETWORKS

First Claim

1 Assignment

0 Petitions

Accused Products

Abstract

88 Citations

14 Claims

Specification

Solutions

Use Cases

Quick Links

MANAGEMENT OF COLLABORATIVE DATASETS VIA DISTRIBUTED COMPUTER NETWORKS

First Claim

1 Assignment

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

88 Citations

14 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links