DYNAMIC COMPOSITE DATA DICTIONARY TO FACILITATE DATA OPERATIONS VIA COMPUTERIZED TOOLS CONFIGURED TO ACCESS COLLABORATIVE DATASETS IN A NETWORKED COMPUTING PLATFORM
First Claim
1. A method comprising:
- receiving data representing a dataset into dataset ingestion controller;
identifying a first data arrangement in which the data representing the dataset has a first format;
analyzing the data representing the dataset to determine a first subset of identifiers for subsets of data;
forming a first data dictionary including the first subset of identifiers for the subsets of data in the dataset;
formatting the dataset into a second data arrangement having a second format;
receiving data originating at a data project user interface to link the dataset to another dataset, which is associated with a second data dictionary; and
forming a composite data dictionary including the first data dictionary and the second data dictionary.
1 Assignment
0 Petitions
Accused Products
Abstract
Various embodiments relate generally to data science and data analysis, computer software and systems, network communications to interface among repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform configured to provide one or more computerized tools that facilitate data projects by providing an interactive, project-centric workspace interface that may include, for example, a unified view in which to identify data sources, generate transformative datasets, and/form queries over a composite data dictionary coupled to collaborative computing devices and user accounts. For example, a method may include forming a first data dictionary, linking a dataset associated with the first data dictionary to another dataset, which may be associated with a second data dictionary, and forming a dynamic composite data dictionary.
73 Citations
20 Claims
-
1. A method comprising:
-
receiving data representing a dataset into dataset ingestion controller; identifying a first data arrangement in which the data representing the dataset has a first format; analyzing the data representing the dataset to determine a first subset of identifiers for subsets of data; forming a first data dictionary including the first subset of identifiers for the subsets of data in the dataset; formatting the dataset into a second data arrangement having a second format; receiving data originating at a data project user interface to link the dataset to another dataset, which is associated with a second data dictionary; and forming a composite data dictionary including the first data dictionary and the second data dictionary. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16)
-
-
17. An apparatus comprising:
-
a memory including executable instructions; and a processor, responsive to executing the instructions, is configured to; receive data representing a dataset into dataset ingestion controller; identify a first data arrangement in which the data representing the dataset has a first format; analyze the data representing the dataset to determine a first subset of identifiers for subsets of data; form a first data dictionary including the first subset of identifiers for the subsets of data in the dataset; format the dataset into a second data arrangement having a second format; receive data originating at a data project user interface to link the dataset to another dataset, which is associated with a second data dictionary; and form a composite data dictionary including the first data dictionary and the second data dictionary. - View Dependent Claims (18, 19, 20)
-
Specification