System for management of source and derivative data
First Claim
1. A data management system, comprising:
- a source data database containing at least one source data set;
a processing engine for performing a first process to apply of one or more computationally deterministic transformations to the source data set to produce at least one derivative data set, to generate an identifier associated with the produced derivative data set, and to embed the associated identifier within each produced derivative data set;
a derivative data database containing a record of transformations for each derivative data set and all parameters describing each of the transformations;
wherein the embedded identifier comprises means for locating the stored source data set at the derivative data database and means for retrieving said transformations and all the parameters describing each of the transformations from the derivative data database; and
a second process adapted to use the embedded identifier to retrieve the source data set and the transformations stored in the derivative data database and reinitiate the first process to generate additional derivative data for the derivative data set;
wherein the source data set used in the reinitiated process comprises any of the original source data set and at least one alternate version of the original source data set; and
wherein each of the one or more computationally deterministic transformations applied to the source data set in the reinitiated process comprises any of the corresponding original computationally deterministic transformations and at least one alternate version of the corresponding original computationally deterministic transformations.
0 Assignments
0 Petitions
Accused Products
Abstract
Source data is centralized in a database and derivative data sets are formed from the source data. When it is desired to modify derivative data, the source data can be accessed and modified to form a new derivative data set, instead of modifying the prior data set, such that source data integrity is maintained. Tags are associated with derivative data, which can be embedded in the derivative data or associated with the derivative data as an attached element. Tags identify information such as the server that generated the derivative data, the source data and any tasks or transformations that were applied to the source data to generate the derivative data. Users with assigned access privileges to source data can be given access to a source data repository, whereby a number of users can access the source files and modify derivative data files by changes in the source data file.
230 Citations
24 Claims
-
1. A data management system, comprising:
-
a source data database containing at least one source data set;
a processing engine for performing a first process to apply of one or more computationally deterministic transformations to the source data set to produce at least one derivative data set, to generate an identifier associated with the produced derivative data set, and to embed the associated identifier within each produced derivative data set;
a derivative data database containing a record of transformations for each derivative data set and all parameters describing each of the transformations;
wherein the embedded identifier comprises means for locating the stored source data set at the derivative data database and means for retrieving said transformations and all the parameters describing each of the transformations from the derivative data database; and
a second process adapted to use the embedded identifier to retrieve the source data set and the transformations stored in the derivative data database and reinitiate the first process to generate additional derivative data for the derivative data set;
wherein the source data set used in the reinitiated process comprises any of the original source data set and at least one alternate version of the original source data set; and
wherein each of the one or more computationally deterministic transformations applied to the source data set in the reinitiated process comprises any of the corresponding original computationally deterministic transformations and at least one alternate version of the corresponding original computationally deterministic transformations. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15)
-
-
16. A data management system, comprising:
-
a process that contains a source data set;
a first server associated with the process, the server including a processing engine, wherein the engine processed the source data set to form a derivative data set, to generate an identifier associated with the formed derivative data set, and to embed the associated identifier within each formed derivative data set;
a storage medium for receiving the derivative data set;
a second server for distributing the derivative data set;
a first database having at least one data structure associated with the source data set; and
a second database having at least one data structure associated with the derivative data set and having data that identifies the second data set as a derivative of the source data set;
wherein the identifier embedded within the formed derivative data set comprises;
means for locating the at least one data structure associated with the source data set; and
means for retrieving the process through which the source data set formed the derivative data set.
-
-
17. A method for managing data, comprising the steps of:
-
providing a source data repository having source data sets;
providing access to at least one user to the source data repository;
forming one additional data repository having a subset of the source data sets from the source data repository, wherein the subset of the source data sets is provided from the user;
receiving requests from the user in the additional data repository to form derivative data sets from the subset of the source data sets;
selectively processing the requests; and
forming derivative data sets in response to the requests, comprising the steps of;
applying one or more computationally deterministic transformations to each of the requested source data sets to form each derivative data set;
generating identifiers each uniquely associated with each formed derivative data set; and
embedding each of the associated identifiers with their corresponding formed derivative data set;
wherein each of the embedded identifiers comprises means for performing the steps of;
locating the requested source data set that corresponds to the corresponding formed derivative data set, and means for locating the sequence of computationally deterministic transformations applied to the requested source data set that corresponds to the corresponding formed derivative data set. - View Dependent Claims (18, 19)
-
-
20. An enhanced data asset, wherein the enhanced data asset is formed by a process performed on a source data set, the enhanced data asset comprising:
-
means for locating at least one data structure associated with the source data set; and
means for retrieving the process through which the source data set formed the derivative data set.
-
-
21. A compound document, comprising:
-
at least one enhanced data asset formed by a process performed on a source data set, wherein the enhanced data asset comprises an embedded identifier comprising means for locating the source data set that corresponds to the corresponding formed enhanced data set, and means for locating the process by which the enhanced data set was formed; and
means for any of retrieving and locating any of the source data set and the process for a corresponding enhanced data asset, based on the embedded identifier.
-
-
22. A process implemented on a computer system, comprising the steps of:
-
creating a presentation file on the computer system;
integrating at least one enhanced data asset within the presentation file, the at least one enhanced data asset formed by a process performed on a source data set, wherein the enhanced data asset comprises an embedded identifier comprising means for performing the steps of locating the source data set that corresponds to the corresponding formed enhanced data set, and for locating the process by which the enhanced data set was formed; and
providing any of retrieving and locating any of the source data set and the process for a corresponding enhanced data asset, based on the embedded identifier. - View Dependent Claims (23, 24)
-
Specification