Method and apparatus for data integration and management
First Claim
1. A system for maintaining master reference data relating to entities, said system comprising:
- a master reference data storage for storing a plurality of master reference data sets based on reference data sets from different data storages, wherein each reference data set identifies a particular entity, wherein each of a plurality of the entities is identified by a plurality of reference data sets stored in a plurality of the data storages, wherein a master reference data set for a particular entity represents a most reliable reference data set stored for the particular entity in any one of the different data storages; and
a master reference manager for receiving a reference data set in real time, updating the master reference data set in real time based on the received reference data set, and maintaining content metadata for the master reference data set, wherein the master reference manager includes a trust framework for updating the master reference data set in real time, wherein the master reference data set includes a set of data fields, wherein updating the master reference data set comprises changing a current value of a data field of the master reference data set when the reference data set includes another value for the data field that is more reliable than the current value of the data field.
6 Assignments
0 Petitions
Accused Products
Abstract
A system for maintaining master reference data for entities is disclosed. The system includes multiple reference data sets at multiple different data storages. A reference data set for a particular entity uniquely identifies the particular entity. The system also includes a first master reference data set at a first data storage that is at least as reliable as a second reference data set at a second data storage. In some embodiments, the first data storage can be updated through real-time process or an offline process (e.g., a batch process). The first master reference data set includes at least one data record, and content metadata regarding the data record. In some embodiments, content metadata comprises lineage data that includes each preceding value that was contained in the data record. In some of these embodiments, lineage includes other factors that affected the present and previous values contained in the data record.
-
Citations
22 Claims
-
1. A system for maintaining master reference data relating to entities, said system comprising:
-
a master reference data storage for storing a plurality of master reference data sets based on reference data sets from different data storages, wherein each reference data set identifies a particular entity, wherein each of a plurality of the entities is identified by a plurality of reference data sets stored in a plurality of the data storages, wherein a master reference data set for a particular entity represents a most reliable reference data set stored for the particular entity in any one of the different data storages; and a master reference manager for receiving a reference data set in real time, updating the master reference data set in real time based on the received reference data set, and maintaining content metadata for the master reference data set, wherein the master reference manager includes a trust framework for updating the master reference data set in real time, wherein the master reference data set includes a set of data fields, wherein updating the master reference data set comprises changing a current value of a data field of the master reference data set when the reference data set includes another value for the data field that is more reliable than the current value of the data field. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A system for maintaining master reference data relating to entities, said system comprising:
-
a master reference data storage for storing a plurality of master reference data sets based on reference data sets from different data storages, wherein each reference data set identifies a particular entity, wherein each of a plurality of the entities is identified by a plurality of reference data sets stored in a plurality of the data storages, wherein a master reference data set for a particular entity represents a most reliable reference data set stored for the particular entity in any one of the different data storages; and a master reference manager for receiving a reference data set in real time, merging the master reference data set in real time based on the received reference data set, and maintaining content metadata for the master reference data set, wherein the master reference manager includes a trust framework for merging the master reference data set in real time, wherein the master reference data set includes a set of data fields, wherein merging the master reference data set comprises changing a current value of a data field of the master reference data set when the master reference manager identifies that the reference data set includes another value for the data field that is more reliable than the current value of the data field in the master reference data set. - View Dependent Claims (9, 10, 11, 12)
-
-
13. A system for maintaining master reference data relating to entities, said system comprising:
-
a master reference data storage for storing a plurality of master reference data sets based on reference data sets from different data storages, wherein each reference data set identifies a particular entity, wherein each of a plurality of the entities is identified by a plurality of reference data sets stored in a plurality of the data storages, wherein a master reference data set for a particular entity represents a most reliable reference data set stored for the particular entity in any one of the different data storages; and a master reference manager for receiving a reference data set in real time, modifying the master reference data set in real time based on the received reference data set, and maintaining content metadata for the master reference data set, wherein the master reference manager includes a trust framework for merging two master reference data sets in real time, wherein the master reference manager merges the two master reference data sets by determining whether the two master reference data sets include a first set of data values that match, wherein when the two master reference data sets do not include the first set of data values that match but include a second set of data values that match, the master reference manager identifies the two master reference data sets as potentially matching master reference data sets that should be examined by a system operator to determine whether the two master reference data sets are matching and need to be merged. - View Dependent Claims (14)
-
-
15. A system for maintaining master reference data relating to entities, said system comprising:
-
a master reference data storage for storing a plurality of master reference data sets based on reference data sets from different data storages, wherein each reference data set identifies a particular entity, wherein each of a plurality of the entities is identified by a plurality of reference data sets stored in a plurality of the data storages, wherein a master reference data set for a particular entity represents a most reliable reference data set stored for the particular entity in any one of the different data storages; and a master reference manager for receiving a reference data set in real time, modifying the master reference data set in real time based on the received reference data set, and maintaining content metadata for the master reference data set, wherein the master reference manager includes a trust framework for merging first and second master reference data sets into a third master reference data set, and performing an unmerge operation that divides the third master reference data set into the first and second master reference data sets that at one time were merged to produce the third master reference data set. - View Dependent Claims (16, 17)
-
-
18. A method of maintaining master reference data relating to entities, the method comprising:
-
storing a plurality of master reference data sets based on reference data sets from different data storages, wherein each of a plurality of the entities is identified by a plurality of reference data sets stored in a plurality of the data storages, wherein a master reference data set for a particular entity represents a most reliable reference data set stored for the particular entity in any one of the different data storages; receiving a reference data set in real time updating the master reference data set in real time based on the received reference data set, wherein the master reference data set includes a set of data fields, wherein updating the master reference data set comprises changing a current value of a data field of the master reference data set when the reference data set includes another value for the data field that is more reliable than the current value of the data field; and maintaining content metadata for the master reference data set. - View Dependent Claims (19, 20, 21, 22)
-
Specification