Methods for mapping data fields from one data set to another in a data processing environment
Abstract
A rule-based methodology is described which supports automatic mapping of data fields between different data sets in a data processing environment. If a field cannot be mapped or matched based on name alone (e.g., by an identical match), the methodology employs rules to determine a type for the field, based on the field's name. The determined type of the field is then used for matching. The methodology can be employed to match fields which appear dissimilar phonetically but are otherwise appropriate for matching. In the currently-preferred embodiment, rules are stated in the form: regular expression = type. Here, the regular expression member lists the text string(s) or substring(s) for the field. The rules are ordered in descending preference according to the likelihood that a given rule will correctly identify a field. In this manner, the methodology allows the task of mapping fields from one data set to another to be entirely automated.
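As an illustration of the "regular expression = type" rule form described in the abstract, the following is a minimal sketch in Python. The concrete rule syntax, the patterns, the type names, and the helpers `parse_rules` and `field_type` are all assumptions made for this example; the abstract does not give them. Rules are listed most-reliable first and the first matching rule decides the type.

```python
import re

# Hypothetical rule text in the "regular expression = type" form.
# Rules appear in descending order of how likely they are to identify
# a field correctly: specific patterns first, the generic "name" last.
RULE_TEXT = """
e-?mail = email
company|organi[sz]ation = company
first|given = first_name
last|sur|family = last_name
phone|tel|mobile = phone
name = full_name
"""

def parse_rules(text):
    """Parse 'regex = type' lines into (compiled pattern, type) pairs, keeping order."""
    rules = []
    for line in text.strip().splitlines():
        # Split on the last '=' so a pattern may itself contain '='.
        pattern, _, ftype = line.rpartition("=")
        rules.append((re.compile(pattern.strip(), re.IGNORECASE), ftype.strip()))
    return rules

RULES = parse_rules(RULE_TEXT)

def field_type(name):
    """Return the type of the first rule whose pattern occurs in the field name."""
    for pattern, ftype in RULES:
        if pattern.search(name):
            return ftype
    return None  # no rule applies; the field stays unmapped
```

Under these assumed rules, "Surname" and "Family Name" both resolve to `last_name` although the names share no text, and "Company Name" resolves to `company` rather than `full_name` because the more specific rule is tried first.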
25 Claims
1. In a data processing system, a method for mapping data fields from a source data set to a destination data set, the method comprising:
defining a set of rules comprising a plurality of field types, wherein each field type is associated with one or more string or substring expressions specifying a field name;
ordering said rules according to how likely a given rule will correctly identify a field;
receiving a source data set comprising a plurality of data fields;
mapping data fields of the source data set to data fields of the destination data set by determining a field type for each source data field, based on application of the rules; and
synchronizing the source data set with the destination data set, so that values from data fields of the source data set are synchronized with field values from corresponding data fields of the destination data set.
(Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10)
11. A data processing system providing automatic mapping of data fields from one data set to another comprising:
a computer capable of storing and processing a source data set and a destination data set;
means for storing a set of rules comprising a plurality of field types, wherein each field type is associated with a particular type of field usually present in user data;
means for ordering said rules according to how likely a given rule will correctly identify a field;
means for mapping data fields of the source data set to data fields of the destination data set by associating each data field of the source data set with a corresponding data field of the destination data set, based on a field type determined for each data field from application of the rules; and
means for synchronizing the source data set with the destination data set, so that values from data fields of the source data set are synchronized with field values from associated data fields of the destination data set.
(Dependent claims: 12, 13, 14, 15, 16, 17, 18, 19, 20)
21. A storage device storing a computer program providing methodology for automatic mapping of data fields among multiple devices, the computer program comprising:
program steps for storing mapping rules specifying how each field type present in a source data set is to be mapped into a destination data set, wherein each field type is associated with a particular type of data field usually present in user data;
program steps for ordering said rules according to how likely a given rule will correctly identify a field; and
program steps for mapping data fields of the source data set to data fields of the destination data set by associating, based on said mapping rules, each data field of the source data set with a corresponding data field of the destination data set, so that during synchronization of the source data set with the destination data set, values from data fields of the source data set are synchronized with values from associated data fields of the destination data set.
(Dependent claims: 22, 23, 24, 25)
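The mapping and synchronizing steps recited in the independent claims can be sketched as follows. This is only an illustration under stated assumptions: the rule list, the field names, and the helpers `field_type`, `map_fields`, and `synchronize` are hypothetical, and synchronization is simplified here to a one-way copy from source to destination, which the claims do not require.

```python
import re

# Hypothetical ordered rules (most reliable first); not taken from the patent.
RULES = [
    (re.compile(r"e-?mail", re.IGNORECASE), "email"),
    (re.compile(r"first|given", re.IGNORECASE), "first_name"),
    (re.compile(r"last|sur|family", re.IGNORECASE), "last_name"),
    (re.compile(r"phone|tel|mobile", re.IGNORECASE), "phone"),
]

def field_type(name):
    """Type of the first rule matching the field name, or None."""
    for pattern, ftype in RULES:
        if pattern.search(name):
            return ftype
    return None

def map_fields(source_fields, dest_fields):
    """Return {source name: destination name}.

    Identical names are paired directly; otherwise fields are paired
    when the rules assign them the same type.
    """
    dest_by_type = {}
    for dest in dest_fields:
        ftype = field_type(dest)
        if ftype is not None:
            dest_by_type.setdefault(ftype, dest)  # first destination field of a type wins
    mapping = {}
    for src in source_fields:
        if src in dest_fields:
            mapping[src] = src
        else:
            ftype = field_type(src)
            if ftype in dest_by_type:
                mapping[src] = dest_by_type[ftype]
    return mapping

def synchronize(source_record, dest_record, mapping):
    """Simplified one-way sync: copy mapped source values into a copy of the destination."""
    updated = dict(dest_record)
    for src, dest in mapping.items():
        if src in source_record:
            updated[dest] = source_record[src]
    return updated

# Usage with made-up records.
source = {"Surname": "Lee", "E-Mail": "a@b.c", "Notes": "x"}
dest = {"Last Name": "", "Email Address": "", "Notes": ""}
mapping = map_fields(list(source), list(dest))
synced = synchronize(source, dest, mapping)
```

In this sketch a source field that matches no rule and has no identically named destination field is simply left unmapped; a real system would need a policy for such fields and for conflicting values on both sides.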
Specification