Methods for mapping data fields from one data set to another in a data processing environment
First Claim
1. In a data processing system, a method for mapping data fields from a source data set to a destination data set, in the method comprising:
- defining a set of rules comprising a plurality of field types, wherein each field type is associated with one or more string or substring expressions specifying a field name for facilitating data field mapping without user intervention;
ordering said rules according to how likely a given rule will correctly identify a field;
receiving a source data set comprising a plurality of data fields;
without requiring user intervention, mapping data fields of the source data set to data fields of the destination data set by determining a field type for each source data field, based on application of the rules; and
synchronizing the source data set with the destination data set, so that values from data fields of the source data set are synchronized with field values from corresponding data fields of the destination data set.
2 Assignments
0 Petitions
Accused Products
Abstract
A rule-based methodology is described which supports automatic mapping of data fields between different data sets in a data processing environment. If a field cannot be mapped or matched based on name alone (e.g., an identical match), the methodology employs rules to determine a type for the field, based on the field'"'"'s name. The determined type of the field is then used for matching. The methodology can be employed to match fields which appear dissimilar phonetically but are otherwise appropriate for matching. In the currently-preferred embodiment, rules are stated in form of: regular expression=type. Here, the regular expression member lists the text strings or substring(s) for the field. The rules are ordered in descending preference according to the likelihood that a given rule will correctly identify a field. In this manner, the methodology allows the task of mapping fields from one data set to another to be entirely automated.
-
Citations
25 Claims
-
1. In a data processing system, a method for mapping data fields from a source data set to a destination data set, in the method comprising:
-
defining a set of rules comprising a plurality of field types, wherein each field type is associated with one or more string or substring expressions specifying a field name for facilitating data field mapping without user intervention;
ordering said rules according to how likely a given rule will correctly identify a field;
receiving a source data set comprising a plurality of data fields;
without requiring user intervention, mapping data fields of the source data set to data fields of the destination data set by determining a field type for each source data field, based on application of the rules; and
synchronizing the source data set with the destination data set, so that values from data fields of the source data set are synchronized with field values from corresponding data fields of the destination data set. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
regular expression=type wherein said regular expression comprises at least one text substring describing a particular field.
-
-
4. The method of claim 3, wherein said at least one text substring includes wild card characters for matching text strings having unknown characters.
-
5. The method of claim 4, wherein said wild card characters comprise *, <
- ,> and
? characters, where “
*”
matches any string, “
<
” and
“
>
”
match the beginning and ending of a word, respectively, and “
?”
matches any character.
- ,> and
-
6. The method of claim 5, wherein matching occurs in a case insensitive fashion.
-
7. The method of claim 1, wherein said plurality of field types includes at least one negative field type, for indicating that the system should not map more than one data field of the source data set for a particular field type.
-
8. The method of claim 1, further comprising:
receiving input for changing mapping of at least one data field from that which has been automatically selected by the system.
-
9. The method of claim 1, further comprising:
displaying a dialogue indicating how data fields are mapped between the source data set and the destination data set.
-
10. The method of claim 9, further comprising:
-
receiving at the dialogue user input for changing mapping of at least one data field from that which has been automatically selected by the system; and
in response to said user input, updating display of the dialogue for showing the user-specified mapping.
-
-
11. A data processing system providing automatic mapping of data fields from one data set to another comprising:
-
a computer capable of storing and processing a source data set and a destination data set;
means for storing a set of rules comprising a plurality of field types, wherein each field type is associated with a particular type of field usually present in user data for facilitating data field mapping without user intervention;
means for ordering said rules according to how likely a given rule will correctly identify a field;
means for mapping data fields of the source data set to data fields of the destination data set without user intervention by associating each data field of the source data set with a corresponding data field of the destination data set, based on a field type determined for each data field from application of the rules; and
means for synchronizing the source data set with the destination data set, so that values from data fields of the source data set are synchronized with field values from associated data fields of the destination data set. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20)
regular expression=type wherein said regular expression comprises at least one text substring describing a particular field.
-
-
14. The system of claim 13, wherein said at least one text substring includes wild card characters for matching text strings having unknown characters.
-
15. The system of claim 14, wherein said wild card characters comprise *, <
- , > and
? characters, where “
*”
matches any string, “
<
” and
“
>
”
match the beginning and ending of a word, respectively, and “
?”
matches any character.
- , > and
-
16. The system of claim 15, wherein matching is performed by the system in a case insensitive fashion.
-
17. The system of claim 11, wherein said plurality of field types includes at least one negative field type, for indicating that the system should not map more than one data field of the source data set for a particular field type.
-
18. The system of claim 11, further comprising:
input means for changing mapping of at least one data field from that which has been automatically selected by the system.
-
19. The system of claim 11, further comprising:
means for displaying a dialogue indicating how data fields are mapped between the source data set and the destination data set.
-
20. The system of claim 19, further comprising:
-
means for receiving at the dialogue user input for changing mapping of at least one data field from that which has been automatically selected by the system; and
means, responsive to said user input, for updating display of the dialogue for showing the user-specified mapping.
-
-
21. A storage device storing a computer program providing methodology for automatic mapping of data fields among multiple devices, the computer program comprising:
-
program steps for storing mapping rules specifying how each field type present in a source data set is to be mapped into a destination data set for facilitating data field mapping without user intervention, wherein each field type is associated with a particular type of data field usually present in user data;
program steps for ordering said rules according to how likely a given rule will correctly identify a field; and
program steps for mapping data fields of the source data set to data fields of the destination data set without user intervention by associating, based on said mapping rules, each data field of the source data set with a corresponding data field of the destination data set, so that during synchronization of the source data set with the destination data set, values from data fields of the source data set are synchronized with values from associated data fields of the destination data set. - View Dependent Claims (22, 23, 24, 25)
regular expression=type wherein said regular expression comprises at least one text substring matching a name of a particular data field.
-
-
23. The storage device of claim 21, wherein said mapping rules for a plurality of devices are stored in a table, such that each row of the table specifies a mapping rule for a particular type of data field present among the multiple devices.
-
24. The storage device of claim 23, wherein each mapping rule comprises a text expression suitable for determining a type of data field based on the data field'"'"'s name.
-
25. The storage device of claim 24, wherein said text expression comprises at least one text substring correlating to an expected name for a particular type of data field.
Specification