Methods for mapping data fields from one data set to another in a data processing environment
First Claim
1. In an electronic apparatus, a method of mapping a first data field of a first data set with a data field of a second data set, the method comprising the steps of:
- defining a plurality of field types for commonly used types of data fields;
associating one or more text substrings with each of said field types;
searching for said text substrings in a name for said first data field;
assigning a first field type to said first data field, based on the text substrings, if any, that are contained in said name for said first data field; and
automatically mapping said first data field to a second data field of the second data set that also has said first field type, without requiring user intervention.
1 Assignment
0 Petitions
Accused Products
Abstract
A rule-based methodology is described which supports automatic mapping of data fields between different data sets in a data processing environment. If a field cannot be mapped or matched based on name alone (e.g., an identical match), the methodology employs rules to determine a type for the field, based on the field'"'"'s name. The determined type of the field is then used for matching. The methodology can be employed to match fields which appear dissimilar phonetically but are otherwise appropriate for matching. In the currently-preferred embodiment, rules are stated in form of: regular expression=type. Here, the regular expression member lists the text strings or substring(s) for the field. The rules are ordered in descending preference according to the likelihood that a given rule will correctly identify a field. In this mariner, the methodology allows the task of mapping fields from one data set to another to be entirely automated.
-
Citations
20 Claims
-
1. In an electronic apparatus, a method of mapping a first data field of a first data set with a data field of a second data set, the method comprising the steps of:
-
defining a plurality of field types for commonly used types of data fields;
associating one or more text substrings with each of said field types;
searching for said text substrings in a name for said first data field;
assigning a first field type to said first data field, based on the text substrings, if any, that are contained in said name for said first data field; and
automatically mapping said first data field to a second data field of the second data set that also has said first field type, without requiring user intervention. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
regular expression=type wherein said regular expression comprises at least one text substring describing a particular field.
-
-
5. The method of claim 4, wherein said at least one text substring includes wild card characters for matching text strings having unknown characters.
-
6. The method of claim 5, wherein said wild card characters comprise *, <
- , > and
? characters, where “
*”
matches any string, “
<
” and
“
>
”
match the beginning and ending of a word, respectively, and “
?”
matches any character.
- , > and
-
7. The method of claim 5, wherein matching occurs in a case insensitive fashion.
-
8. The method of claim 1, wherein said plurality of field types includes at least one negative field type, for indicating that no more than one data field of the first data set should be mapped for the particular field type.
-
9. The method of claim 1, further comprising the step of:
receiving input for changing mapping of said first data field from said second data field.
-
10. The method of claim 1, further comprising the step of:
displaying a dialogue indicating that said first data field is mapped to said second data field.
-
11. The method of claim 10, further comprising the steps of:
-
receiving at the dialogue user input for changing mapping of the first data field from the second data field; and
in response to said user input, updating display of the dialogue for showing the user-specified mapping.
-
-
12. The method of claim 1, wherein the first and second data sets contain PIM data.
-
13. A synchronizing apparatus for synchronizing a first data field of a first data set with a data field of a second data set, the apparatus comprising:
-
means for defining a plurality of field types for commonly used types of data fields;
means for associating one or more text substrings with each of said field types;
means for searching for said text substrings in a name for said first data field;
means for assigning a first field type to said first data field, based on the text substrings, if any, that are contained in said name for said first data field;
means for automatically mapping said first data field to a second data field of the second data set that also has said first field type, without requiring user intervention; and
means for synchronizing said first data field of said first data set with said second data field of said second data set. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20)
regular expression=type wherein said regular expression comprises at least one text substring describing a particular field.
-
-
16. The apparatus of claim 15, wherein said at least one text substring includes wild card characters for matching text strings having unknown characters.
-
17. The apparatus of claim 16, wherein said wild card characters comprise *, <
- , > and
? characters, where “
*”
matches any string, “
<
” and
“
>
”
match the beginning and ending of a word, respectively, and “
?”
matches any character.
- , > and
-
18. The apparatus of claim 16, wherein matching occurs in a case insensitive fashion.
-
19. The apparatus of claim 13, wherein said plurality of field types includes at least one negative field type, for indicating that the apparatus should not map more than one data field of the first data set for the particular field type.
-
20. The apparatus of claim 13, further comprising:
-
means for displaying a dialogue indicating that said first data field is mapped to said second data field;
means for receiving at the dialogue user input for changing mapping of the first data field from the second data field; and
means for updating display of the dialogue for showing the user-specified mapping, in response to said user input.
-
Specification