DETERMINING DATA ELEMENTS IN HETEROGENEOUS SCHEMA DEFINITIONS FOR POSSIBLE MAPPING
First Claim
1. A method of generating a plurality of synonym pairs from a first schema and a second schema, said first schema including a first plurality of elements according to a first hierarchy and said second schema including a second plurality of elements according to a second hierarchy, wherein each synonym pair contains a leaf element from each of said first schema and said second schema and wherein the two leaf elements correspond to each other, said method comprising:
- receiving a first data indicating that a pair of non-leaf elements are structurally similar, said pair of non-leaf elements containing a first non-leaf element and a second non-leaf element respectively contained in said first schema and said second schema; and
computing a probability of possible match between a first leaf element and a second leaf element respectively contained in said first schema and said second schema, wherein said probability of possible match as a synonym pair is greater if said first leaf element is in a branch from said first non-leaf element in said first hierarchy and said second leaf element is in another branch from said second non-leaf element in said second hierarchy, than otherwise.
1 Assignment
0 Petitions
Accused Products
Abstract
Determining data elements for possible mapping in heterogeneous schema definitions. According to one aspect of the present invention, a user indicates whether two non-leaf elements (in respective schemas) are structurally similar, and the probability of possible match of a first element (in a first schema) and a second element (in a second schema) as a synonym pair is computed to be more if the two elements are below the respective ones of the structurally similar nodes, compared to in a situation in which the elements are not present in such hierarchies.
-
Citations
8 Claims
-
1. A method of generating a plurality of synonym pairs from a first schema and a second schema, said first schema including a first plurality of elements according to a first hierarchy and said second schema including a second plurality of elements according to a second hierarchy, wherein each synonym pair contains a leaf element from each of said first schema and said second schema and wherein the two leaf elements correspond to each other, said method comprising:
-
receiving a first data indicating that a pair of non-leaf elements are structurally similar, said pair of non-leaf elements containing a first non-leaf element and a second non-leaf element respectively contained in said first schema and said second schema; and
computing a probability of possible match between a first leaf element and a second leaf element respectively contained in said first schema and said second schema, wherein said probability of possible match as a synonym pair is greater if said first leaf element is in a branch from said first non-leaf element in said first hierarchy and said second leaf element is in another branch from said second non-leaf element in said second hierarchy, than otherwise. - View Dependent Claims (2, 3, 4)
-
-
5. A computer readable medium carrying one or more sequences of instructions causing a system to generate a plurality of synonym pairs from a first schema and a second schema, said first schema including a first plurality of elements according to a first hierarchy and said second schema including a second plurality of elements according to a second hierarchy, wherein each synonym pair contains a leaf element from each of said first schema and said second schema and wherein the two leaf elements correspond to each other, and execution of said one or more sequences of instructions by one or more processors contained in said server causes said one or more processors to perform the actions of:
-
receiving a first data indicating that a pair of non-leaf elements are structurally similar, said pair of non-leaf elements containing a first non-leaf element and a second non-leaf element respectively contained in said first schema and said second schema; and
computing a probability of possible match between a first leaf element and a second leaf element respectively contained in said first schema and said second schema, wherein said probability of possible match as a synonym pair is greater if said first leaf element is in a branch from said first non-leaf element in said first hierarchy and said second leaf element is in another branch from said second non-leaf element in said second hierarchy, than otherwise. - View Dependent Claims (6, 7, 8)
-
Specification