Automatic match tuning
First Claim
1. A computer program product, tangibly embodied in an information carrier, for identifying matches between disparate schemas, the computer program product being operable to cause data processing apparatus to:
- calculate a degree of similarity between elements of two schemas using each of a plurality of matching processes;
combine the calculated degrees of similarity using a first weighting vector to produce first combined degrees of similarity, with the first weighting vector including a plurality of weighting coefficients and each weighting coefficient corresponding to one of the plurality of matching processes; and
tune the weighting coefficients using information relating to a predicted degree of matching accuracy associated with the first weighting vector.
1 Assignment
0 Petitions
Accused Products
Abstract
Methods and apparatus, including computer program products, for identifying matches between disparate schemas calculates a degree of similarity between elements of two schemas using each of multiple matching processes. The calculated degrees of similarity are combined using a first weighting vector to produce first combined degrees of similarity. The first weighting vector includes multiple weighting coefficients and each weighting coefficient corresponds to one of the matching processes. The weighting coefficients are tuned using information relating to a predicted degree of matching accuracy associated with the first weighting vector.
-
Citations
25 Claims
-
1. A computer program product, tangibly embodied in an information carrier, for identifying matches between disparate schemas, the computer program product being operable to cause data processing apparatus to:
-
calculate a degree of similarity between elements of two schemas using each of a plurality of matching processes;
combine the calculated degrees of similarity using a first weighting vector to produce first combined degrees of similarity, with the first weighting vector including a plurality of weighting coefficients and each weighting coefficient corresponding to one of the plurality of matching processes; and
tune the weighting coefficients using information relating to a predicted degree of matching accuracy associated with the first weighting vector. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A method for identifying matches between disparate schemas, the method comprising:
-
calculating a degree of similarity between elements of two schemas using each of a plurality of matching processes;
combining the calculated degrees of similarity using each of a plurality of weighting vectors, with each weighting vector including a plurality of weighting coefficients and each weighting coefficient corresponding to one of the plurality of matching processes;
determining, using the combined degrees of similarity, a level of ambiguity for each weighting vector; and
selecting a particular weighting vector to determine possible matches between the elements of the two schemas, wherein the particular weighting vector is selected based on the level of ambiguity for each weighting vector. - View Dependent Claims (9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. A method for identifying matches between disparate schemas, the method comprising:
-
calculating a degree of similarity between elements of two schemas using each of a plurality of matching processes;
combining the calculated degrees of similarity using a first weighting vector to produce first combined degrees of similarity, with the first weighting vector including a plurality of weighting coefficients and each weighting coefficient corresponding to one of the plurality of matching processes;
identifying a set of possible matches between the elements of the two schemas based on the first combined degrees of similarity;
receiving user feedback relating to a subset of the possible matches;
modifying the first weighting vector based on the user feedback to produce a second weighting vector;
combining the calculated degrees of similarity using the second weighting vector to produce second combined degrees of similarity; and
identifying a modified set of possible matches between the elements of the two schemas based on the second combined degrees of similarity. - View Dependent Claims (21)
-
-
22. A system for identifying matches between disparate schemas, the system comprising:
-
means for calculating a degree of similarity between elements of two schemas using each of a plurality of matching processes;
means for combining the calculated degrees of similarity using a first weighting vector to produce first combined degrees of similarity, with the first weighting vector including a plurality of weighting coefficients and each weighting coefficient corresponding to one of the plurality of matching processes; and
means for tuning the weighting coefficients using information relating to a predicted degree of matching accuracy associated with the first weighting vector. - View Dependent Claims (23, 24, 25)
-
Specification