Methods and apparatus for fusing databases
First Claim
1. A method of fusing first and second datasets, comprising:
determining an importance ranking of a plurality of variables associated with the first and second datasets;
generating a hierarchical matching grid including a plurality of levels based on the importance ranking of the plurality of variables, wherein each of the levels defines match criteria for satisfying a matching records condition by indicating which of the variables are to match;
identifying first and second sets of match candidates from the first and second datasets based on one of the plurality of levels of the hierarchical matching grid; and
fusing records in the first and second sets of match candidates based on probabilities associated with the records.
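As an illustration of the grid-generation step, the following is a minimal Python sketch, not the patented implementation. It assumes importance is given as a numeric score per variable and that each successive level relaxes the match criteria by dropping the least important remaining variable; the claim does not specify either detail. The function name `build_matching_grid` and the variable names are hypothetical.

```python
from typing import Dict, List


def build_matching_grid(importance: Dict[str, float]) -> List[List[str]]:
    """Return one list of must-match variables per level, strictest level first."""
    # Rank variables from most to least important; ties are broken by name
    # so the result is deterministic.
    ranked = sorted(importance, key=lambda v: (-importance[v], v))
    # Level 0 requires every variable to match. Each later level drops the
    # least important remaining variable (an assumed relaxation rule).
    return [ranked[:k] for k in range(len(ranked), 0, -1)]


# Hypothetical variables and importance scores, for illustration only.
grid = build_matching_grid({"age_group": 0.9, "region": 0.6, "income_band": 0.3})
for level, variables in enumerate(grid):
    print(level, variables)
```

Each entry of `grid` plays the role of one level: it indicates which variables must agree for two records to satisfy the matching condition at that level.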
Abstract
Methods and apparatus for fusing multiple databases into a single database are disclosed. A disclosed method determines a ranking of a plurality of matching variables associated with first and second datasets and generates a hierarchical matching grid including a plurality of levels based on the ranking of the plurality of matching variables. The example method identifies first and second sets of match candidates from the first and second datasets based on successive levels of the hierarchical matching grid and fuses records in the first and second sets of match candidates based on probabilities associated with the records.
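The flow described in the abstract can be sketched in Python as follows. This is a rough illustration under stated assumptions, not the disclosed method: the grid is supplied as a list of levels (each a list of variables that must match), each donor record carries a hypothetical `prob` field, and fusion is modeled as a weighted random draw of one donor per recipient, with donors reusable. The abstract says only that fusion is "based on probabilities associated with the records", so the draw rule, the field names, and the function name `fuse` are all assumptions.

```python
import random
from collections import defaultdict
from typing import Dict, List, Tuple

Record = Dict[str, object]


def fuse(recipients: List[Record], donors: List[Record],
         grid: List[List[str]], seed: int = 0) -> Tuple[List[Record], List[Record]]:
    """Match recipients to donors level by level; return (fused, unmatched)."""
    rng = random.Random(seed)
    fused: List[Record] = []
    unmatched = list(recipients)
    for level in grid:  # strictest level first, then successively looser ones
        # Group donors by their values on the variables this level requires.
        cells = defaultdict(list)
        for d in donors:
            cells[tuple(d[v] for v in level)].append(d)
        still_unmatched = []
        for r in unmatched:
            candidates = cells.get(tuple(r[v] for v in level))
            if not candidates:
                still_unmatched.append(r)  # retry at the next level
                continue
            # Assumed fusion rule: draw one donor, weighted by its "prob".
            donor = rng.choices(candidates,
                                weights=[c["prob"] for c in candidates], k=1)[0]
            merged = {k: v for k, v in donor.items() if k != "prob"}
            merged.update(r)  # recipient values win on shared variables
            fused.append(merged)
        unmatched = still_unmatched
    return fused, unmatched


# Hypothetical toy data, for illustration only.
recipients = [
    {"rid": "r1", "age_group": "18-34", "region": "north", "tv_hours": 12},
    {"rid": "r2", "age_group": "35-54", "region": "south", "tv_hours": 20},
    {"rid": "r3", "age_group": "55+", "region": "west", "tv_hours": 30},
]
donors = [
    {"did": "d1", "age_group": "18-34", "region": "north", "purchases": 5, "prob": 1.0},
    {"did": "d2", "age_group": "35-54", "region": "north", "purchases": 2, "prob": 1.0},
]
grid = [["age_group", "region"], ["age_group"]]
fused, unmatched = fuse(recipients, donors, grid)
```

In this toy data, `r1` finds a donor at the strictest level, `r2` only after `region` is dropped at the second level, and `r3` remains unmatched because no donor shares its `age_group`.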
20 Claims
1. A method of fusing first and second datasets, comprising:
determining an importance ranking of a plurality of variables associated with the first and second datasets;
generating a hierarchical matching grid including a plurality of levels based on the importance ranking of the plurality of variables, wherein each of the levels defines match criteria for satisfying a matching records condition by indicating which of the variables are to match;
identifying first and second sets of match candidates from the first and second datasets based on one of the plurality of levels of the hierarchical matching grid; and
fusing records in the first and second sets of match candidates based on probabilities associated with the records.
Dependent claims: 2, 3, 4, 5, 6, 7, 8
9. A system for fusing first and second datasets, comprising:
a memory; and
a processor coupled to the memory and configured to:
determine an importance ranking of a plurality of variables associated with the first and second datasets;
generate a hierarchical matching grid including a plurality of levels based on the importance ranking of the plurality of variables, wherein each of the levels defines match criteria for satisfying a matching records condition by indicating which of the variables are to match;
identify first and second sets of match candidates from the first and second datasets based on one of the plurality of levels of the hierarchical matching grid; and
fuse records in the first and second sets of match candidates based on probabilities associated with the records.
Dependent claims: 10, 11, 12, 13, 14, 15, 16
17. A machine readable medium having instructions stored thereon that, when executed, cause a machine to:
determine an importance ranking of a plurality of variables associated with first and second datasets;
generate a hierarchical matching grid including a plurality of levels based on the importance ranking of the plurality of variables, wherein each of the levels defines match criteria for satisfying a matching records condition by indicating which of the variables are to match;
identify first and second sets of match candidates from the first and second datasets based on one of the plurality of levels of the hierarchical matching grid; and
fuse records in the first and second sets of match candidates based on probabilities associated with the records.
Dependent claims: 18, 19, 20
Specification