MACHINE-ASSISTED OBJECT MATCHING
First Claim
1. A method for managing data using one or more processors, included in one or more network computers, to perform actions, comprising:
- providing a principal data set and another data set, wherein the principal data set includes a plurality of principal objects and the other data set includes a plurality of other objects;
instantiating a first engine to perform actions including;
associating each of the plurality of principal objects with one or more other objects that are selected from the plurality of other objects, wherein the one or more other objects are selected as potential matches to one or more of the principal objects;
selecting one or more principal objects of the plurality of principal objects based on one or more heuristics and the one or more selected principal objects association with selected other objects; and
labeling one or more of the selected other objects as either a match or a non-match of their associated principal object; and
instantiating a second engine to perform actions, including;
training a ranker to identify a selected other object that is top-ranked in similarity to its association with the one or more principal objects based on the labeling of the one or more selected other objects; and
employing the trained ranker to indicate a match among the plurality of principal objects and the plurality of other objects, wherein each matched other object is a selected other object that is ranked higher than selected other objects associated with a same principal object.
2 Assignments
0 Petitions
Accused Products
Abstract
Embodiments are directed towards managing data using modeling platform. A principal data set and match data set may be provided. The principal data set may include principal objects and the match data set may include other model objects. Blocking may associate each of the plurality of principal objects with potential match objects. Principal objects from the plurality of principal objects may be selected based on heuristics, their associated potential match objects, or the like. Potential match objects may be labeled as a true match or a non-match with respect to their associated principal object. A ranker may be trained to identify top-ranked matches based on the labeled potential match objects. The trained ranker may be employed to indicate matches where each matching other model object is a potential match object that is ranked higher than other potential match objects that are associated with its matched principal object.
35 Citations
30 Claims
-
1. A method for managing data using one or more processors, included in one or more network computers, to perform actions, comprising:
-
providing a principal data set and another data set, wherein the principal data set includes a plurality of principal objects and the other data set includes a plurality of other objects; instantiating a first engine to perform actions including; associating each of the plurality of principal objects with one or more other objects that are selected from the plurality of other objects, wherein the one or more other objects are selected as potential matches to one or more of the principal objects; selecting one or more principal objects of the plurality of principal objects based on one or more heuristics and the one or more selected principal objects association with selected other objects; and labeling one or more of the selected other objects as either a match or a non-match of their associated principal object; and instantiating a second engine to perform actions, including; training a ranker to identify a selected other object that is top-ranked in similarity to its association with the one or more principal objects based on the labeling of the one or more selected other objects; and employing the trained ranker to indicate a match among the plurality of principal objects and the plurality of other objects, wherein each matched other object is a selected other object that is ranked higher than selected other objects associated with a same principal object. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A system for managing data, comprising:
-
a network computer, comprising; a transceiver that communicates over the network; a memory that stores at least instructions; and one or more processor devices that execute instructions that perform actions, including; providing a principal data set and another data set, wherein the principal data set includes a plurality of principal objects and the other data set includes a plurality of other objects; instantiating a first engine to perform actions including; associating each of the plurality of principal objects with one or more other objects that are selected from the plurality of other objects, wherein the one or more other objects are selected as potential matches to one or more of the principal objects; selecting one or more principal objects of the plurality of principal objects based on one or more heuristics and the one or more selected principal objects association with selected other objects; and labeling one or more of the selected other objects as either a match or a non-match of their associated principal object; and instantiating a second engine to perform actions, including; training a ranker to identify a selected other object that is top-ranked in similarity to its association with the one or more principal objects based on the labeling of the one or more selected other objects; and employing the trained ranker to indicate a match among the plurality of principal objects and the plurality of other objects, wherein each matched other object is a selected other object that is ranked higher than selected other objects associated with a same principal object; and a client computer, comprising; a client computer transceiver that communicates over the network; a client computer memory that stores at least instructions; and one or more processor devices that execute instructions that perform actions, including; displaying one or more of the one or more principal objects and the one or more selected other objects on a display of the client computer, wherein the displayed selected other objects are associated with a same principal object. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
-
17. A processor readable non-transitory storage media that includes instructions for managing data, wherein execution of the instructions by one or more hardware processors performs actions, comprising:
-
providing a principal data set and another data set, wherein the principal data set includes a plurality of principal objects and the other data set includes a plurality of other objects; instantiating a first engine to perform actions including; associating each of the plurality of principal objects with one or more other objects that are selected from the plurality of other objects, wherein the one or more other objects are selected as potential matches to one or more of the principal objects; selecting one or more principal objects of the plurality of principal objects based on one or more heuristics and the one or more selected principal objects association with selected other objects; and labeling one or more of the selected other objects as either a match or a non-match of their associated principal object; and instantiating a second engine to perform actions, including; training a ranker to identify a selected other object that is top-ranked in similarity to its association with the one or more principal objects based on the labeling of the one or more selected other objects; and employing the trained ranker to indicate a match among the plurality of principal objects and the plurality of other objects, wherein each matched other object is a selected other object that is ranked higher than selected other objects associated with a same principal object. - View Dependent Claims (18, 19, 20, 21, 22, 23)
-
-
24. A network computer for managing data, comprising:
-
a transceiver that communicates over the network; a memory that stores at least instructions; and one or more processor devices that execute instructions that perform actions, including; providing a principal data set and another data set, wherein the principal data set includes a plurality of principal objects and the other data set includes a plurality of other objects; instantiating a first engine to perform actions including; associating each of the plurality of principal objects with one or more other objects that are selected from the plurality of other objects, wherein the one or more other objects are selected as potential matches to one or more of the principal objects; selecting one or more principal objects of the plurality of principal objects based on one or more heuristics and the one or more selected principal objects association with selected other objects; and labeling one or more of the selected other objects as either a match or a non-match of their associated principal object; and instantiating a second engine to perform actions, including; training a ranker to identify a selected other object that is top-ranked in similarity to its association with the one or more principal objects based on the labeling of the one or more selected other objects; and employing the trained ranker to indicate a match among the plurality of principal objects and the plurality of other objects, wherein each matched other object is a selected other object that is ranked higher than selected other objects associated with a same principal object. - View Dependent Claims (25, 26, 27, 28, 29, 30)
-
Specification