Systems and methods for node deduplication based on a node merging policy
First Claim
1. A method, comprising:
- maintaining, by one or more processors, a first node profile including a plurality of first field-value pairs, each first field-value pair of the plurality of first field-value pairs identifying a corresponding first field and a corresponding first value;
maintaining, by the one or more processors, a plurality of second node profiles, each second node profile of the plurality of second node profiles including a plurality of second field-value pairs, each second field-value pair of the plurality of second field-value pairs identifying a corresponding second field and a corresponding second value;
assigning, by the one or more processors, to each first field-value pair of the first node profile, a respective weight based on i) a number of second field-value pairs included in the plurality of second node profiles that match the first field-value pair of the first node profile or ii) a confidence score of the first value of the first field-value pair, the confidence score based on a number of data points that i) identify the first field-value pair and ii) are stored in association with the first node profile;
determining, by the one or more processors, using the respective weight assigned to each first field-value pair of the first node profile, that the first node profile and a candidate second node profile of the plurality of second node profiles satisfy a node profile merging policy; and
merging, by the one or more processors, the first node profile and the candidate second node profile responsive to determining that the first node profile and the candidate second node profile satisfy the node profile merging policy.
2 Assignments
0 Petitions
Accused Products
Abstract
The present disclosure relates to node deduplication based on a node merging policy. A plurality of node profiles may be maintained. Each node profile may include plurality of fields. Each field of the plurality of fields may include one or more values. For a node profile including one or more field-value pairs, each field-value pair may be identified. Each field-value pair may include a value of a field. A respective weight may be assigned to each field-value pair based on a number of field-value pairs or a confidence score. Based on the respective weights assigned to each field-value pair, it may be determined that node profiles satisfy a node profile merging policy. The node profiles may be merged based on the node profiles satisfying the node profile merging policy. Field-value pairs including a value of a field may be identified.
375 Citations
20 Claims
-
1. A method, comprising:
-
maintaining, by one or more processors, a first node profile including a plurality of first field-value pairs, each first field-value pair of the plurality of first field-value pairs identifying a corresponding first field and a corresponding first value; maintaining, by the one or more processors, a plurality of second node profiles, each second node profile of the plurality of second node profiles including a plurality of second field-value pairs, each second field-value pair of the plurality of second field-value pairs identifying a corresponding second field and a corresponding second value; assigning, by the one or more processors, to each first field-value pair of the first node profile, a respective weight based on i) a number of second field-value pairs included in the plurality of second node profiles that match the first field-value pair of the first node profile or ii) a confidence score of the first value of the first field-value pair, the confidence score based on a number of data points that i) identify the first field-value pair and ii) are stored in association with the first node profile; determining, by the one or more processors, using the respective weight assigned to each first field-value pair of the first node profile, that the first node profile and a candidate second node profile of the plurality of second node profiles satisfy a node profile merging policy; and merging, by the one or more processors, the first node profile and the candidate second node profile responsive to determining that the first node profile and the candidate second node profile satisfy the node profile merging policy. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A system comprising:
one or more processors configured by machine-readable instructions to; maintain a first node profile including a plurality of first field-value pairs, each first field-value pair of the plurality of first field-value pairs identifying a corresponding first field and a corresponding first value; maintain a plurality of second node profiles, each second node profile of the plurality of second node profiles including a plurality of second field-value pairs, each second field-value pair of the plurality of second field-value pairs identifying a corresponding second field and a corresponding second value; assign, to each first field-value pair of the first node profile, a respective weight based on i) a number of second field-value pairs included in the plurality of second node profiles that match the first field-value pair of the first node profile or ii) a confidence score of the first value of the first field-value pair, the confidence score based on a number of data points that i) identify the first field-value pair and ii) are stored in association with the first node profile; determine, using the respective weight assigned to each first field-value pair of the first node profile, that the first node profile and candidate second node profile of the plurality of second node profiles satisfy a node profile merging policy; and merge the first node profile and the candidate second node profile responsive to determining that the first node profile and the candidate second node profile satisfy the node profile merging policy. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19)
-
20. A non transitory computer-readable storage medium having instructions embodied thereon, the instructions being executable by one or more processors to:
-
maintain a first node profile including a plurality of first field-value pairs, each first field-value pair of the plurality of first field-value pairs identifying a corresponding first field and a corresponding first value; maintain a plurality of second node profiles, each second node profile of the plurality of second node profiles including a plurality of second field-value pairs, each second field-value pair of the plurality of second field-value pairs identifying a corresponding second field and a corresponding second value; assign, to each first field-value pair of the first node profile, a respective weight based on i) a number of second field-value pairs included in the plurality of second node profiles that match the first field-value pair of the first node profile or ii) a confidence score of the first value of the first field-value pair, the confidence score based on a number of data points that i) identify the first field-value pair and ii) are stored in association with the first node profile; determine, using the respective weight assigned to each first field-value pair of the first node profile, that the first node profile and candidate second node profile of the plurality of second node profiles satisfy a node profile merging policy; and merge the first node profile and the candidate second node profile responsive to determining that the first node profile and the candidate second node profile satisfy the node profile merging policy.
-
Specification