Node clustering
First Claim
1. A method, implemented at least in part via a processing unit, for identifying one or more node clusters, comprising:
- receiving a set of node pairings corresponding to a plurality of nodes, a node pairing comprising a first node connected to a second node; and
transforming the set of node pairings one or more times until one or more node clusters are identified from the plurality of nodes, where nodes within a node cluster are paired with a base node within the plurality of nodes by a single connection and are connected to no other nodes, the transforming comprising;
determining a current node, within the plurality of nodes, that is paired with two or more neighboring nodes, the two or more neighboring nodes comprising a first neighboring node and a second neighboring node;
comparing a first value comprised in the first neighboring node to a second value comprised in the second neighboring node;
determining that the first value is smaller than the second value based upon the comparing;
determining that the first neighboring node is a reference node based upon the determination that the first value is smaller than the second value; and
based upon the determination that the first value is smaller than the second value;
disconnecting the second neighboring node, but not the first neighboring node, which is the reference node, from the current node; and
connecting the second neighboring node to the first neighboring node,the current node, the first neighboring node and the second neighboring node representing a common type, the common type comprising at least one of;
a user ID;
a login ID;
a cookie ID;
a mobile phone ID;
oran IP address.
2 Assignments
0 Petitions
Accused Products
Abstract
Large sets of unorganized data may provide little value in identifying useful observations from such data. For example, an online merchant may maintain a database of millions of user IDs (e.g., a cookie ID, a login ID, a device ID, a network ID, etc.) along with content viewed and/or actions taken with the user IDs, where minimal associations are known between user IDs. It may be advantageous to link together user IDs of respective users to capture a comprehensive view of respective users'"'"' activities. Accordingly, one or more systems and/or techniques for identifying a cluster of nodes based upon transforming a set of node pairings (e.g., pairings of related nodes) one or more times are disclosed herein. Iterative transformations may be performed until respective nodes are paired with merely their smallest neighboring node and are paired with no other node. In this way, node clusters may be identifiable.
-
Citations
20 Claims
-
1. A method, implemented at least in part via a processing unit, for identifying one or more node clusters, comprising:
-
receiving a set of node pairings corresponding to a plurality of nodes, a node pairing comprising a first node connected to a second node; and transforming the set of node pairings one or more times until one or more node clusters are identified from the plurality of nodes, where nodes within a node cluster are paired with a base node within the plurality of nodes by a single connection and are connected to no other nodes, the transforming comprising; determining a current node, within the plurality of nodes, that is paired with two or more neighboring nodes, the two or more neighboring nodes comprising a first neighboring node and a second neighboring node; comparing a first value comprised in the first neighboring node to a second value comprised in the second neighboring node; determining that the first value is smaller than the second value based upon the comparing; determining that the first neighboring node is a reference node based upon the determination that the first value is smaller than the second value; and based upon the determination that the first value is smaller than the second value; disconnecting the second neighboring node, but not the first neighboring node, which is the reference node, from the current node; and connecting the second neighboring node to the first neighboring node, the current node, the first neighboring node and the second neighboring node representing a common type, the common type comprising at least one of; a user ID; a login ID; a cookie ID; a mobile phone ID;
oran IP address. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A system for identifying one or more node clusters, comprising:
a transformation module configured to; receive a set of node pairings corresponding to a plurality of nodes, a node pairing comprising a first node connected to a second node; transform the set of node pairings one or more times until one or more node clusters are identified from the plurality of nodes, where nodes within a node cluster are paired with a base node within the plurality of nodes by a single connection and are connected to no other nodes, a transformation comprising; determining a current node, within the plurality of nodes, that is paired with two or more neighboring nodes, the two or more neighboring nodes comprising a first neighboring node and a second neighboring node; comparing a first value comprised in the first neighboring node to a second value comprised in the second neighboring node; determining that the first value is smaller than the second value based upon the comparing; determining that the first neighboring node is a reference node based upon the determination that the first value is smaller than the second value; and based upon the determination that the first value is smaller than the second value; disconnecting the second neighboring node, but not the first neighboring node, which is the reference node, from the current node; and connecting the second neighboring node to the first neighboring node, the current node, the first neighboring node and the second neighboring node representing a common type, the common type comprising at least one of; a user ID; a login ID; a cookie ID; a mobile phone ID;
oran IP address. - View Dependent Claims (14, 15, 16, 17, 18, 19)
-
20. A computer readable storage device comprising instructions that when executed, perform a method for identifying a cluster of nodes, comprising:
-
receiving a set of node pairings corresponding to a plurality of nodes, a node pairing comprising a first node connected to a second node; and transforming the set of node pairings one or more times until one or more node clusters are identified from the plurality of nodes, where nodes within a node cluster are paired with a base node within the plurality of nodes by a single connection and are connected to no other nodes, the transforming comprising; determining a current node, within the plurality of nodes, that is paired with two or more neighboring nodes, the two or more neighboring nodes comprising a first neighboring node and a second neighboring node; comparing a first value comprised in the first neighboring node to a second value comprised in the second neighboring node; determining that the first value is smaller than the second value based upon the comparing; determining that the first neighboring node is a reference node based upon the determination that the first value is smaller than the second value; and based upon the determination that the first value is smaller than the second value; disconnecting the second neighboring node, but not the first neighboring node, which is the reference node, from the current node; and connecting the second neighboring node to the first neighboring node, the current node, the first neighboring node and the second neighboring node representing a common type, the common type comprising at least one of; a user ID; a login ID; a cookie ID; a mobile phone ID;
oran IP address.
-
Specification