System, method and computer program for analysing and visualising data
First Claim
1. A method for identifying clusters within a network including a plurality of nodes and links, comprising the steps of:
- determining a link value for each node in the network, with each node being characterized such that it does not require the sharing of an attribute value to be clustered;
- determining a local maxima within the network by locating node values where the sum of link values are higher than the sum of link values for all adjacent nodes;
- determining a list of the shortest pathways between a local maximum and all other nodes in the network, wherein a pathway embodies a rate of change of link values and is able to traverse a plurality of linked nodes;
- collapsing the pathways in accordance with a predetermined algorithm; and
- associating all nodes that remain connected to each local maximum along the pathways.
Abstract
A method for identifying clusters within a network including a plurality of nodes and links, comprising the steps of:
- determining a link value for each node in the network;
- determining a local maxima within the network by locating node values where the sum of link values are higher than the sum of link values for all adjacent nodes;
- determining a list of the shortest pathways between a local maximum and all other nodes in the network;
- collapsing the pathways in accordance with a predetermined algorithm; and
- associating all nodes that remain connected to each local maximum along the pathways.
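The steps listed above can be illustrated with a minimal Python sketch. It is not the patented implementation. Several choices are assumptions made only for illustration: the link value of a node is taken to be the sum of the weights of its links, pathways are measured in hop counts rather than by a rate of change of link values, and the "predetermined algorithm" for collapsing pathways, which the text does not specify, is replaced by a simple rule that keeps each node attached to its nearest local maximum (ties going to the maximum with the larger link value). The function names `link_values`, `local_maxima`, `shortest_hops` and `cluster` are hypothetical.

```python
from collections import deque


def link_values(graph):
    """Assumed definition: a node's link value is the sum of its link weights."""
    return {node: sum(nbrs.values()) for node, nbrs in graph.items()}


def local_maxima(graph, values):
    """Nodes whose link value is strictly higher than that of every adjacent node."""
    return [
        node
        for node, nbrs in graph.items()
        if nbrs and all(values[node] > values[nbr] for nbr in nbrs)
    ]


def shortest_hops(graph, source):
    """Breadth-first search: hop count of the shortest pathway to each reachable node."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        for nbr in graph[node]:
            if nbr not in dist:
                dist[nbr] = dist[node] + 1
                queue.append(nbr)
    return dist


def cluster(graph):
    """Group every reachable node with one local maximum.

    The collapsing rule used here (nearest maximum wins, ties broken by the
    larger link value) is an illustrative assumption, not the source's rule.
    """
    values = link_values(graph)
    peaks = local_maxima(graph, values)
    dists = {peak: shortest_hops(graph, peak) for peak in peaks}
    clusters = {peak: [] for peak in peaks}
    for node in graph:
        candidates = [peak for peak in peaks if node in dists[peak]]
        if not candidates:
            continue  # node is not connected to any local maximum
        best = min(candidates, key=lambda p: (dists[p][node], -values[p]))
        clusters[best].append(node)
    return clusters


# Two hubs ("a" and "e") joined through the link d-f, all weights equal to 1.
example = {
    "a": {"b": 1, "c": 1, "d": 1},
    "b": {"a": 1},
    "c": {"a": 1},
    "d": {"a": 1, "f": 1},
    "e": {"f": 1, "g": 1, "h": 1},
    "f": {"e": 1, "d": 1},
    "g": {"e": 1},
    "h": {"e": 1},
}
print(cluster(example))
```

In this example the two hubs are the only local maxima, and each of the remaining nodes ends up grouped with the hub it is closest to. Note that no node attribute is compared at any point; only the link structure is used.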
84 Citations
13 Claims
1. A method for identifying clusters within a network including a plurality of nodes and links, comprising the steps of:
- determining a link value for each node in the network, with each node being characterized such that it does not require the sharing of an attribute value to be clustered;
- determining a local maxima within the network by locating node values where the sum of link values are higher than the sum of link values for all adjacent nodes;
- determining a list of the shortest pathways between a local maximum and all other nodes in the network, wherein a pathway embodies a rate of change of link values and is able to traverse a plurality of linked nodes;
- collapsing the pathways in accordance with a predetermined algorithm; and
- associating all nodes that remain connected to each local maximum along the pathways.
Dependent claims: 2, 3, 4, 5.
6. A computer program product comprising at least one non-transitory computer-readable storage medium having computer-readable program code portions stored therein, the computer-readable program code portions comprising:
- an executable portion configured to determine a link value for each node in the network, with each node being characterized such that it does not require the sharing of an attribute value to be clustered;
- an executable portion configured to determine a local maxima within the network by locating node values where the sum of link values are higher than the sum of link values for all adjacent nodes;
- an executable portion configured to determine a list of the shortest pathways between a local maximum and all other nodes in the network, wherein a pathway embodies a rate of change of link values and is able to traverse a plurality of linked nodes;
- an executable portion configured to collapse the pathways in accordance with a predetermined algorithm; and
- an executable portion configured to associate all nodes that remain connected to each local maximum along the pathways.
7. A method for determining the probability of a data record being associated with another data record, comprising the steps of:
- determining the length of the shortest pathway value and the number of unique pathways of the shortest length between the two data records, wherein a pathway embodies a rate of change of link values and is able to traverse a plurality of linked data records; and
- dividing the shortest pathway length by the number of unique pathways to determine a pathway value, wherein the pathway value is an indicator of the similarity between the selected data record and the at least one other data record.
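The division described in claim 7 can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the claimed implementation: data records are modelled as nodes of an unweighted, undirected graph, pathway length is counted in hops, and the function name `pathway_value` is hypothetical. A single breadth-first search yields both the shortest pathway length and the number of distinct pathways of that length.

```python
from collections import deque


def pathway_value(graph, source, target):
    """Shortest pathway length divided by the number of distinct shortest pathways.

    Assumes an unweighted graph given as {record: iterable of linked records}.
    Returns None when the two records are not connected.
    """
    dist = {source: 0}   # hop count of the shortest pathway from source
    count = {source: 1}  # number of distinct shortest pathways from source
    queue = deque([source])
    while queue:
        node = queue.popleft()
        for nbr in graph[node]:
            if nbr not in dist:
                # First time this record is reached: inherit the pathway count.
                dist[nbr] = dist[node] + 1
                count[nbr] = count[node]
                queue.append(nbr)
            elif dist[nbr] == dist[node] + 1:
                # Another pathway of the same shortest length has been found.
                count[nbr] += count[node]
    if target not in dist:
        return None
    return dist[target] / count[target]


# Diamond a-b-d / a-c-d with a tail d-e.
records = {
    "a": ["b", "c"],
    "b": ["a", "d"],
    "c": ["a", "d"],
    "d": ["b", "c", "e"],
    "e": ["d"],
}
print(pathway_value(records, "a", "d"))  # length 2, two pathways
print(pathway_value(records, "a", "e"))  # length 3, two pathways
```

Under this reading, a pair of records joined by a short pathway, or by many pathways of the same shortest length, receives a smaller pathway value than a pair joined by a single long pathway.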
8. A system for identifying clusters within a network including a plurality of nodes and links, comprising:
a processor arranged to:
- determine a link value for each node in the network, with each node being characterized such that it does not require the sharing of an attribute value to be clustered;
- determine a local maxima within the network by locating node values where the sum of link values are higher than the sum of link values for all adjacent nodes;
- determine a list of the shortest pathways between a local maximum and all other nodes in the network, wherein a pathway embodies a rate of change of link values and is able to traverse a plurality of linked nodes;
- collapse the pathways in accordance with a predetermined algorithm; and
- associate all nodes that remain connected to each local maximum along the pathways.
Dependent claims: 9, 10, 11, 12.
13. A system for determining the probability of a data record being associated with another data record, comprising:
a processor arranged to:
- determine the length of the shortest pathway value and the number of unique pathways of the shortest length between the two data records, wherein a pathway embodies a rate of change of link values and is able to traverse a plurality of linked data records; and
- divide the shortest pathway length by the number of unique pathways to determine a pathway value, wherein the pathway value is an indicator of the similarity between the selected data record and the at least one other data record.
Specification