Method for discovery of clusters of objects in an arbitrary undirected graph using a difference between a fraction of internal connections and maximum fraction of connections by an outside object
First Claim
Patent Images
1. A computer-implemented method for discovery of a cluster of objects in an arbitrary undirected graph, the method comprising:
- determining a subset of the objects by performing a random walk starting from a first object of the objects and following a plurality of random edges of subsequent objects, the subset comprising the first object and the subsequent objects;
determining an enlarged subset by enlarging the subset to include other objects well-connected to the subset; and
determining whether the enlarged subset is a cluster wherein fraction of internal connections which defines a cluster (β
) is determined, a maximum fraction of connections by an outside object to said cluster (α
) is determined, and said cluster is defined as acceptable when the difference between said fraction of internal connections and said maximum fraction of connections by an outside object (β
−
α
) exceeds a threshold value.
8 Assignments
0 Petitions
Accused Products
Abstract
A method for discovery of a cluster of objects in an arbitrary undirected graph. A subset of the objects is determined by performing a random walk starting from a first object of the objects and following a plurality of random edges of subsequent objects, the subset comprising the first object and the subsequent objects. An enlarged subset is determined by enlarging the subset to include other objects well-connected to the subset. It is determined whether the enlarged subset is a cluster.
12 Citations
20 Claims
-
1. A computer-implemented method for discovery of a cluster of objects in an arbitrary undirected graph, the method comprising:
-
determining a subset of the objects by performing a random walk starting from a first object of the objects and following a plurality of random edges of subsequent objects, the subset comprising the first object and the subsequent objects; determining an enlarged subset by enlarging the subset to include other objects well-connected to the subset; and determining whether the enlarged subset is a cluster wherein fraction of internal connections which defines a cluster (β
) is determined, a maximum fraction of connections by an outside object to said cluster (α
) is determined, and said cluster is defined as acceptable when the difference between said fraction of internal connections and said maximum fraction of connections by an outside object (β
−
α
) exceeds a threshold value. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A computer-usable storage medium having computer-readable program code embodied therein for causing a computer system to perform a method for discovery of clusters of vertices in an arbitrary undirected graph, the method comprising:
for a plurality of vertices of the arbitrary undirected graph performing a random walk starting from a selected vertex and traversing a predetermined number of random edges of subsequent vertices yielding a subset of the vertices, the subset comprising the selected vertex and the subsequent vertices; selectively adding vertices to the subset and removing vertices from the subset based at least in part on connectivity of the vertices to the subset yielding a proposed cluster; and evaluating whether the proposed cluster is acceptable as a cluster wherein fraction of internal connections which defines said cluster (β
) is determined, a maximum fraction of connections by an outside object to said cluster (α
) is determined, and said cluster is defined as acceptable when the difference between said fraction of internal connections and said maximum fraction of connections by an outside object (β
−
α
) exceeds a threshold value.- View Dependent Claims (12, 13, 14, 15)
-
16. A system for identifying clusters of objects in an arbitrary undirected graph, the system comprising:
-
means for determining a subset of the objects by performing a random walk starting from a first object of the objects and following a plurality of random edges of subsequent objects, the subset comprising the first object and the subsequent objects; means for determining an enlarged subset by enlarging the subset to include other objects well-connected to the subset; and means for determining whether the enlarged subset is a cluster wherein fraction of internal connections which defines a cluster (β
) is determined, a maximum fraction of connections by an outside object to said cluster (α
) is determined, and said cluster is defined as acceptable when the difference between said fraction of internal connections and said maximum fraction of connections by an outside object (β
−
α
) exceeds a threshold value. - View Dependent Claims (17, 18, 19, 20)
-
Specification