News group clustering based on cross-post graph
First Claim
Patent Images
1. A system that facilitates analyzing newsgroup clusters, comprising:
- a data reception component that receives and recognizes data relating to a plurality of newsgroups; and
an engine that constructs a weighted graph with a subset of the newsgroups represented as vertices of the graph, and cross-postings relating to the subset of newsgroups represented as edges.
2 Assignments
0 Petitions
Accused Products
Abstract
A system and/or method that facilitates analyzing newsgroup clusters. A data reception component receives data relating to a plurality of newsgroups and relays the data to an engine that constructs a weighted graph. The weighted graph represents a subset of the newsgroups as vertices of the graph. The vertices are connected by edges, which represent cross-postings relating to the subset of newsgroups.
90 Citations
42 Claims
-
1. A system that facilitates analyzing newsgroup clusters, comprising:
-
a data reception component that receives and recognizes data relating to a plurality of newsgroups; and
an engine that constructs a weighted graph with a subset of the newsgroups represented as vertices of the graph, and cross-postings relating to the subset of newsgroups represented as edges. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. A method for creating a weighted newsgroup graph comprising:
-
receiving and recognizing data relating to a plurality of newsgroups; and
constructing a weighted graph such that newsgroups are represented as vertices and cross-posts are represented as edges. - View Dependent Claims (22, 23, 24, 25, 26, 27, 28, 29)
-
-
30. A system that facilitates analyzing newsgroup clusters, comprising:
-
a data reception component that receives data relating to a plurality of newsgroups;
an engine that constructs a weighted graph with a subset of the newsgroups represented as vertices of the graph, and cross-postings relating to the subset of newsgroups represented as edges; and
further comprising at least one of the following components;
a filtering component that facilitates excluding particular newsgroups from being represented in the graph so as to facilitate reducing the size of the graph;
a paring component that trims edges of the graph with weight less than a threshold weight so as to facilitate reducing the size of the graph;
a segmenting component that segments the graph via spectral clustering; and
a post-processing component that merges a first cluster into a segment cluster if a sum of weights between the clusters is greater than a threshold. - View Dependent Claims (31, 32, 33, 34, 35, 36, 37, 38, 39)
-
-
40. A method for creating a cluster graph comprising the following steps:
-
receiving newsgroup data;
excluding newsgroups that do not contain a threshold number of postings;
paring edges with weight below a threshold;
generating a weighted graph with the newsgroups represented as vertices and the cross-postings represented as edges;
segmenting the graph into clusters;
merging clusters if the sum of the weights between clusters is greater than a threshold; and
outputting the graph.
-
-
41. A system that facilitates analyzing newsgroup clusters, comprising:
-
means for receiving and recognizing data relating to a plurality of newsgroups; and
means for constructing a weighted graph with a subset of the newsgroups represented as vertices of the graph, and cross-postings relating to the subset of newsgroups represented as edges.
-
-
42. A data packet that passes between at least two computer processes, comprising:
a field that stores a weighted graph representative of a plurality of newsgroups with a subset of the newsgroups represented as vertices of the graph, and cross-postings relating to the subset of newsgroups represented as edges
Specification