Methods and systems for identifying manipulated articles
First Claim
Patent Images
1. A method, comprising:
- determining at least one cluster comprising a plurality of articles;
analyzing signals associated with one or more articles in the plurality of articles to determine an overall signal for the cluster; and
determining if articles in the plurality of articles are manipulated articles based at least in part on the overall signal;
wherein determining the at least one cluster comprises computing a dense bipartite subgraph of articles comprising doorway articles and target articles, wherein the doorway articles contain links to the target articles.
2 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods that identify manipulated articles are described. In one embodiment, a search engine implements a method comprising determining at least one cluster comprising a plurality of articles, analyzing signals to determine an overall signal for the cluster, and determining if the articles are manipulated articles based at least in part on the overall signal.
-
Citations
22 Claims
-
1. A method, comprising:
-
determining at least one cluster comprising a plurality of articles; analyzing signals associated with one or more articles in the plurality of articles to determine an overall signal for the cluster; and determining if articles in the plurality of articles are manipulated articles based at least in part on the overall signal; wherein determining the at least one cluster comprises computing a dense bipartite subgraph of articles comprising doorway articles and target articles, wherein the doorway articles contain links to the target articles. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A computer-implemented method comprising:
-
forming a cluster of documents from a plurality of network-accessible documents by identifying a dense bipartite subgraph from the plurality of network-accessible documents, the dense bipartite subgraph comprising a first set of doorway documents and a second set of target documents, wherein doorway documents in the first set have links to target documents in the second set; analyzing a plurality of documents in the cluster of documents to determine an overall value for the cluster; and when the overall value is greater than a threshold value, marking at least one of the documents in the cluster as a manipulated article. - View Dependent Claims (9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22)
-
Specification