Search ranger system and double-funnel model for search spam analyses and browser protection
First Claim
1. One or more computing devices comprising:
- one or more processors; and
one or more computer readable storage media storing instructions that, when executed by the one or more processors, configure the one or more computing devices to;
identify one or more spammer targeted keywords relating to keywords used in search queries submitted to a search engine by one or more users, the identifying being based in part on a popularity of the one or more spammer targeted keywords amongst advertisers, wherein the popularity of the one or more spammer targeted keywords amongst the advertisers is based in part on a number of bids provided by the advertisers, and wherein the one or more spammer targeted keywords is associated with a syndication business, the syndication business including at least a publisher, an advertiser, and a syndicator;
perform one or more queries for the one or more spammer targeted keywords to retrieve a set of uniform resource locators (URLs) that provide search results;
scan the set of URLs based in part on the one or more spammer targeted keywords by accessing at least one URL in the set of URLs;
determine that the at least one URL redirects to a known spammer domain, the at least one URL including an associated URL domain;
determine that the at least one URL of the set of URLs comprises a spam URL;
update a list of known spam domains to include the associated URL domain; and
determine that the associated URL domain is associated with a spam syndication program, the spam syndication program including at least a spam publisher associated with a doorway page for redirecting a browser to a redirection domain associated with the spam publisher.
2 Assignments
0 Petitions
Accused Products
Abstract
An exemplary system for monitoring search spam and protecting against search spam includes a self-monitoring subsystem to uncover spam patterns and a self-protection subsystem to protect against spam by providing spam-related information to strengthen a relevance ranking algorithm. An exemplary architecture for monitoring search spam includes a first component to receive one or more spammer targeted keywords and to search, scan and analyze URLs based at least in part on the one or more spammer targeted keywords, a second component to receive one or more URLs from the first component and to verify one or more of these URLs as a spam URL and a third component to collect spammer targeted keywords associated with one or more spam URLs and to provide one or more of the spammer targeted keywords to the first component.
-
Citations
24 Claims
-
1. One or more computing devices comprising:
-
one or more processors; and one or more computer readable storage media storing instructions that, when executed by the one or more processors, configure the one or more computing devices to; identify one or more spammer targeted keywords relating to keywords used in search queries submitted to a search engine by one or more users, the identifying being based in part on a popularity of the one or more spammer targeted keywords amongst advertisers, wherein the popularity of the one or more spammer targeted keywords amongst the advertisers is based in part on a number of bids provided by the advertisers, and wherein the one or more spammer targeted keywords is associated with a syndication business, the syndication business including at least a publisher, an advertiser, and a syndicator; perform one or more queries for the one or more spammer targeted keywords to retrieve a set of uniform resource locators (URLs) that provide search results; scan the set of URLs based in part on the one or more spammer targeted keywords by accessing at least one URL in the set of URLs; determine that the at least one URL redirects to a known spammer domain, the at least one URL including an associated URL domain; determine that the at least one URL of the set of URLs comprises a spam URL; update a list of known spam domains to include the associated URL domain; and determine that the associated URL domain is associated with a spam syndication program, the spam syndication program including at least a spam publisher associated with a doorway page for redirecting a browser to a redirection domain associated with the spam publisher. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A method comprising:
-
determining at least one spammer targeted keyword relating to a common keyword used in commerce search queries, the determining being based in part on a popularity of the at least one spammer targeted keyword amongst advertisers, wherein the popularity of the at least one spammer targeted keyword amongst the advertisers is based in part on a number of bids provided by the advertisers, and wherein the at least one spammer targeted keyword is associated with a syndication business, the syndication business including at least a publisher, an advertiser, and a syndicator; inputting the at least one spammer targeted keyword to a search engine to generate search results including a plurality of uniform resource locators (URLs); accessing, by one or more processors, one or more URLs of the plurality of URLs; recording the one or more URLs, wherein the recording comprises redirection tracking that intercepts redirection traffic; grouping the one or more recorded URLs using similarity-based grouping; verifying that at least one of the one or more URLs comprises a spam URL; and determining that the spam URL is associated with a spam syndication program, the spam syndication program including at least a spam publisher associated with a doorway page for redirecting a browser to a redirection domain associated with the spam publisher. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. A method comprising:
-
identifying one or more spammer targeted keywords related to keywords used in search queries, the identifying being based in part on a popularity of the one or more spammer targeted keywords amongst advertisers, wherein the popularity of the one or more spammer targeted keywords amongst the advertisers is based in part on a number of bids provided by the advertisers, and wherein the one or more spammer targeted keywords is associated with a syndication business, the syndication business including at least a publisher, an advertiser, and a syndicator; implementing a search based in part on the one or more spammer targeted keywords; accessing, by one or more processors, individual uniform resource locators (URLs) of a set of URLs that comprise a search result to the search; recording one or more redirection URLs associated with the set of URLs based in part on the accessing; determining that the one or more redirection URLs redirect to a known spammer domain, the one or more redirection URLs including one or more associated URL domains; classifying, based in part on the recording and the determining, one or more URLs of the set of URLs as one or more classified spam URLs; updating a list of known spam domains to include the one or more associated URL domains; and determining that the one or more associated URL domains are associated with a spam syndication program, the spam syndication program including at least a spam publisher associated with a doorway page for redirecting a browser to a redirection domain associated with the spam publisher. - View Dependent Claims (22, 23, 24)
-
Specification