Detecting click spam
First Claim
1. A computer-implemented method, the method comprising:
obtaining a model of typical behavior for each of a plurality of different statistics derived from selection data associated with each of a plurality of user identifiers, the selection data representing prior selections of search results by users associated with respective user identifiers, wherein the search results are responsive to respective search queries;
determining a distribution of counts of deviations to counts of the user identifiers having those deviations, wherein a particular deviation is a calculated statistic for a user identifier that deviates from typical behavior for the statistic according to the model;
determining a plurality of zones within the distribution of counts of deviations to counts of the user identifiers having those deviations, wherein each zone defines a respective range of deviations in the distribution;
determining that a first user identifier of the plurality of user identifiers has a count of deviations that is within a particular zone of the plurality of zones within the distribution; and
modifying, by one or more computers, selection data associated with the first user identifier based on a weight associated with the particular zone within the distribution in response to determining that the first user identifier has a count of deviations that is within the particular zone within the distribution.
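The steps recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation: the claim does not say how "typical behavior" is modeled, where zone boundaries fall, or what the weights are. The sketch assumes a per-statistic mean and standard deviation as the model, a z-score threshold as the deviation test, and three hand-picked zones. All function names, statistic names, thresholds, and weights are hypothetical.

```python
from collections import Counter
from statistics import mean, pstdev

# Hypothetical zones: (lowest deviation count, highest count or None, weight).
# The claim only requires that each zone covers a range of deviation counts
# and carries a weight; these boundaries and weights are illustrative.
ZONES = [(0, 0, 1.0), (1, 1, 0.5), (2, None, 0.0)]


def build_model(stats_by_user):
    """Assumed model of typical behavior: (mean, std) per statistic."""
    names = sorted({name for stats in stats_by_user.values() for name in stats})
    model = {}
    for name in names:
        values = [s[name] for s in stats_by_user.values() if name in s]
        model[name] = (mean(values), pstdev(values))
    return model


def count_deviations(user_stats, model, z_threshold=2.0):
    """Count the statistics on which one user deviates from the model."""
    count = 0
    for name, value in user_stats.items():
        mu, sigma = model[name]
        if sigma > 0 and abs(value - mu) / sigma > z_threshold:
            count += 1
    return count


def deviation_distribution(stats_by_user, model, z_threshold=2.0):
    """Return per-user deviation counts and the distribution
    {deviation count: number of user identifiers with that count}."""
    per_user = {
        user: count_deviations(stats, model, z_threshold)
        for user, stats in stats_by_user.items()
    }
    return per_user, Counter(per_user.values())


def zone_weight(deviation_count, zones=ZONES):
    """Look up the weight of the zone containing a deviation count."""
    for low, high, weight in zones:
        if deviation_count >= low and (high is None or deviation_count <= high):
            return weight
    raise ValueError(f"no zone covers deviation count {deviation_count}")


def reweight_selection_data(clicks_by_user, per_user_deviations, zones=ZONES):
    """Scale each user's click count by the weight of that user's zone."""
    return {
        user: clicks * zone_weight(per_user_deviations[user], zones)
        for user, clicks in clicks_by_user.items()
    }


if __name__ == "__main__":
    # Ten users with identical statistics, then two made-up outliers.
    stats = {
        f"u{i}": {"clicks_per_day": 10.0, "repeat_ratio": 0.1, "queries_per_hour": 5.0}
        for i in range(10)
    }
    stats["u9"].update(clicks_per_day=1000.0, repeat_ratio=0.9)
    stats["u8"]["queries_per_hour"] = 500.0
    per_user, distribution = deviation_distribution(stats, build_model(stats))
    print(sorted(distribution.items()))
    print(reweight_selection_data({u: 20.0 for u in stats}, per_user))
```

In this sketch the zones are fixed constants for simplicity. The claim instead describes zones determined within the observed distribution, so a closer variant would derive the boundaries from the distribution itself, for example from the share of user identifiers at each deviation count.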
Abstract
A computer-implemented method for processing network activities is described. The method includes identifying a model that specifies attributes for network objects, identifying a network object having one or more attributes that deviate from the model, and providing as an input to a ranking algorithm a value associated with the deviance of the one or more attributes of the identified network object.
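As a rough illustration of the abstract, the sketch below computes a deviance value for a network object (here, a user identifier) and feeds it into a toy ranking function. The abstract does not specify how deviance is measured or how the ranking algorithm consumes it. The range-based model, the `1 - deviance` trust factor, the `click_gain` parameter, and all names are assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass
class Click:
    user_id: str
    result_id: str


def deviance(attributes, model):
    """Fraction of an object's attributes outside the model's expected
    (low, high) range. The range-based model is an assumption."""
    checked = [name for name in attributes if name in model]
    if not checked:
        return 0.0
    outside = sum(
        1 for name in checked
        if not (model[name][0] <= attributes[name] <= model[name][1])
    )
    return outside / len(checked)


def rank(base_scores, clicks, deviance_by_user, click_gain=0.1):
    """Order results by base score plus a click boost, where each click
    is discounted by the deviance of the user who made it."""
    boost = {}
    for click in clicks:
        trust = 1.0 - deviance_by_user.get(click.user_id, 0.0)
        boost[click.result_id] = boost.get(click.result_id, 0.0) + trust
    scored = {
        result: score + click_gain * boost.get(result, 0.0)
        for result, score in base_scores.items()
    }
    return sorted(scored, key=scored.get, reverse=True)
```

With this weighting, a burst of clicks from a fully deviant identifier adds nothing to a result's score, while clicks from identifiers that match the model count in full.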
24 Claims
1. A computer-implemented method, recited in full under First Claim above.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12
13. A system comprising:
one or more data processing apparatus; and
a computer-readable storage device having instructions stored thereon that, when executed by the one or more data processing apparatus, cause the one or more data processing apparatus to perform operations comprising:
obtaining a model of typical behavior for each of a plurality of different statistics derived from selection data associated with each of a plurality of user identifiers, the selection data representing prior selections of search results by users associated with respective user identifiers, wherein the search results are responsive to respective search queries;
determining a distribution of counts of deviations to counts of the user identifiers having those deviations, wherein a particular deviation is a calculated statistic for a user identifier that deviates from typical behavior for the statistic according to the model;
determining a plurality of zones within the distribution of counts of deviations to counts of the user identifiers having those deviations, wherein each zone defines a respective range of deviations in the distribution;
determining that a first user identifier of the plurality of user identifiers has a count of deviations that is within a particular zone of the plurality of zones within the distribution; and
modifying selection data associated with the first user identifier based on a weight associated with the particular zone within the distribution in response to determining that the first user identifier has a count of deviations that is within the particular zone within the distribution.
Dependent claims: 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24
Specification