Scalable user clustering based on set similarity

  • US 20070038659A1
  • Filed: 08/15/2005
  • Published: 02/15/2007
  • Est. Priority Date: 08/15/2005
  • Status: Active Grant
First Claim

1. A computer program product, encoded on an information carrier, comprising instructions operable to cause data processing apparatus to:

  • obtain a respective interest set for each of multiple users, each interest set representing items in which the respective user has expressed interest through interaction with a data processing system;

  • for each of the multiple users, determine k hash values of the respective interest set, wherein the i-th hash value is a minimum value in the respective interest set under a corresponding i-th hash function, where i is an integer between 1 and k, and where k is an integer greater than or equal to 1; and

  • assign each of the multiple users to each of the respective k clusters established for the respective user, the i-th cluster being represented by the i-th hash value, wherein the assignment of each of the multiple users to k clusters is done without regard to the assignment of any of the other users to k clusters.
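
The three steps of the claim can be illustrated with a minimal Python sketch. It is not the patent's implementation: the names `hash_item`, `min_hashes`, and `assign_clusters`, the use of salted SHA-256 as the family of k hash functions, and the choice to skip users with empty interest sets are all assumptions made for illustration.

```python
import hashlib
from collections import defaultdict


def hash_item(item: str, i: int) -> int:
    """Hypothetical i-th hash function: SHA-256 of the item, salted with i."""
    digest = hashlib.sha256(f"{i}:{item}".encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big")


def min_hashes(interest_set, k):
    """Return k values; the i-th is the minimum of the set under hash function i."""
    return [
        min(hash_item(item, i) for item in interest_set)
        for i in range(1, k + 1)
    ]


def assign_clusters(interest_sets, k):
    """Map each (i, i-th hash value) pair to the set of users assigned to it.

    Each user's k cluster keys depend only on that user's own interest set,
    so users can be processed independently and in any order.
    """
    clusters = defaultdict(set)
    for user, items in interest_sets.items():
        if not items:
            continue  # assumption: users with no recorded interests are skipped
        for i, value in enumerate(min_hashes(items, k), start=1):
            clusters[(i, value)].add(user)
    return clusters


if __name__ == "__main__":
    # Made-up interest sets for three hypothetical users.
    interest_sets = {
        "alice": {"item-1", "item-2", "item-3"},
        "bob": {"item-1", "item-2", "item-3"},
        "carol": {"item-8", "item-9"},
    }
    for key, members in assign_clusters(interest_sets, k=4).items():
        print(key, sorted(members))
```

Under this scheme, users with identical interest sets land in the same k clusters, and the more two sets overlap, the more likely the users are to share any given cluster. Because no user's assignment depends on another's, the loop over users could be split across machines without coordination.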
