Collaborative filtering
First Claim
1. A computer-implemented method comprising:
- generating an initial overall probability distribution p(s|u) as a combination of an initial first probability distribution p(z|u) and an initial second probability distribution p(s|z), wherein the initial first probability distribution p(z|u) is a probability of a particular category of a plurality of categories given a user of a set of users, wherein the categories are represented by one or more latent variables, and wherein the initial second probability distribution p(s|z) is a probability distribution of a set of items with respect to the one or more latent variables;
calculating an updated second probability distribution p(s|z)new of a current set of items with respect to the one or more latent variables including, wherein the updated second probability distribution is calculated using counter values determined based on prior user selections of items in the set of items and according to user membership in the categories represented by the latent variables, wherein each counter value corresponds to a category of which the user is a member, and wherein each counter value is fractionally incremented relative to other categories of which the user is also a member; and
generating a relationship score for each of one or more items in the current set of items, wherein each relationship score generated for a particular item relates the particular item to relating to a particular user in the set of users based on the particular user'"'"'s category memberships and the updated second probability distribution.
2 Assignments
0 Petitions
Accused Products
Abstract
Systems, methods, and apparatus, including computer program products, for collaborative filtering are provided. A method is provided. The method includes clustering a plurality of entities with respect to one or more latent variables in a probability distribution model of a relationship between a set of entities and a set of items, the probability distribution model comprising a probability distribution of the set of items with respect to the latent variables. The method also includes, as new items are added to the set of items, updating the probability distribution of the set of the items with respect to the latent variables, and generating an updated relationship score for an entity with respect to the set of items based on the entity'"'"'s fractional membership in the clustering with respect to the latent variables and based on the updated probability distribution of the set of the items with respect to the latent variables.
-
Citations
19 Claims
-
1. A computer-implemented method comprising:
-
generating an initial overall probability distribution p(s|u) as a combination of an initial first probability distribution p(z|u) and an initial second probability distribution p(s|z), wherein the initial first probability distribution p(z|u) is a probability of a particular category of a plurality of categories given a user of a set of users, wherein the categories are represented by one or more latent variables, and wherein the initial second probability distribution p(s|z) is a probability distribution of a set of items with respect to the one or more latent variables; calculating an updated second probability distribution p(s|z)new of a current set of items with respect to the one or more latent variables including, wherein the updated second probability distribution is calculated using counter values determined based on prior user selections of items in the set of items and according to user membership in the categories represented by the latent variables, wherein each counter value corresponds to a category of which the user is a member, and wherein each counter value is fractionally incremented relative to other categories of which the user is also a member; and generating a relationship score for each of one or more items in the current set of items, wherein each relationship score generated for a particular item relates the particular item to relating to a particular user in the set of users based on the particular user'"'"'s category memberships and the updated second probability distribution. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A system comprising:
-
one or more computer-readable storage media having instructions stored thereon; and data processing apparatus programmed to execute the instructions to perform operations comprising; generating an initial overall probability distribution p(s|u) as a combination of an initial first probability distribution p(z|u) and an initial second probability distribution p(s|z), wherein the initial first probability distribution p(z|u) is a probability of a particular category of a plurality of categories given a user of a set of users, wherein the categories are represented by one or more latent variables, and wherein the initial second probability distribution p(s|z) is a probability distribution of a set of items with respect to the one or more latent variables; calculating an updated second probability distribution p(s|z)new of a current set of items with respect to the one or more latent variables including, wherein the updated second probability distribution is calculated using counter values determined based on prior user selections of items in the set of items and according to user membership in the categories represented by the latent variables, wherein each counter value corresponds to a category of which the user is a member, and wherein each counter value is fractionally incremented relative to other categories of which the user is also a member; and generating a relationship score for each of one or more items in the current set of items, wherein each relationship score generated for a particular item relates the particular item to relating to a particular user in the set of users based on the particular user'"'"'s category memberships and the updated second probability distribution. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A non-transitory computer-readable medium having instructions stored thereon which, when executed by data processing apparatus, cause the data processing apparatus to perform operations comprising:
-
generating an initial overall probability distribution p(s|u) as a combination of an initial first probability distribution p(z|u) and an initial second probability distribution p(s|z), wherein the initial first probability distribution p(z|u) is a probability of a particular category of a plurality of categories given a user of a set of users, wherein the categories are represented by one or more latent variables, and wherein the initial second probability distribution p(s|z) is a probability distribution of a set of items with respect to the one or more latent variables; calculating an updated second probability distribution p(s|z)new of a current set of items with respect to the one or more latent variables including, wherein the updated second probability distribution is calculated using counter values determined based on prior user selections of items in the set of items and according to user membership in the categories represented by the latent variables, wherein each counter value corresponds to a category of which the user is a member, and wherein each counter value is fractionally incremented relative to other categories of which the user is also a member; and generating a relationship score for each of one or more items in the current set of items, wherein each relationship score generated for a particular item relates the particular item to relating to a particular user in the set of users based on the particular user'"'"'s category memberships and the updated second probability distribution.
-
Specification