System and method for using noisy collaborative filtering to rank and present items
Abstract
A system and method of providing predicted user ratings includes calculating the accuracy of predictions based on the variance of the distribution of the predicted user's rating. The system and method present and rank the results by treating the variance as a source of noise. The decision to present or not to present an item is made by sampling the probability distribution of the predicted rating and comparing the result to some user-set threshold (e.g., “show me all results that the system predicts I will score 3 or higher”) or a system default value.
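As a rough illustration of the presentation decision described above, the following Python sketch draws one sample from a distribution centered on the predicted rating and compares it to a threshold. The function name `should_present`, its parameters, and the use of a normal distribution at this point are illustrative assumptions, not details stated in the abstract.

```python
import random

def should_present(predicted, variance, threshold, rng=None):
    """Return True if one noisy draw around `predicted` exceeds `threshold`."""
    if rng is None:
        rng = random.Random()
    # Assumption: the predicted rating is modeled as Normal(predicted, variance).
    sample = rng.gauss(predicted, variance ** 0.5)
    return sample > threshold

# Usage: an item predicted slightly above the threshold is shown often, not always.
rng = random.Random(0)
times_shown = sum(should_present(3.2, 0.5, 3.0, rng) for _ in range(1000))
```

With a large variance, an item whose predicted rating sits below the threshold is still shown occasionally, which is the effect of treating the variance as noise.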
23 Claims
1. A collaborative filtering system for predicting a user's rating for an item, comprising:
a memory storing;
a set of correlation coefficients, wherein each correlation coefficient is a measure of similarity in ratings between pairs of users in the system who have rated a particular item; and
ratings for the item made by other users in the system; and
a processor, responsive to a request for a predicted user's rating for the item, for calculating the predicted user's rating, wherein predicted user's rating is the sum of the product of a rating and its respective correlation coefficient divided by the sum of the correlation coefficients, and for calculating the variance of the predicted user's rating, wherein the variance of the predicted user's rating is a function of the variance of the correlation coefficients and the variance of the ratings.
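The weighted average recited in claim 1 can be sketched in Python as follows. This is a minimal sketch under stated assumptions: the function name `predict_rating`, the parallel-list inputs, and returning `None` when the coefficients sum to zero are hypothetical choices, and the sketch follows only the plain wording of claim 1, not the relationships of the dependent claims.

```python
def predict_rating(ratings, coefficients):
    """Sum of (rating * coefficient) divided by the sum of the coefficients."""
    if len(ratings) != len(coefficients):
        raise ValueError("ratings and coefficients must have the same length")
    total_weight = sum(coefficients)
    if total_weight == 0:
        # Assumption: no prediction is returned when there is no net weight.
        return None
    weighted_sum = sum(r * a for r, a in zip(ratings, coefficients))
    return weighted_sum / total_weight

# Usage: two other users rated the item 5 and 1; the first is more similar.
prediction = predict_rating([5, 1], [0.75, 0.25])
```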
where $S_y^i$ is the rating of each user Y who has rated the item i, $P_0$ is a predetermined value and $\alpha_{xy}$ is the correlation coefficient between the user X and user Y.
3. The system of claim 2, wherein the processor calculates the variance of the predicted user's rating, $\mathrm{Var}(P_x^i)$, in accordance with the relationship:
4. The system of claim 2, wherein the processor calculates the variance of the predicted user's rating, $\mathrm{Var}(P_x^i)$, in accordance with the relationship:
where the range of ratings varies from 0 to $2S_A$, $K_y = (S_y^i - S_A) - \operatorname{sign}(\alpha_{xy}) \times (P_x^i - S_A)$ and is bounded by $[-2S_A, 2S_A]$.
5. The system of claim 4, wherein the processor provides the predicted user's rating when the predicted user's rating calculation exceeds a predetermined threshold value and provides the variance as a range about the predicted user's rating.
6. The system of claim 1, wherein the processor, responsive to a user set threshold rating value, provides the predicted user rating and variance to the user when the predicted user rating is greater than the threshold rating value.
7. The system of claim 1, wherein the processor ranks and presents items to the user by estimating the distribution of the prediction rating as a normal distribution with expected value given by the calculated predicted user's rating and variance by the calculated variance of the predicted user's rating.
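Under the normal-distribution estimate of claim 7, the chance that a rating exceeds a threshold follows from the standard normal tail. The Python sketch below illustrates that calculation; `prob_exceeds` and `rank_items` are hypothetical names, and ranking items by this probability is only one possible reading, not a relationship taken from the claims.

```python
import math

def prob_exceeds(predicted, variance, threshold):
    """P(rating > threshold) for rating ~ Normal(predicted, variance)."""
    if variance <= 0:
        # Degenerate case: no uncertainty, so the comparison is deterministic.
        return 1.0 if predicted > threshold else 0.0
    z = (threshold - predicted) / math.sqrt(2.0 * variance)
    return 0.5 * math.erfc(z)

def rank_items(items, threshold):
    """Sort (name, predicted, variance) tuples by exceedance probability."""
    return sorted(
        items,
        key=lambda item: prob_exceeds(item[1], item[2], threshold),
        reverse=True,
    )

# Usage: hypothetical items with (name, predicted rating, variance).
ranked = rank_items([("a", 2.5, 1.0), ("b", 4.0, 1.0), ("c", 3.0, 1.0)], 3.0)
```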
8. The system of claim 1, wherein the processor ranks and presents items to the user by estimating the distribution of the prediction rating from empirically determined usage data and using the observed distribution with expected value given by the calculated predicted user's rating and variance by the calculated variance of the predicted user's rating.
9. The system of claim 1, wherein the processor ranks and presents items to the user by sampling the probability distribution of the predicted user's rating and presents the predicted user's rating when the probability distribution is greater than a user set threshold value.
10. The system of claim 9, wherein the probability that the predicted score is greater than the user set threshold value r0 is given by the probability that the expected user's predicted rating is greater than r0:
11. The system of claim 1, wherein the set of correlation coefficients comprises a set of initial correlation coefficients $\alpha_{xy}(0)$, wherein each initial correlation coefficient is an assumed measure of similarity between pairs of users in the system; and
wherein the processor, responsive to a rating for an item, uses the received rating to update the user's correlation coefficients with the other users.
12. A method of predicting a user's rating for an item in a collaborative filtering system, comprising:
providing a set of correlation coefficients, wherein each correlation coefficient is a measure of the similarity in ratings between pairs of users who have rated a particular item;
providing ratings for the item made by other users in the system;
calculating a predicted user rating for the item, wherein the predicted rating comprises the sum of the product of a rating and its respective correlation coefficient divided by the sum of the correlation coefficients; and
calculating a variance of the predicted rating, wherein the variance of the predicted rating is a function of the variance of the correlation coefficients and the variance of the ratings.
where $S_y^i$ is the rating of each user Y who has rated the item i, $P_0$ is a predetermined value and $\alpha_{xy}$ is the correlation coefficient between the user X and user Y.
14. The method of claim 13, wherein the variance of the predicted user's rating, $\mathrm{Var}(P_x^i)$, is calculated in accordance with the relationship:
15. The method of claim 13, wherein the variance of the predicted user's rating, $\mathrm{Var}(P_x^i)$, is calculated in accordance with the relationship:
where the range of ratings varies from 0 to $2S_A$, $K_y = (S_y^i - S_A) - \operatorname{sign}(\alpha_{xy}) \times (P_x^i - S_A)$ and is bounded by $[-2S_A, 2S_A]$.
16. The method of claim 15, further comprising:
providing the predicted user's rating when the predicted user's rating calculation exceeds a predetermined threshold value; and
providing the variance as a range about the predicted user's rating.
17. The method of claim 12, further comprising:
providing a user set threshold rating value; and
providing the predicted user rating and variance to the user when the predicted user rating is greater than the threshold rating value.
18. The method of claim 12, further comprising:
providing a set of initial correlation coefficients $\alpha_{xy}(0)$, wherein each initial correlation coefficient is an assumed measure of similarity between pairs of users in the system; and
responsive to receipt of a rating for an item, using the received rating to update the user's correlation coefficients with the other users.
19. A method of ranking items to be presented to a user in a collaborative filtering system, comprising:
providing a set of correlation coefficients, wherein each correlation coefficient is a measure of the similarity in ratings between pairs of users who have rated a particular item;
providing ratings for the item made by other users in the system;
calculating a predicted user rating for the item, wherein the predicted rating comprises the sum of the product of a rating and its respective correlation coefficient divided by the sum of the correlation coefficients;
calculating a variance of the predicted rating, wherein the variance of the predicted rating is a function of the variance of the correlation coefficients and the variance of the ratings;
sampling the probability distribution of the predicted user's rating; and
presenting the predicted user's rating when the value sampled from the probability distribution is greater than a predetermined threshold value.
where $S_y^i$ is the rating of each user Y who has rated the item i, $P_0$ is a predetermined value and $\alpha_{xy}$ is the correlation coefficient between the user X and user Y.
21. The method of claim 20, wherein the variance of the predicted user's rating, $\mathrm{Var}(P_x^i)$, is calculated in accordance with the relationship:
22. The method of claim 20, wherein the variance of the predicted user's rating, $\mathrm{Var}(P_x^i)$, is calculated in accordance with the relationship:
where the range of ratings varies from 0 to $2S_A$, $K_y = (S_y^i - S_A) - \operatorname{sign}(\alpha_{xy}) \times (P_x^i - S_A)$ and is bounded by $[-2S_A, 2S_A]$.
23. The method of claim 19, wherein the probability that the predicted score is greater than a user set threshold value r0 is given by the probability that the expected predicted user's rating is greater than r0:
Specification