System and method for inferring user interest based on analysis of user-generated metadata
Abstract
User-generated tags created while viewing web-based content are collected over a predetermined period of time. A subset of distinct or unique tags is identified from among the collected tags. A z-score is calculated for each identified distinct tag, where the z-score is a measure of the statistical significance of the tag. The subset of distinct tags is then sorted by z-score. All distinct tags with a z-score lower than a predetermined threshold are rejected, and the remaining distinct tags, whose z-scores are higher than the threshold, are used to infer a user's interest. Inferring a user's interests from the remaining distinct tags may thus benefit web-based applications by predicting users' interests with a high degree of accuracy from user-generated content tags and keywords.
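The steps in the abstract (count distinct tags, score each one, sort, and reject tags below a threshold) can be sketched as follows. This is a minimal illustration, not the patented implementation: it assumes the z-score is the conventional (count - mean) / standard deviation taken over the counts of the distinct tags, and the function names, the sample tags, and the fixed `threshold=1.0` are all hypothetical. The claims derive the threshold from a random tag selection baseline, which this sketch does not model.

```python
from collections import Counter
from statistics import mean, pstdev


def tag_z_scores(tags):
    """Return {tag: z-score}, assuming z = (count - mean) / std over distinct-tag counts."""
    counts = Counter(tags)          # distinct tags and how often each was selected
    freqs = list(counts.values())
    mu = mean(freqs)
    sigma = pstdev(freqs)           # population standard deviation of the counts
    if sigma == 0:
        # Every distinct tag was selected equally often, so none stands out.
        return {tag: 0.0 for tag in counts}
    return {tag: (n - mu) / sigma for tag, n in counts.items()}


def retain_significant_tags(tags, threshold=1.0):
    """Sort distinct tags by z-score (descending) and keep those above the threshold."""
    scores = tag_z_scores(tags)
    ranked = sorted(scores.items(), key=lambda item: item[1], reverse=True)
    return [(tag, z) for tag, z in ranked if z > threshold]


# Hypothetical tags collected from one user over the observation period.
collected = ["jazz"] * 9 + ["vinyl"] * 2 + ["news"] + ["weather"] + ["recipes"] * 2
print(retain_significant_tags(collected))
```

With this sample, the counts are 9, 2, 1, 1, and 2, so the mean is 3 and only `jazz` clears the threshold, with a z-score of about 1.98; the other tags have negative z-scores and are rejected.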
25 Claims
1. A method operable on a computer for inferring a user's interests from user generated tags, the method comprising:
collecting a plurality of user-generated tags from a given user and one or more users in a social network of the given user over a predetermined period of time;
electronically, calculating, via a processing device, a corresponding z-score for each of the plurality of user-generated tags, where the corresponding z-score is a measure of a statistical significance for a given tag based on an observed frequency of the given tag being selected and a tag selection frequency distribution over the plurality of user-generated tags for the predetermined period of time;
retaining one or more tags from the plurality of user-generated tags where the corresponding z-score of the one or more tags is higher than a threshold determined based on random tag selection by the given user and the one or more users in the social network of the given user based on the tag selection frequency distribution;
comparing the retained one or more tags of the given user with the retained one or more tags of the one or more users in the social network of the given user; and
inferring the given user's interest based on the comparison of the retained one or more tags of the given user and the retained one or more tags of the one or more users in the social network of the given user.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
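The comparing and inferring steps of claim 1 might look like the sketch below. The claim does not say how the comparison is performed, so the approach here is purely an assumption: each of the given user's retained tags is scored by the share of social-network members who also retained it, and tags with enough support are reported as inferred interests. The function name, the `min_support` parameter, and the sample users are hypothetical.

```python
def infer_interests(user_retained, network_retained, min_support=0.5):
    """Rank the user's retained tags by the share of network members who also retained them.

    user_retained: iterable of tags retained for the given user.
    network_retained: {member_name: iterable of tags retained for that member}.
    min_support: assumed cutoff on the fraction of members sharing a tag.
    """
    user_tags = set(user_retained)
    members = [set(tags) for tags in network_retained.values()]
    if not members:
        # Nothing to compare against; fall back to the user's own retained tags.
        return sorted(user_tags)
    support = {
        tag: sum(tag in member for member in members) / len(members)
        for tag in user_tags
    }
    shared = [tag for tag in user_tags if support[tag] >= min_support]
    # Highest support first; ties broken alphabetically for a stable order.
    return sorted(shared, key=lambda tag: (-support[tag], tag))


# Hypothetical retained tags for a user and three members of the user's social network.
user = {"jazz", "vinyl", "cycling"}
network = {
    "ana": {"jazz", "vinyl"},
    "ben": {"jazz", "hiking"},
    "cy": {"jazz", "cycling"},
}
print(infer_interests(user, network))
```

Here `jazz` is retained by all three network members, while `vinyl` and `cycling` are each retained by one, so only `jazz` passes the default cutoff. Lowering `min_support` to 0.3 would admit the other two tags as weaker inferred interests.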
10. A system for inferring a user's interests from user generated tags, the system comprising:
a processor;
a memory connected to the processor and storing instructions for operating the processor to perform steps of:
collecting a plurality of user-generated tags from a given user and one or more users in a social network of the given user over a predetermined period of time;
calculating a corresponding z-score for each of the plurality of user-generated tags, where the corresponding z-score is a measure of a statistical significance for a given tag based on an observed frequency of the given tag being selected and a tag selection frequency distribution over the plurality of user-generated tags for the predetermined period of time;
retaining one or more tags from the plurality of user-generated tags where the corresponding z-score of the one or more tags is higher than a threshold determined based on random tag selection by the given user and the one or more users in the social network of the given user based on the tag selection frequency distribution;
comparing the retained one or more tags of the given user with the retained one or more tags of the one or more users in the social network of the given user; and
inferring the given user's interest based on the comparison of the retained one or more tags of the given user and the retained one or more tags of the one or more users in the social network of the given user.
Dependent claims: 11, 12, 13, 14, 15, 16
17. A system for inferring a user's interests from user generated tags, the system comprising:
means for collecting a plurality of user-generated tags from a given user and one or more users in a social network of the given user over a predetermined period of time;
means for calculating a corresponding z-score for each of the plurality of user-generated tags, where the corresponding z-score is a measure of a statistical significance for a given tag based on an observed frequency of the given tag being selected and a tag selection frequency distribution over the plurality of user-generated tags for the predetermined period of time;
means for retaining one or more tags from the plurality of user-generated tags where the corresponding z-score of the one or more tags is higher than a threshold determined based on random tag selection by the given user and the one or more users in the social network of the given user based on the tag selection frequency distribution;
means for comparing the retained one or more tags of the given user with the retained one or more tags of the one or more users in the social network of the given user; and
means for inferring the given user's interest based on the comparison of the retained one or more tags of the given user and the retained one or more tags of the one or more users in the social network of the given user.
Dependent claims: 18, 19
20. A program stored on non-transitory computer readable media for making a computer execute steps of:
collecting a plurality of user-generated tags from a given user and one or more users in a social network of the given user over a predetermined period of time;
calculating a corresponding z-score for each of the plurality of user-generated tags, where the corresponding z-score is a measure of a statistical significance for a given tag based on an observed frequency of the given tag being selected and a tag selection frequency distribution over the plurality of user-generated tags for the predetermined period of time;
retaining one or more tags from the plurality of user-generated tags where the corresponding z-score of the one or more tags is higher than a threshold determined based on random tag selection by the given user and the one or more users in the social network of the given user based on the tag selection frequency distribution;
comparing the retained one or more tags of the given user with the retained one or more tags of the one or more users in the social network of the given user; and
inferring the given user's interest based on the comparison of the retained one or more tags of the given user and the retained one or more tags of the one or more users in the social network of the given user.
Dependent claims: 21, 22
23. A non-transitory recording medium recording a program for making a computer execute steps of:
collecting a plurality of user-generated tags from a given user and one or more users in a social network of the given user over a predetermined period of time;
calculating a corresponding z-score for each of the plurality of user-generated tags, where the corresponding z-score is a measure of a statistical significance for a given tag based on an observed frequency of the given tag being selected and a tag selection frequency distribution over the plurality of user-generated tags for the predetermined period of time;
retaining one or more tags from the plurality of user-generated tags where the corresponding z-score of the one or more tags is higher than a threshold determined based on random tag selection by the given user and the one or more users in the social network of the given user based on the tag selection frequency distribution;
comparing the retained one or more tags of the given user with the retained one or more tags of the one or more users in the social network of the given user; and
inferring the given user's interest based on the comparison of the retained one or more tags of the given user and the retained one or more tags of the one or more users in the social network of the given user.
Dependent claims: 24, 25
Specification