STATISTICAL FEATURE ENGINEERING OF USER ATTRIBUTES
First Claim
1. A method comprising:
- receiving user profile information for a user of a social networking system;
generating a profile vector associated with a user attribute based on the user profile information, the profile vector comprising profile bin scores for each of two or more attribute bins in a distribution of values of the user attribute, each attribute bin corresponding to a range of values in the distribution of values and each profile bin score indicating how closely the user profile information associated with the user attribute matches attributes characteristic of the ranges corresponding to the two or more attribute bins;
maintaining a plurality of actions performed by the user in the social networking system;
generating a behavior vector associated with the user attribute based on the plurality of actions performed by the user, the behavior vector comprising behavior bin scores for the each of the two or more attribute ranges corresponding to the two or more attribute bins for the user attribute, each behavior bin score indicating how closely the plurality of actions performed by the user associated with the user attribute matches attributes characteristic of the ranges corresponding to the two or more attribute bins;
generating a difference vector based on the difference between the profile vector and the behavior vector, the difference vector comprising difference scores for the each of the two or more attribute ranges corresponding to the two or more attribute bins; and
training a model configured for providing content to the user of the social networking system based at least in part on the user attribute, the model trained using the difference vector as a feature.
2 Assignments
0 Petitions
Accused Products
Abstract
A method trains a model for providing content items to users of a social networking system. The system generates profile vectors based on user profile information such as demographic data and personal data. The system logs actions performed by users on the social networking system and generates behavior vectors based on the logged actions. The profile vectors and behavior vectors are each associated with a user attribute, e.g., the age or gender of a user. The system generates a difference vector based on a profile vector and a behavior vector. The difference vector is then used as a feature to train the model using machine learning techniques. The trained model may select content items that a target user is most likely to be interested in and interact with.
-
Citations
21 Claims
-
1. A method comprising:
-
receiving user profile information for a user of a social networking system; generating a profile vector associated with a user attribute based on the user profile information, the profile vector comprising profile bin scores for each of two or more attribute bins in a distribution of values of the user attribute, each attribute bin corresponding to a range of values in the distribution of values and each profile bin score indicating how closely the user profile information associated with the user attribute matches attributes characteristic of the ranges corresponding to the two or more attribute bins; maintaining a plurality of actions performed by the user in the social networking system; generating a behavior vector associated with the user attribute based on the plurality of actions performed by the user, the behavior vector comprising behavior bin scores for the each of the two or more attribute ranges corresponding to the two or more attribute bins for the user attribute, each behavior bin score indicating how closely the plurality of actions performed by the user associated with the user attribute matches attributes characteristic of the ranges corresponding to the two or more attribute bins; generating a difference vector based on the difference between the profile vector and the behavior vector, the difference vector comprising difference scores for the each of the two or more attribute ranges corresponding to the two or more attribute bins; and training a model configured for providing content to the user of the social networking system based at least in part on the user attribute, the model trained using the difference vector as a feature. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A computer program product comprising a computer readable storage medium having instructions encoded therein that, when executed by a processor, cause the processor to:
-
receive user profile information for a user of a social networking system; generate a profile vector associated with a user attribute based on the user profile information, the profile vector comprising profile bin scores for each of two or more attribute bins in a distribution of values of the user attribute, each attribute bin corresponding to a range of values in the distribution of values and each profile bin score indicating how closely the user profile information associated with the user attribute matches attributes characteristic of the ranges corresponding to the two or more attribute bins; maintain a plurality of actions performed by the user in the social networking system; generate a behavior vector associated with the user attribute based on the plurality of actions performed by the user, the behavior vector comprising behavior bin scores for the each of the two or more attribute ranges corresponding to the two or more attribute bins for the user attribute, each behavior bin score indicating how closely the plurality of actions performed by the user associated with the user attribute matches attributes characteristic of the ranges corresponding to the two or more attribute bins; generate a difference vector based on the difference between the profile vector and the behavior vector, the difference vector comprising difference scores for the each of the two or more attribute ranges corresponding to the two or more attribute bins; and train a model configured for providing content to the user of the social networking system based at least in part on the user attribute, the model trained using the difference vector as a feature. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A system comprising a web server configured for:
-
receiving user profile information for a user of a social networking system; generating a profile vector associated with a user attribute based on the user profile information, the profile vector comprising profile bin scores for each of two or more attribute bins in a distribution of values of the user attribute, each attribute bin corresponding to a range of values in the distribution of values and each profile bin score indicating how closely the user profile information associated with the user attribute matches attributes characteristic of the ranges corresponding to the two or more attribute bins; maintaining a plurality of actions performed by the user in the social networking system; generating a behavior vector associated with the user attribute based on the plurality of actions performed by the user, the behavior vector comprising behavior bin scores for the each of the two or more attribute ranges corresponding to the two or more attribute bins for the user attribute, each behavior bin score indicating how closely the plurality of actions performed by the user associated with the user attribute matches attributes characteristic of the ranges corresponding to the two or more attribute bins; generating a difference vector based on the difference between the profile vector and the behavior vector, the difference vector comprising difference scores for the each of the two or more attribute ranges corresponding to the two or more attribute bins; and training a model configured for providing content to the user of the social networking system based at least in part on the user attribute, the model trained using the difference vector as a feature. - View Dependent Claims (16, 17, 18, 19, 20, 21)
-
Specification