Iterative classifier training on online social networks

  • US 10,552,759 B2
  • Filed: 12/01/2014
  • Issued: 02/04/2020
  • Est. Priority Date: 12/01/2014
  • Status: Expired due to Fees
First Claim

1. A method comprising, by one or more computing devices associated with an online social network:

    accessing, by one or more of the computing devices, a first set of objects posted to the online social network by a plurality of client systems associated with a plurality of users of the online social network, the objects being stored in one or more data stores associated with the online social network, each object being associated with one or more comments, wherein each comment is posted to the online social network with respect to the corresponding object by a client system associated with a user of the online social network;

    generating, by one or more of the computing devices, a second set of objects by adding one or more objects in the first set of objects meeting a first filtering criteria to the second set;

    scoring, by one or more of the computing devices, each object in the second set of objects by comparing the comments associated with each object to a pre-determined set of terms associated with a first object-classification;

    generating, by one or more of the computing devices, a training set of objects from the second set of objects by selecting each object from the second set of objects having a score greater than a first threshold score, each object in the training set being pre-classified with the first object-classification;

    generating, by one or more of the computing devices, an object-classifier algorithm for the first object-classification, the object-classifier algorithm comprising one or more features each comprising a text expression and stored in association with a feature score indicating a level of correlation between the corresponding feature and the first object-classification and being determined through an iterative training process performed one or more times, each iteration of the iterative training process comprising:

    training an initial object-classifier algorithm based on the comments associated with the objects in the training set of objects;

    accessing a third set of objects posted to the online social network by a plurality of client systems associated with a plurality of users of the online social network, the objects being stored in one or more data stores associated with the online social network, each object being associated with one or more comments, wherein each comment is posted to the online social network with respect to the corresponding object by a client system associated with a user of the online social network;

    classifying, using the initial object-classifier algorithm, each object in the third set of objects based on an analysis of the comments associated with each object, one or more of the objects in the third set of objects being classified with the first object-classification, wherein the analysis comprises comparing the comments with the features;

    training a revised object-classifier algorithm based on the comments associated with the objects in the third set of objects having the first object-classification, wherein the training comprises revising one or more features of the initial object-classifier algorithm;

    accessing a fourth set of objects stored in one or more data stores associated with the online social network, the fourth set of objects being generated by applying a second filtering criteria to a fifth set of objects posted to the online social network by a plurality of client systems associated with a plurality of users, the objects being stored in one or more data stores associated with the online social network, each object being associated with one or more comments, wherein each comment is posted to the online social network with respect to the corresponding object by a client system associated with a user of the online social network;

    classifying, using the revised object-classifier algorithm, each object in the fourth set of objects based on an analysis of the comments associated with each object, one or more objects in the fourth set of objects being classified with the first object-classification, wherein the analysis comprises comparing the comments with the modified features;

    determining a level of precision associated with the object-classifier algorithm; and

    if the determined level of precision does not satisfy a specified threshold level of precision;

    then, generating a sixth set of objects from the fourth set of objects by selecting each object from the fourth set of objects having a score greater than a second threshold score, each object in the sixth set of objects being classified with the first object-classification, wherein the sixth set of objects is to be used as the training set in a next iteration of the iterative training process;

    else, terminating the iterative training process and storing the revised object-classifier algorithm; and

    classifying, using the generated object-classifier algorithm, a particular object posted by a client system associated with a particular user, wherein the particular object is classified with the first object-classification; and

    determining a status of the particular user posting the particular object based at least in part on the classification of the particular object.
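
The iterative process recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation: the object layout (a dict with a `comments` list), the function names, the threshold defaults, the toy feature score (the share of training objects whose comments contain a term), and the `is_positive` callback used to estimate precision are all assumptions introduced here for illustration. The claim does not specify how feature scores or precision are computed.

```python
def tokens(comment):
    # Hypothetical tokenizer: lowercase, whitespace-separated terms.
    return comment.lower().split()


def seed_score(obj, seed_terms):
    # Score an object as the fraction of its comments that contain at
    # least one term from the pre-determined term set (assumed scoring).
    comments = obj["comments"]
    if not comments:
        return 0.0
    hits = sum(1 for c in comments if seed_terms.intersection(tokens(c)))
    return hits / len(comments)


def train(objects):
    # Toy trainer: feature score of a term is the fraction of training
    # objects with at least one comment containing that term.
    if not objects:
        return {}
    counts = {}
    for obj in objects:
        seen = {t for c in obj["comments"] for t in tokens(c)}
        for t in seen:
            counts[t] = counts.get(t, 0) + 1
    return {t: n / len(objects) for t, n in counts.items()}


def classify_score(obj, features):
    # Compare comments with the features: mean feature score per token.
    toks = [t for c in obj["comments"] for t in tokens(c)]
    if not toks:
        return 0.0
    return sum(features.get(t, 0.0) for t in toks) / len(toks)


def iterative_training(training_set, third_set, fourth_set, is_positive,
                       class_threshold=0.5, second_threshold=0.6,
                       target_precision=0.9, max_iters=5):
    revised, precision = {}, 0.0
    for _ in range(max_iters):
        # Train the initial classifier on the current training set.
        initial = train(training_set)
        # Classify the third set, then revise the initial features using
        # the objects that received the classification.
        third_pos = [o for o in third_set
                     if classify_score(o, initial) > class_threshold]
        revised = {**initial, **train(third_pos)}
        # Classify the (already filtered) fourth set with the revised model.
        scored = [(o, classify_score(o, revised)) for o in fourth_set]
        predicted = [o for o, s in scored if s > class_threshold]
        # Precision is estimated with an assumed ground-truth callback.
        precision = (sum(1 for o in predicted if is_positive(o)) / len(predicted)
                     if predicted else 0.0)
        if precision >= target_precision:
            break  # terminate and keep the revised classifier
        # Otherwise the high-scoring fourth-set objects (the "sixth set")
        # become the training set for the next iteration.
        training_set = [o for o, s in scored if s > second_threshold]
        if not training_set:
            break
    return revised, precision
```

In this sketch the initial training set would be built by keeping objects whose `seed_score` exceeds a first threshold, and the returned feature table could then be applied with `classify_score` to a newly posted object. A real system would also need to discount common words, which this toy feature score does not do.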
