Online active learning in user-generated content streams

  • US 9,967,218 B2
  • Filed: 10/26/2011
  • Issued: 05/08/2018
  • Est. Priority Date: 10/26/2011
  • Status: Active Grant
First Claim

1. A method for delivering modified user generated content for display on client devices, comprising the operations of:

    receiving, at one or more servers over a network, content that is user generated content from an online stream at a website, the content including text;

    converting, by a machine process executed at the one or more servers, the content into an elemental representation using a bag of words model;

    applying a probit model to the elemental representation to obtain a predictive probability that the content is abusive or not abusive, the machine process further includes, calculating an importance weight for the probit model based on the elemental representation, the importance weight is modeled as a multivariate Gaussian distribution with a mean and a covariance matrix;

    creating a probabilistic queue for delivering the content to a human labeler for acquiring a label for the content, wherein placement of the content within the probabilistic queue depends on the predictive probability that the content is abusive or not abusive;

    updating the probit model using the elemental representation, the importance weight, and the label acquired from the human labeler, the updating the probit model includes calculating an updated mean and an updated covariance matrix for the multivariate Gaussian distribution of the importance weight based on the label;

    receiving, at the one or more servers, a request from a client device for the online stream at the website, the online stream including the content;

    applying the probit model having been updated to the content and removing the content from the online stream to produce a modified online stream, the removing is based on the predictive probability that the content is abusive as calculated by the probit model having been updated; and

    sending, from the one or more servers, the modified online stream to the client device for display.
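
The operations recited in claim 1 can be illustrated with a short sketch. This is only one possible reading, not the patented implementation: the claim does not give update equations, so the code assumes a standard assumed-density-filtering (moment-matching) update for a Bayesian probit model with a Gaussian weight distribution. The vocabulary `VOCAB`, the class `OnlineProbit`, the functions `labeling_probability` and `filter_stream`, and the threshold and floor values are all hypothetical names and settings chosen for illustration.

```python
import math

# Hypothetical fixed vocabulary; a real system would use a far larger one.
VOCAB = ["idiot", "stupid", "thanks", "great", "hate"]

def bag_of_words(text, vocab=VOCAB):
    """Convert text into a word-count vector (the 'elemental representation')."""
    tokens = text.lower().split()
    return [float(tokens.count(word)) for word in vocab]

def norm_pdf(z):
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)

def norm_cdf(z):
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

class OnlineProbit:
    """Probit model whose weight vector is Gaussian with a mean and covariance matrix."""

    def __init__(self, dim, prior_var=1.0):
        self.mean = [0.0] * dim
        self.cov = [[prior_var if i == j else 0.0 for j in range(dim)]
                    for i in range(dim)]

    def _project(self, x):
        n = len(x)
        sx = [sum(self.cov[i][j] * x[j] for j in range(n)) for i in range(n)]  # Sigma x
        s2 = sum(xi * si for xi, si in zip(x, sx))                             # x' Sigma x
        m = sum(mi * xi for mi, xi in zip(self.mean, x))                       # mean' x
        return sx, s2, m

    def predict(self, x):
        """Predictive probability that the content is abusive."""
        _, s2, m = self._project(x)
        return norm_cdf(m / math.sqrt(1.0 + s2))

    def update(self, x, y):
        """Moment-matching update (an assumption); y is +1 (abusive) or -1 (not abusive)."""
        sx, s2, m = self._project(x)
        denom = math.sqrt(1.0 + s2)
        z = y * m / denom
        v = norm_pdf(z) / max(norm_cdf(z), 1e-12)  # guard against underflow
        w = v * (v + z)
        n = len(x)
        self.mean = [mi + y * si * v / denom for mi, si in zip(self.mean, sx)]
        self.cov = [[self.cov[i][j] - w * sx[i] * sx[j] / (1.0 + s2)
                     for j in range(n)] for i in range(n)]

def labeling_probability(p, floor=0.05):
    """Hypothetical queue rule: uncertain content (p near 0.5) is more likely to be queued."""
    return max(floor, 1.0 - abs(2.0 * p - 1.0))

def filter_stream(model, posts, threshold=0.8):
    """Remove posts whose predicted abuse probability reaches the threshold."""
    return [t for t in posts if model.predict(bag_of_words(t)) < threshold]
```

In use, a server would call `predict` on each incoming post, draw a random number against `labeling_probability` to decide whether to send the post to a human labeler, call `update` with the returned label, and apply `filter_stream` when a client requests the stream. A full covariance matrix is kept here to mirror the claim language; a diagonal approximation would be cheaper for large vocabularies.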

  • 6 Assignments