Online Active Learning in User-Generated Content Streams
First Claim
1. A method for monitoring user generated content, comprising the operations of:
- receiving content posted to an online stream at a website;
converting the content into an elemental representation;
inputting the elemental representation into a probit model to obtain a predictive probability that the content is abusive;
calculating an importance weight based at least in part on the elemental representation;
updating the probit model using the elemental representation, an importance weight, and an acquired label, wherein the updating occurs if a condition is met and wherein the condition depends at least in part on an instrumental distribution; and
removing the content from the online stream, based at least in part on the predictive probability if an acquired label is unavailable, wherein each operation of the method is executed by one or more processors.
6 Assignments
0 Petitions
Accused Products
Abstract
Software for online active learning receives content posted to an online stream at a website. The software converts the content into an elemental representation and inputs the elemental representation into a probit model to obtain a predictive probability that the content is abusive. The software also calculates an importance weight based on the elemental representation. And the software updates the probit model using the content, the importance weight, and an acquired label if a condition is met. The condition depends on an instrumental distribution. The software removes the content from the online stream if a condition is met. The condition depends on the predictive probability, if an acquired label is unavailable.
24 Citations
20 Claims
-
1. A method for monitoring user generated content, comprising the operations of:
-
receiving content posted to an online stream at a website; converting the content into an elemental representation; inputting the elemental representation into a probit model to obtain a predictive probability that the content is abusive; calculating an importance weight based at least in part on the elemental representation; updating the probit model using the elemental representation, an importance weight, and an acquired label, wherein the updating occurs if a condition is met and wherein the condition depends at least in part on an instrumental distribution; and removing the content from the online stream, based at least in part on the predictive probability if an acquired label is unavailable, wherein each operation of the method is executed by one or more processors. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A computer-readable storage medium persistently storing a program, wherein the program, when executed, instructs a processor to perform the following operations:
-
receive content posted to an online stream at a website; convert the content into an elemental representation; input the elemental representation into a probit model to obtain a predictive probability that the content is abusive; calculate an importance weight based at least in part on the elemental representation; update the probit model with the elemental representation, the importance weight, and an acquired label, wherein the update occurs if a condition is met and wherein the condition depends at least in part on an instrumental distribution; and remove the content from the online stream, based at least in part on the predictive probability if an acquired label is unavailable. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A method for displaying user generated content, comprising the operations of:
-
receiving content posted to an online stream at a website; converting the content into an elemental representation; inputting the elemental representation into a probit model to obtain a predictive probability that the content is interesting; calculating an importance weight based at least in part on the elemental representation; updating the probit model with the elemental representation, the importance weight, and an acquired label, wherein the updating occurs if a condition is met and wherein the condition depends at least in part on an instrumental distribution; and relocating the content in the online stream, based at least in part on the predictive probability if an acquired label is unavailable, wherein each operation of the method is executed by one or more processors. - View Dependent Claims (20)
-
Specification