Classification of offensive words
First Claim
1. A computer-implemented method, comprising:
- receiving a text sample at a computing device, the text sample comprising a set of terms;
identifying, by the computing device, that a first term of the set of terms of the text sample is designated as a term that is potentially offensive in some but not all contexts;
after identifying that the first term of the set of terms of the text sample is designated as a term that is potentially offensive in some but not all contexts, providing the text sample to an offensive term classifier, wherein the offensive term classifier is trained to process text samples containing the first term and to generate indications of whether, in respective contexts defined by the text samples, the first term is to be selectively redacted from a representation of the text sample that is output;
obtaining, by the computing device and from the offensive term classifier, an indication that, in a particular context defined by the text sample, the first term is used in an offensive manner;
in response to obtaining the indication that, in the particular context defined by the text sample, the first term is used in the offensive manner, redacting the first term from the text sample to generate a redacted version of the text sample;
presenting, by the computing device, the redacted version of the text sample;
after presenting the redacted version of the text sample, receiving a user input to un-redact the first term; and
retraining the offensive term classifier using the user input as a training signal that indicates that the first term is to not be selectively redacted from representations of text samples.
2 Assignments
0 Petitions
Accused Products
Abstract
A computer-implemented method can include identifying a first set of text samples that include a particular potentially offensive term. Labels can be obtained for the first set of text samples that indicate whether the particular potentially offensive term is used in an offensive manner. A classifier can be trained based at least on the first set of text samples and the labels, the classifier being configured to use one or more signals associated with a text sample to generate a label that indicates whether a potentially offensive term in the text sample is used in an offensive manner in the text sample. The method can further include providing, to the classifier, a first text sample that includes the particular potentially offensive term, and in response, obtaining, from the classifier, a label that indicates whether the particular potentially offensive term is used in an offensive manner in the first text sample.
58 Citations
18 Claims
-
1. A computer-implemented method, comprising:
-
receiving a text sample at a computing device, the text sample comprising a set of terms; identifying, by the computing device, that a first term of the set of terms of the text sample is designated as a term that is potentially offensive in some but not all contexts; after identifying that the first term of the set of terms of the text sample is designated as a term that is potentially offensive in some but not all contexts, providing the text sample to an offensive term classifier, wherein the offensive term classifier is trained to process text samples containing the first term and to generate indications of whether, in respective contexts defined by the text samples, the first term is to be selectively redacted from a representation of the text sample that is output; obtaining, by the computing device and from the offensive term classifier, an indication that, in a particular context defined by the text sample, the first term is used in an offensive manner; in response to obtaining the indication that, in the particular context defined by the text sample, the first term is used in the offensive manner, redacting the first term from the text sample to generate a redacted version of the text sample; presenting, by the computing device, the redacted version of the text sample; after presenting the redacted version of the text sample, receiving a user input to un-redact the first term; and retraining the offensive term classifier using the user input as a training signal that indicates that the first term is to not be selectively redacted from representations of text samples. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. One or more non-transitory computer-readable media having instructions stored thereon that, when executed by one or more processors of a computing device, cause the one or more processors to perform operations comprising:
-
receiving a text sample at a computing device, the text sample comprising a set of terms; identifying, by the computing device, that a first term of the set of terms of the text sample is designated as a term that is potentially offensive in some but not all contexts; after identifying that the first term of the set of terms of the text sample is designated as a term that is potentially offensive in some but not all contexts, providing the text sample to an offensive term classifier, wherein the offensive term classifier is trained to process text samples containing the first term and to generate indications of whether, in respective contexts defined by the text samples, the first term is to be selectively redacted from a representation of the text sample that is output; obtaining, by the computing device and from the offensive term classifier, an indication that, in a particular context defined by the text sample, the first term is used in an offensive manner; in response to obtaining the indication that, in the particular context defined by the text sample, the first term is used in the offensive manner, redacting the first term from the text sample to generate a redacted version of the text sample; presenting, by the computing device, the redacted version of the text sample; after presenting the redacted version of the text sample, receiving a user input to un-redact the first term; and retraining the offensive term classifier using the user input as a training signal that indicates that the first term is to not be selectively redacted from representations of text samples. - View Dependent Claims (12, 13, 14, 15, 16, 17)
-
-
18. A computing device comprising:
-
one or more processors; and one or more computer-readable media having instructions stored thereon that, when executed by the one or more processors, cause the one or more processors to perform operations comprising; receiving a text sample at a computing device, the text sample comprising a set of terms; identifying, by the computing device, that a first term of the set of terms of the text sample is designated as a term that is potentially offensive in some but not all contexts; after identifying that the first term of the set of terms of the text sample is designated as a term that is potentially offensive in some but not all contexts, providing the text sample to an offensive term classifier, wherein the offensive term classifier is trained to process text samples containing the first term and to generate indications of whether, in respective contexts defined by the text samples, the first term is to be selectively redacted from a representation of the text sample that is output; obtaining, by the computing device and from the offensive term classifier, an indication that, in a particular context defined by the text sample, the first term is used in an offensive manner; in response to obtaining the indication that, in the particular context defined by the text sample, the first term is used in the offensive manner, redacting the first term from the text sample to generate a redacted version of the text sample; presenting, by the computing device, the redacted version of the text sample; after presenting the redacted version of the text sample, receiving a user input to un-redact the first term; and retraining the offensive term classifier using the user input as a training signal that indicates that the first term is to not be selectively redacted from representations of text samples.
-
Specification