Classification engine for learning properties of words and multi-word expressions
First Claim
1. A method comprising:
- receiving a first plurality of sentences comprising a first word that is associated with a class and a second plurality of sentences comprising a second word that is not associated with the class;
training a classifier using positive training data for the class that is based on the first plurality of sentences and negative training data for the class that is based on the second plurality of sentences;
determining a measure of correlation between a third word and the class by using a sentence comprising the third word as an input to the classifier; and
using the measure of correlation to perform an action selected from a group comprising;
selecting content to provide to a user;
determining an automatic chat response;
orfiltering a set of content.
2 Assignments
0 Petitions
Accused Products
Abstract
Method and apparatus for training and using a classifier for words. Embodiments include receiving a first plurality of sentences comprising a first word that is associated with a class and a second plurality of sentences comprising a second word that is not associated with the class. Embodiments include training a classifier using positive training data for the class that is based on the first plurality of sentences and negative training data for the class that is based on the second plurality of sentences. Embodiments include determining a measure of correlation between a third word and the class by using a sentence comprising the third word as an input to the classifier. Embodiments include using the measure of correlation to perform an action selected from the following list: selecting content to provide to a user; determining an automatic chat response; or filtering a set of content.
6 Citations
20 Claims
-
1. A method comprising:
-
receiving a first plurality of sentences comprising a first word that is associated with a class and a second plurality of sentences comprising a second word that is not associated with the class; training a classifier using positive training data for the class that is based on the first plurality of sentences and negative training data for the class that is based on the second plurality of sentences; determining a measure of correlation between a third word and the class by using a sentence comprising the third word as an input to the classifier; and using the measure of correlation to perform an action selected from a group comprising; selecting content to provide to a user; determining an automatic chat response;
orfiltering a set of content. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system, comprising:
- one or more processors; and
a non-transitory computer-readable medium storing instructions that, when executed by the one or more processors, cause the one or more processors to perform a method, the method comprising;receiving a first plurality of sentences comprising a first word that is associated with a class and a second plurality of sentences comprising a second word that is not associated with the class; training a classifier using positive training data for the class that is based on the first plurality of sentences and negative training data for the class that is based on the second plurality of sentences; determining a measure of correlation between a third word and the class by using a sentence comprising the third word as an input to the classifier; and using the measure of correlation to perform an action selected from a group comprising; selecting content to provide to a user; determining an automatic chat response;
orfiltering a set of content. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19)
- one or more processors; and
-
20. A non-transitory computer-readable storage medium having computer-readable program code embodied therewith, the computer-readable program code executable by one or more computer processors to perform a method, the method comprising:
-
receiving a first plurality of sentences comprising a first word that is associated with a class and a second plurality of sentences comprising a second word that is not associated with the class; training a classifier using positive training data for the class that is based on the first plurality of sentences and negative training data for the class that is based on the second plurality of sentences; determining a measure of correlation between a third word and the class by using a sentence comprising the third word as an input to the classifier; and using the measure of correlation to perform an action selected from a group comprising; selecting content to provide to a user; determining an automatic chat response;
orfiltering a set of content.
-
Specification