Domain-specific sentiment classification
First Claim
1. A method of constructing a domain-specific sentiment classifier for classifying sentiment expressed by documents in a specified domain, comprising:
- scoring sentiments expressed by a plurality of domain-specific documents responsive to a domain-independent sentiment lexicon;
identifying a set of high-sentiment documents responsive to the scoring;
creating a domain-specific sentiment lexicon based at least in part on the set of high-sentiment documents;
generating the domain-specific sentiment classifier responsive to the domain-specific sentiment lexicon; and
storing the domain-specific sentiment classifier.
2 Assignments
0 Petitions
Accused Products
Abstract
A domain-specific sentiment classifier that can be used to score the polarity and magnitude of sentiment expressed by domain-specific documents is created. A domain-independent sentiment lexicon is established and a classifier uses the lexicon to score sentiment of domain-specific documents. Sets of high-sentiment documents having positive and negative polarities are identified. The n-grams within the high-sentiment documents are filtered to remove extremely common n-grams. The filtered n-grams are saved as a domain-specific sentiment lexicon and are used as features in a model. The model is trained using a set of training documents which may be manually or automatically labeled as to their overall sentiment to produce sentiment scores for the n-grams in the domain-specific sentiment lexicon. This lexicon is used by the domain-specific sentiment classifier.
79 Citations
32 Claims
-
1. A method of constructing a domain-specific sentiment classifier for classifying sentiment expressed by documents in a specified domain, comprising:
-
scoring sentiments expressed by a plurality of domain-specific documents responsive to a domain-independent sentiment lexicon; identifying a set of high-sentiment documents responsive to the scoring; creating a domain-specific sentiment lexicon based at least in part on the set of high-sentiment documents; generating the domain-specific sentiment classifier responsive to the domain-specific sentiment lexicon; and storing the domain-specific sentiment classifier. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A system for constructing a domain-specific sentiment classifier for classifying sentiment expressed by documents in a specified domain, comprising:
-
a computer-readable storage medium storing executable modules comprising; a scoring module configured to score sentiments expressed by a plurality of domain-specific documents responsive to a domain-independent sentiment lexicon; an analysis module configured to identify a set of high-sentiment documents responsive to the scoring; a lexicon module configured to create a domain-specific sentiment lexicon based at least in part on the set of high-sentiment documents; a classifier module configured to generate the domain-specific sentiment classifier responsive to the domain-specific sentiment lexicon; and a storing module configured to store the domain-specific sentiment classifier; and a processor configured to execute the modules stored by the computer-readable storage medium. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20, 21, 22)
-
-
23. A computer program product having a computer-readable storage medium having computer-executable code for constructing a domain-specific sentiment classifier for classifying sentiment expressed by documents in a specified domain, the computer-executable code comprising:
-
a scoring module configured to score sentiments expressed by a plurality of domain-specific documents responsive to a domain-independent sentiment lexicon; an analysis module configured to identify a set of high-sentiment documents responsive to the scoring; a lexicon module configured to create a domain-specific sentiment lexicon based at least in part on the set of high-sentiment documents; a classifier module configured to generate the domain-specific sentiment classifier responsive to the domain-specific sentiment lexicon; and a storing module configured to store the domain-specific sentiment classifier. - View Dependent Claims (24, 25, 26, 27, 28, 29, 30, 31, 32)
-
Specification