Domain-specific sentiment classification
First Claim
1. A method of constructing a domain-specific sentiment classifier for classifying sentiment expressed by documents in a specified domain, comprising:
- scoring sentiments expressed by one or more domain-specific documents, the scoring comprising;
determining that one or more of the domain-specific documents include an n-gram,calculating a score for the n-gram included in the one or more domain-specific documents, andcalculating a sentiment score for the one or more domain-specific documents based on the score for the n-gram included in the documents;
creating a domain-specific sentiment lexicon based at least in part on said scoring sentiments expressed by one or more domain-specific documents;
generating the domain-specific sentiment classifier based on the domain-specific sentiment lexicon; and
storing the domain-specific sentiment classifier.
2 Assignments
0 Petitions
Accused Products
Abstract
A domain-specific sentiment classifier that can be used to score the polarity and magnitude of sentiment expressed by domain-specific documents is created. A domain-independent sentiment lexicon is established and a classifier uses the lexicon to score sentiment of domain-specific documents. Sets of high-sentiment documents having positive and negative polarities are identified. The n-grams within the high-sentiment documents are filtered to remove extremely common n-grams. The filtered n-grams are saved as a domain-specific sentiment lexicon and are used as features in a model. The model is trained using a set of training documents which may be manually or automatically labeled as to their overall sentiment to produce sentiment scores for the n-grams in the domain-specific sentiment lexicon. This lexicon is used by the domain-specific sentiment classifier.
-
Citations
30 Claims
-
1. A method of constructing a domain-specific sentiment classifier for classifying sentiment expressed by documents in a specified domain, comprising:
-
scoring sentiments expressed by one or more domain-specific documents, the scoring comprising; determining that one or more of the domain-specific documents include an n-gram, calculating a score for the n-gram included in the one or more domain-specific documents, and calculating a sentiment score for the one or more domain-specific documents based on the score for the n-gram included in the documents; creating a domain-specific sentiment lexicon based at least in part on said scoring sentiments expressed by one or more domain-specific documents; generating the domain-specific sentiment classifier based on the domain-specific sentiment lexicon; and storing the domain-specific sentiment classifier. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A system for constructing a domain-specific sentiment classifier for classifying sentiment expressed by documents in a specified domain, comprising:
-
a non-transitory computer readable storage medium storing computer-executable instructions, the computer executable instructions comprising; a scoring module configured to score sentiments expressed by one or more domain-specific documents, the scoring module configured to score by performing steps comprising; determining that one or more of the domain-specific documents include an n-gram, calculating a score for the n-gram included in the one or more domain-specific documents, and calculating a sentiment score for the one or more domain-specific documents based on the score for the n-gram included in the documents; a lexicon module configured to create a domain-specific sentiment lexicon based at least in part on said scoring sentiments expressed by one or more domain-specific documents; a classifier module configured to generate the domain-specific sentiment classifier based on the domain-specific sentiment lexicon; and a storing module configured to store the domain-specific sentiment classifier. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20, 21)
-
-
22. A computer program product having a computer-readable storage medium having computer-executable code for constructing a domain-specific sentiment classifier for classifying sentiment expressed by documents in a specified domain, the computer-executable code comprising:
-
a scoring module configured to score sentiments expressed by a plurality of domain-specific documents, the scoring module configured to score by performing steps comprising; determining that one or more of the domain-specific documents include an n-gram, calculating a score for the n-gram included in the one or more domain-specific documents, and calculating a sentiment score for the one or more domain-specific documents based on the score for the n-gram included in the documents; a lexicon module configured to create a domain-specific sentiment lexicon based at least in part on said scoring sentiments expressed by one or more domain-specific documents; a classifier module configured to generate the domain-specific sentiment classifier based on the domain-specific sentiment lexicon; and a storing module configured to store the domain-specific sentiment classifier. - View Dependent Claims (23, 24, 25, 26, 27, 28, 29, 30)
-
Specification