Method and system for extracting opinions from text documents
Abstract
A method and system for extracting opinions about a subject of interest from a text document in which each sentence is analyzed individually to identify the opinions. The most relevant feature terms related to the subject are extracted from the document based on their relevancy scores. Candidate feature terms are definite noun phrases at the beginning of the sentences. For each sentence that refers to the subject or a feature term, the invention determines whether the sentence includes an opinion polarity about the subject or the feature term. The opinion polarity is detected by identifying opinion terms in the sentence using an opinion dictionary or an opinion rule base, parsing the sentence with an English parser to identify grammatical components in the sentence and their relationships, and finding a matching entry in the dictionary or the rule base.
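The flow described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation, and it rests on several assumptions: a regular expression stands in for the detection of definite noun phrases at the beginning of sentences, raw frequency stands in for the relevancy score, the opinion dictionary `OPINION_TERMS` is a hypothetical toy list, and a simple negation check replaces the English parser and rule base. All function names are illustrative.

```python
import re
from collections import Counter

# Hypothetical toy opinion dictionary: opinion term -> polarity (+1 or -1).
OPINION_TERMS = {"great": 1, "sharp": 1, "excellent": 1,
                 "poor": -1, "blurry": -1}
NEGATIONS = {"not", "never", "no"}

# Assumption: a sentence-initial definite noun phrase is approximated as
# "The" + one or two words, followed by a common verb.
DEFINITE_NP = re.compile(r"^The (\w+(?: \w+)?) (?:is|are|was|were|has|have)\b")


def split_sentences(text):
    """Naive sentence splitter on ., ! or ? followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def top_feature_terms(sentences, subject, k=2):
    """Rank candidate feature terms; frequency stands in for relevancy."""
    counts = Counter()
    for sentence in sentences:
        match = DEFINITE_NP.match(sentence)
        if match and match.group(1).lower() != subject.lower():
            counts[match.group(1).lower()] += 1
    return [term for term, _ in counts.most_common(k)]


def sentence_polarity(sentence):
    """Return +1, -1 or 0 from dictionary hits, flipping after a negation."""
    score, negate = 0, False
    for word in re.findall(r"[a-z']+", sentence.lower()):
        if word in NEGATIONS:
            negate = True
        elif word in OPINION_TERMS:
            score += -OPINION_TERMS[word] if negate else OPINION_TERMS[word]
            negate = False
    return (score > 0) - (score < 0)


def extract_opinions(text, subject, k=2):
    """Pair each subject or feature mention with its sentence polarity."""
    sentences = split_sentences(text)
    targets = [subject.lower()] + top_feature_terms(sentences, subject, k)
    results = []
    for sentence in sentences:
        lowered = sentence.lower()
        for target in targets:
            if re.search(r"\b" + re.escape(target) + r"\b", lowered):
                polarity = sentence_polarity(sentence)
                if polarity != 0:
                    results.append((target, polarity, sentence))
    return results
```

Calling `extract_opinions(review_text, "camera")` on a short product review returns `(target, polarity, sentence)` tuples, one for each sentence that mentions the subject or a top-ranked feature term and contains a dictionary opinion term. Sentences with no detected polarity are skipped, which mirrors the per-sentence analysis described above.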
22 Claims
1. A method for extracting opinions about a subject of interest from a text document having a plurality of sentences, the subject associated with a plurality of features, the method comprising the steps of:
extracting from the document feature terms related to the features most relevant to the subject;
for each sentence referring to a feature term, determining whether the sentence includes an opinion polarity about the feature term; and
for each sentence referring to the subject, determining whether the sentence includes an opinion polarity about the subject.
3. The method as recited in claim 3, wherein the opinion skeleton includes a feature term and an opinion term referring to said feature term.
21. A system for extracting opinions about a subject of interest from a text document having a plurality of sentences, the subject associated with a plurality of features, the system comprising:
means for extracting from the document feature terms related to the features most relevant to the subject;
for each sentence referring to a feature term, means for determining whether the sentence includes an opinion polarity about the feature term; and
for each sentence referring to the subject, means for determining whether the sentence includes an opinion polarity about the subject.
22. A computer-program product for use with a computer for extracting opinions about a subject of interest from a text document having a plurality of sentences, the subject associated with a plurality of features, the computer-program product comprising:
a computer-readable medium;
means, provided on the computer-readable medium, for extracting from the document feature terms related to the features most relevant to the subject;
means, provided on the computer-readable medium, for each sentence referring to a feature term, for determining whether the sentence includes an opinion polarity about the feature term; and
means, provided on the computer-readable medium, for each sentence referring to the subject, for determining whether the sentence includes an opinion polarity about the subject.
Specification