Extracting and grouping opinions from text documents
Abstract
Opinions about a topic are extracted from a corpus of text documents. Opinions are extracted based on rules defining regular expressions for parts-of-speech tags. Opinions are grouped based on their semantic orientation as favourable, unfavourable or neutral. A balanced and accurate assessment of sentiment towards a topic can thus be determined.
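The three steps described above (tagging parts of speech, matching a regular expression rule against the tag sequence, and grouping the matches by semantic orientation) can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration and does not come from the patent: the tiny `POS_LEXICON` and `POLARITY` tables, the single rule "optional adverb, adjective, noun", and the function names `tag`, `extract`, `orientation`, and `group`. A practical system would presumably use a trained part-of-speech tagger and a much larger polarity resource.

```python
import re

# Hypothetical toy lexicons, invented for this sketch only.
POS_LEXICON = {
    "the": "DT", "a": "DT", "camera": "NN", "has": "VBZ", "very": "RB",
    "good": "JJ", "new": "JJ", "terrible": "JJ", "lens": "NN",
    "battery": "NN", "but": "CC",
}
POLARITY = {"good": 1, "terrible": -1}

# Illustrative rule over POS tags: optional adverb, then adjective, then noun.
# Each tag in the encoded sequence is followed by one space.
RULE = re.compile(r"\b(?:RB )?JJ NN ")


def tag(text):
    """Tag whitespace-separated words; unknown words default to NN."""
    return [(w, POS_LEXICON.get(w, "NN")) for w in text.lower().split()]


def extract(tagged, rule=RULE):
    """Return the word sequences whose tag sequence matches the rule."""
    tags = "".join(t + " " for _, t in tagged)
    phrases = []
    for m in rule.finditer(tags):
        # One space follows every tag, so counting spaces before an offset
        # gives the index of the token at that offset.
        start = tags[:m.start()].count(" ")
        end = tags[:m.end()].count(" ")
        phrases.append([w for w, _ in tagged[start:end]])
    return phrases


def orientation(words):
    """Classify a phrase by the summed polarity of its words."""
    score = sum(POLARITY.get(w, 0) for w in words)
    if score > 0:
        return "favourable"
    if score < 0:
        return "unfavourable"
    return "neutral"


def group(text):
    """Group the extracted phrases of a text by semantic orientation."""
    groups = {"favourable": [], "unfavourable": [], "neutral": []}
    for words in extract(tag(text)):
        groups[orientation(words)].append(" ".join(words))
    return groups


if __name__ == "__main__":
    print(group("the camera has a very good lens but terrible battery"))
```

Encoding the tag sequence as a single string lets an ordinary regular expression engine apply the rule; the matched character offsets are then mapped back to token positions to recover the words.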
301 Citations
28 Claims
1. A method of analyzing expressed opinions comprising the steps of:
parsing words of at least one text-based document as parts of speech;
extracting regular expressions from the document by matching at least one regular expression rule with the parsed parts of speech; and
categorizing extracted regular expressions into representative categories of semantic orientation by analyzing the words comprising the extracted regular expressions.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
10. A computer program product comprising computer software recorded on a computer-readable medium for performing the steps of:
parsing words of at least one text-based document as parts of speech;
extracting regular expressions from the document by matching at least one regular expression rule with the parsed parts of speech; and
categorizing extracted regular expressions into representative categories of semantic orientation by analyzing the words comprising the extracted regular expressions.
Dependent claims: 12, 13, 14, 15, 16, 17, 18, 19
11. (canceled)
20. A computer system comprising computer software recorded on a computer-readable medium for performing the steps of:
parsing words of at least one text-based document as parts of speech;
extracting regular expressions from the document by matching at least one regular expression rule with the parsed parts of speech; and
categorizing extracted regular expressions into representative categories of semantic orientation by analyzing the words comprising the extracted regular expressions.
Dependent claims: 21, 22, 23, 24, 25, 26, 27, 28
Specification