Systems and methods for detecting sentiment-based topics
First Claim
1. A computer implemented method for analyzing sentiment concerning a subject, comprising:
collecting a plurality of objects from at least one external content repository, wherein the collected objects form a content database;
extracting snippets related to the subject from the content database;
calculating a sentiment score for at least one snippet that has been extracted;
for the at least one snippet for which the sentiment score has been calculated, classifying the snippet into at least one sentiment category;
creating a sentiment taxonomy using the sentiment categories, wherein the sentiment taxonomy classifies the snippets as positive, negative or neutral;
identifying topic words within the sentiment taxonomy;
classifying the topic words as sentiment topic words or non-sentiment topic words;
identifying the frequency of the non-sentiment topic words in each of the sentiment categories;
identifying the importance of the non-sentiment topic words in each of the sentiment categories; and
ranking the non-sentiment topic words, wherein the rank for a non-sentiment topic word is calculated by combining the frequency of the non-sentiment topic word in each of the sentiment categories with the importance of the non-sentiment topic word.
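The steps recited in claim 1 can be sketched in code. The following is only an illustration under stated assumptions, not the patented implementation: the tiny lexicon `LEXICON`, the sign-based mapping from score to category, the treatment of every remaining token as a topic word, the inverse-category-frequency weight used as "importance", and the product used to "combine" frequency with importance are all hypothetical choices. The claim itself does not specify any of them.

```python
import math
import re
from collections import Counter, defaultdict

# Hypothetical sentiment lexicon (assumption; the claim names no lexicon).
LEXICON = {"great": 1.0, "love": 1.0, "good": 0.5,
           "bad": -0.5, "terrible": -1.0, "hate": -1.0}

def tokenize(text):
    """Lowercase the text and return its alphabetic tokens."""
    return re.findall(r"[a-z']+", text.lower())

def extract_snippets(database, subject):
    """Keep sentences from the content database that mention the subject."""
    snippets = []
    for obj in database:
        for sentence in re.split(r"(?<=[.!?])\s+", obj):
            if subject.lower() in sentence.lower():
                snippets.append(sentence)
    return snippets

def sentiment_score(snippet):
    """Sum the lexicon weights of the snippet's words (assumed scoring rule)."""
    return sum(LEXICON.get(tok, 0.0) for tok in tokenize(snippet))

def categorize(score):
    """Map a score to positive, negative or neutral by its sign (assumption)."""
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"

def rank_topic_words(database, subject):
    """Return, per sentiment category, non-sentiment topic words ordered by rank."""
    subject_tokens = set(tokenize(subject))

    # Sentiment taxonomy: category -> snippets classified into it.
    taxonomy = defaultdict(list)
    for snippet in extract_snippets(database, subject):
        taxonomy[categorize(sentiment_score(snippet))].append(snippet)

    # Frequency of non-sentiment topic words in each category.
    # Words found in LEXICON count as sentiment topic words and are skipped.
    freq = {cat: Counter() for cat in taxonomy}
    for cat, snippets in taxonomy.items():
        for snippet in snippets:
            for tok in tokenize(snippet):
                if tok not in LEXICON and tok not in subject_tokens:
                    freq[cat][tok] += 1

    # Importance (assumed): words confined to fewer categories weigh more.
    n_cats = len(taxonomy)
    ranks = {}
    for cat, counts in freq.items():
        scored = {}
        for word, count in counts.items():
            cats_with_word = sum(1 for c in freq if word in freq[c])
            importance = math.log(1 + n_cats / cats_with_word)
            scored[word] = count * importance  # assumed way of combining
        # Highest score first; ties broken alphabetically.
        ranks[cat] = sorted(scored, key=lambda w: (-scored[w], w))
    return ranks
```

Called with a small list of review strings and the subject "camera", `rank_topic_words` returns a dictionary keyed by sentiment category whose values are word lists, best-ranked first. Because common function words are not removed in this sketch, they appear in the lists as well, only with lower scores when they occur in every category.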
Abstract
A method for analyzing sentiment concerning a subject, comprising: collecting objects from an external content repository, the collected objects forming a content database; extracting a snippet related to the subject from the content database; calculating a sentiment score for the snippet; classifying the snippet into a sentiment category; creating a sentiment taxonomy using the sentiment categories, the sentiment taxonomy classifying the snippets as positive, negative or neutral; identifying topic words within the sentiment taxonomy; classifying the topic words as sentiment topic word candidates or non-sentiment topic word candidates; filtering the non-sentiment topic word candidates; identifying the frequency of the non-sentiment topic words in each of the sentiment categories; identifying the importance of the non-sentiment topic words for each of the sentiment categories; and ranking the topic words, wherein the rank is calculated by combining the frequency of each topic word in each of the categories with its importance.
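The abstract mentions filtering the non-sentiment topic word candidates but does not say how. As a minimal sketch, assuming the filter removes stop words and rarely occurring candidates, it might look like the following; the `STOP_WORDS` list, the `min_count` threshold and the function name `filter_candidates` are all hypothetical.

```python
from collections import Counter

# Hypothetical stop-word list (assumption; the abstract does not define
# the filtering criteria).
STOP_WORDS = frozenset({"the", "a", "an", "is", "it", "i", "and", "of", "to"})

def filter_candidates(candidate_counts, min_count=2, stop_words=STOP_WORDS):
    """Drop stop words and rare candidates; return surviving word -> count."""
    return {
        word: count
        for word, count in candidate_counts.items()
        if word not in stop_words and count >= min_count
    }
```

For example, given counts of 5 for "the", 3 for "battery" and 1 for "screen", only "battery" survives with the default threshold.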
59 Citations
15 Claims
1. A computer implemented method for analyzing sentiment concerning a subject, comprising:
collecting a plurality of objects from at least one external content repository, wherein the collected objects form a content database;
extracting snippets related to the subject from the content database;
calculating a sentiment score for at least one snippet that has been extracted;
for the at least one snippet for which the sentiment score has been calculated, classifying the snippet into at least one sentiment category;
creating a sentiment taxonomy using the sentiment categories, wherein the sentiment taxonomy classifies the snippets as positive, negative or neutral;
identifying topic words within the sentiment taxonomy;
classifying the topic words as sentiment topic words or non-sentiment topic words;
identifying the frequency of the non-sentiment topic words in each of the sentiment categories;
identifying the importance of the non-sentiment topic words in each of the sentiment categories; and
ranking the non-sentiment topic words, wherein the rank for a non-sentiment topic word is calculated by combining the frequency of the non-sentiment topic word in each of the sentiment categories with the importance of the non-sentiment topic word.
Dependent claims: 2, 3, 4, 5
6. A system for analyzing sentiment concerning a subject, comprising a processor and a storage coupled to the processor, the processor configured to:
collect a plurality of objects from at least one external content repository, wherein the collected objects form a content database;
extract snippets related to the subject from the content database;
calculate a sentiment score for at least one snippet that has been extracted;
for the at least one snippet for which the sentiment score has been calculated, classify the snippet into at least one sentiment category;
create a sentiment taxonomy using the sentiment categories, wherein the sentiment taxonomy classifies the snippets as positive, negative or neutral;
identify topic words within the sentiment taxonomy;
classify the topic words as sentiment topic words or non-sentiment topic words;
identify the frequency of the non-sentiment topic words in each of the sentiment categories;
identify the importance of the non-sentiment topic words in each of the sentiment categories; and
rank the non-sentiment topic words, wherein the rank for a non-sentiment topic word is calculated by combining the frequency of the non-sentiment topic word in each of the sentiment categories with the importance of the non-sentiment topic word.
Dependent claims: 7, 8, 9, 10
11. A computer program product comprising a non-transitory computer useable storage medium to store a computer readable program, wherein the computer readable program, when executed on a computer, causes the computer to perform operations for analyzing sentiment concerning a subject, comprising:
collecting a plurality of objects from at least one external content repository, wherein the collected objects form a content database;
extracting snippets related to the subject from the content database;
calculating a sentiment score for at least one snippet that has been extracted;
for the at least one snippet for which the sentiment score has been calculated, classifying the snippet into at least one sentiment category;
creating a sentiment taxonomy using the sentiment categories, wherein the sentiment taxonomy classifies the snippets as positive, negative or neutral;
identifying topic words within the sentiment taxonomy;
classifying the topic words as sentiment topic words or non-sentiment topic words;
identifying the frequency of the non-sentiment topic words in each of the sentiment categories;
identifying the importance of the non-sentiment topic words in each of the sentiment categories; and
ranking the non-sentiment topic words, wherein the rank for a non-sentiment topic word is calculated by combining the frequency of the non-sentiment topic word in each of the sentiment categories with the importance of the non-sentiment topic word.
Dependent claims: 12, 13, 14, 15
Specification