Method for updating a knowledge base of a sentiment analysis system
First Claim
1. A method, in a data processing apparatus comprising at least one processor and at least one memory, the at least one memory comprising instructions that are executed by the at least one processor to cause the at least one processor to be configured to implement a sentiment analysis system for updating a knowledge base of the sentiment analysis system, the knowledge base being operable for storing natural language terms and a score value related to each natural language term, the score value characterizing the sentiment of the natural language term, the method comprising:
- receiving, by the sentiment analysis system, messages comprising natural language from one or more public social media systems;
deciding, by the sentiment analysis system, using content of the knowledge base, whether at least one message of the received messages has a positive sentiment or a negative sentiment;
extracting, by the sentiment analysis system, a term rom the message that is not present in the knowledge base;
based on a frequency of occurrence of the term in the received messages and the sentiment of the messages in which the term occurs, calculating, by the sentiment analysis system, a score value of the term wherein the score value of the term is defined using the following formula;
1 Assignment
0 Petitions
Accused Products
Abstract
A mechanism is provided updating a knowledge base of a sentiment analysis system, the knowledge base being operable for storing natural language terms and a score value related to each natural language term, the score value characterizing the sentiment of the natural language term. Messages comprising natural language are received. Using content of the knowledge base, a decision is made as to whether at least one message of the received messages has a positive sentiment or a negative sentiment. A term is extracted from the message that is not present in the knowledge base. Based on a frequency of occurrence of the term in the received messages and the sentiment of the messages in which the term occurs, a score value of the term is calculated, and the term and the calculated score value are stored into the knowledge base.
9 Citations
17 Claims
-
1. A method, in a data processing apparatus comprising at least one processor and at least one memory, the at least one memory comprising instructions that are executed by the at least one processor to cause the at least one processor to be configured to implement a sentiment analysis system for updating a knowledge base of the sentiment analysis system, the knowledge base being operable for storing natural language terms and a score value related to each natural language term, the score value characterizing the sentiment of the natural language term, the method comprising:
-
receiving, by the sentiment analysis system, messages comprising natural language from one or more public social media systems; deciding, by the sentiment analysis system, using content of the knowledge base, whether at least one message of the received messages has a positive sentiment or a negative sentiment; extracting, by the sentiment analysis system, a term rom the message that is not present in the knowledge base; based on a frequency of occurrence of the term in the received messages and the sentiment of the messages in which the term occurs, calculating, by the sentiment analysis system, a score value of the term wherein the score value of the term is defined using the following formula; - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A computer program product comprising a computer readable storage medium having a computer readable program stored therein, wherein the computer readable storage medium is not a transitory signal per se, and wherein the computer readable program, when executed on a computing device, causes the computing device to implement a sentiment analysis system for updating a knowledge base of the sentiment analysis system, the knowledge base being operable for storing natural language terms and a score value related to each natural language term, the score value characterizing the sentiment of the natural language term, and further causes the computing device to:
-
receive, by the sentiment analysis system, messages comprising natural language from one or more public social media systems; decide, by the sentiment analysis system, using content of a knowledge base, whether at least one message of the received messages has a positive sentiment or a negative sentiment; extract, by the sentiment analysis system, a term from the message that is not present in the knowledge base; based on a frequency of occurrence of the term in the received messages and the sentiment of the messages in which the term occurs, calculate, by the sentiment analysis system, a score value of the term wherein the score value of the term is defined using the following formula; - View Dependent Claims (13, 14)
-
-
15. An apparatus comprising;
-
a processor; and a memory coupled to the processor, wherein the memory comprises instructions which, when executed by the processor, cause the processor to implement a sentiment analysis system for updating a knowledge base of the sentiment analysis system, the knowledge base being operable for storing natural language terms and a score value related to each natural language term, the score value characterizing the sentiment of the natural language term, and further causes the processor to; receive, by the sentiment analysis system, multiple messages comprising natural language from one or more public social media systems; decide, by the sentiment analysis system, using the content of a knowledge base, whether at least one message of the received messages has a positive or a negative sentiment; extract, by the sentiment analysis system, a term from the message that is not present in the knowledge base; based on a frequency of occurrence of the term in the messages and the sentiment of the messages in that the term occurs, calculate, by the sentiment analysis system, a score value of the term wherein the score value of the term is defined using the following formula; - View Dependent Claims (16, 17)
-
Specification