APPARATUS AND METHOD FOR EXTRACTING SEMANTIC TOPIC
First Claim
1. A method for extracting a semantic topic from one or more document sets in which opinions about an object are described using an apparatus capable of calculating a probability distribution, the method comprising:
(a) extracting a word distribution about sentiment global topics and a word distribution about sentiment local topics;
(b) extracting a global topic distribution, a sentiment distribution about the global topic, a local topic distribution, and a sentiment distribution about the local topic with respect to each document of the document sets;
(c) performing statistical inference about each of the distributions extracted in the step (a) and the step (b);
(d) extracting a global or local topic and a sentiment relevant to the global or local topic from the global topic distribution and the sentiment distribution about the global topic or the local topic distribution and the sentiment distribution about the local topic with respect to each word in each document of the document sets; and
(e) extracting a word from the word distribution about sentiment global topics or the word distribution about sentiment local topics on the basis of the topic and sentiment extracted in the step (d).
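The sampling order in steps (a), (b), (d) and (e) can be illustrated with a short Python sketch. It is only an illustration under stated assumptions, not the claimed method itself: the symmetric Dirichlet priors, the equal-odds choice between a global and a local topic, and all function and parameter names (`sample_dirichlet`, `make_word_dists`, `generate_document`) are hypothetical, and the statistical inference of step (c) is not shown.

```python
import random


def sample_dirichlet(alpha, size, rng):
    """Draw a probability vector from a symmetric Dirichlet(alpha) prior."""
    draws = [rng.gammavariate(alpha, 1.0) for _ in range(size)]
    total = sum(draws)
    return [d / total for d in draws]


def sample_index(probs, rng):
    """Draw one index from a categorical distribution."""
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]


def make_word_dists(vocab_size, n_global, n_local, n_sentiments, rng):
    """Step (a): one word distribution per (kind, topic, sentiment) triple."""
    sizes = {"global": n_global, "local": n_local}
    return {
        kind: [
            [sample_dirichlet(0.5, vocab_size, rng) for _ in range(n_sentiments)]
            for _ in range(n_topics)
        ]
        for kind, n_topics in sizes.items()
    }


def generate_document(n_words, vocab, word_dists, n_sentiments, rng):
    """Steps (b), (d), (e) for a single document."""
    # Step (b): per-document topic distributions and, for each topic,
    # a sentiment distribution (assumed Dirichlet priors).
    topic_dist, sent_dist = {}, {}
    for kind in ("global", "local"):
        n_topics = len(word_dists[kind])
        topic_dist[kind] = sample_dirichlet(0.5, n_topics, rng)
        sent_dist[kind] = [
            sample_dirichlet(0.5, n_sentiments, rng) for _ in range(n_topics)
        ]

    tokens = []
    for _ in range(n_words):
        # Step (d): pick global or local (assumed equal odds), then a topic,
        # then a sentiment relevant to that topic.
        kind = rng.choice(["global", "local"])
        topic = sample_index(topic_dist[kind], rng)
        sentiment = sample_index(sent_dist[kind][topic], rng)
        # Step (e): draw the word from the matching word distribution.
        word = vocab[sample_index(word_dists[kind][topic][sentiment], rng)]
        tokens.append((kind, topic, sentiment, word))
    return tokens


if __name__ == "__main__":
    rng = random.Random(0)
    vocab = ["battery", "screen", "price", "great", "poor", "hotel"]
    word_dists = make_word_dists(len(vocab), 2, 3, 2, rng)
    for token in generate_document(5, vocab, word_dists, 2, rng):
        print(token)
```

Each generated token carries the kind of topic, the topic index, the sentiment index and the word, which mirrors the order in which steps (d) and (e) depend on one another.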
Abstract
In accordance with a first exemplary embodiment, there is provided a method for extracting semantic topics from document sets in which opinions about an object are described using an apparatus capable of calculating a probability distribution. The method includes (a) extracting word distributions about sentiment global topics and sentiment local topics; (b) extracting a global topic distribution, a local topic distribution and sentiment distributions about the global and local topics from the document sets; (c) performing statistical inference about each of the distributions extracted in the step (a) and the step (b); (d) extracting a global or local topic and a sentiment relevant to the global or local topic from the distributions inferred in the step (c); and (e) extracting a word from the word distributions about sentiment global topics or sentiment local topics on the basis of the topic and sentiment extracted in the step (d).
34 Citations
11 Claims
1. A method for extracting a semantic topic from one or more document sets in which opinions about an object are described using an apparatus capable of calculating a probability distribution, the method comprising:
(a) extracting a word distribution about sentiment global topics and a word distribution about sentiment local topics;
(b) extracting a global topic distribution, a sentiment distribution about the global topic, a local topic distribution, and a sentiment distribution about the local topic with respect to each document of the document sets;
(c) performing statistical inference about each of the distributions extracted in the step (a) and the step (b);
(d) extracting a global or local topic and a sentiment relevant to the global or local topic from the global topic distribution and the sentiment distribution about the global topic or the local topic distribution and the sentiment distribution about the local topic with respect to each word in each document of the document sets; and
(e) extracting a word from the word distribution about sentiment global topics or the word distribution about sentiment local topics on the basis of the topic and sentiment extracted in the step (d).
Dependent claims: 2, 3, 4, 5, 6
7. An apparatus for extracting a semantic topic, comprising:
a document storage unit that stores one or more document sets in which opinions about an object are described; and
a topic extraction unit that extracts a topic including a sentiment-oriented ratable aspect of the object and a sentiment about the topic from the document sets stored in the document storage unit,
wherein the topic extraction unit extracts a word distribution about sentiment topics, extracts a topic distribution and a sentiment distribution with respect to each document of the document sets, and extracts a topic and a sentiment from each of the extracted distributions with respect to each word in each document of the document sets.
Dependent claims: 8, 9, 10, 11
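The division of work between the two units of claim 7 can be sketched in Python as follows. This is a rough illustration only: the class names, the per-word `(topic, sentiment)` assignment format, and the additive-smoothing estimates are all assumptions made for the example, and the claim does not state how the distributions are computed.

```python
from collections import Counter
from dataclasses import dataclass, field


@dataclass
class DocumentStorageUnit:
    """Stores documents as lists of word tokens (hypothetical layout)."""
    documents: list = field(default_factory=list)

    def add(self, tokens):
        self.documents.append(list(tokens))


class TopicExtractionUnit:
    """Derives distributions from assumed per-word (topic, sentiment) labels."""

    def __init__(self, storage, n_topics, n_sentiments, smoothing=0.1):
        self.storage = storage
        self.n_topics = n_topics
        self.n_sentiments = n_sentiments
        self.smoothing = smoothing  # additive smoothing, an assumption

    def topic_distribution(self, assignments):
        """Smoothed share of each topic within one document."""
        counts = Counter(topic for topic, _ in assignments)
        total = len(assignments) + self.smoothing * self.n_topics
        return [(counts[t] + self.smoothing) / total for t in range(self.n_topics)]

    def sentiment_distribution(self, assignments, topic):
        """Smoothed share of each sentiment for one topic in one document."""
        counts = Counter(s for t, s in assignments if t == topic)
        total = sum(counts.values()) + self.smoothing * self.n_sentiments
        return [
            (counts[s] + self.smoothing) / total for s in range(self.n_sentiments)
        ]

    def word_distribution(self, all_assignments, topic, sentiment):
        """Smoothed word distribution for one (topic, sentiment) pair."""
        docs = self.storage.documents
        vocab = sorted({word for doc in docs for word in doc})
        counts = Counter(
            word
            for doc, assignments in zip(docs, all_assignments)
            for word, label in zip(doc, assignments)
            if label == (topic, sentiment)
        )
        total = sum(counts.values()) + self.smoothing * len(vocab)
        return {word: (counts[word] + self.smoothing) / total for word in vocab}


if __name__ == "__main__":
    storage = DocumentStorageUnit()
    storage.add(["battery", "great", "screen", "dim"])
    labels = [[(0, 0), (0, 0), (1, 1), (1, 1)]]  # made-up assignments
    unit = TopicExtractionUnit(storage, n_topics=2, n_sentiments=2)
    print(unit.topic_distribution(labels[0]))
    print(unit.word_distribution(labels, topic=0, sentiment=0))
```

Keeping storage and extraction in separate classes follows the claim's two-unit structure; the estimation logic inside `TopicExtractionUnit` could be replaced by any inference procedure without touching the storage side.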
Specification