DATA PROCESSING DEVICE, DATA PROCESSING METHOD, AND DATA PROCESSING PROGRAM
Abstract
[PROBLEMS] To provide a data processing device, such as a text mining device, capable of properly extracting characteristic structures even when the input data contains a plurality of words indicating identical content or a plurality of semantically associated words. [MEANS FOR SOLVING PROBLEMS] An association node extraction unit (22) of a text mining device (10) extracts association nodes, which contain semantically associated words, from a graph obtained as a result of syntax analysis. An association node joint unit (23) transforms the graph by joining some or all of the association nodes. A characteristic structure extraction unit (24) extracts a characteristic structure from the graph transformed by the association node joint unit.
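The three units described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes that each input text has already been parsed into (dependent, head) word pairs, that semantic association is given by a hand-written synonym table (`SYNONYM_GROUPS`, hypothetical), and that a "characteristic structure" is simply a dependency edge occurring in at least `min_count` texts. All function names are illustrative.

```python
from collections import Counter

# Hypothetical synonym groups; a real system might consult a thesaurus instead.
SYNONYM_GROUPS = [{"car", "automobile"}, {"broken", "faulty"}]


def build_representative_map(groups):
    """Association node extraction: map each associated word to one representative."""
    rep = {}
    for group in groups:
        head = min(group)  # alphabetically first word, chosen for determinism
        for word in group:
            rep[word] = head
    return rep


def join_association_nodes(edges, rep):
    """Association node joining: rewrite edges so associated words share one node."""
    return [(rep.get(dep, dep), rep.get(head, head)) for dep, head in edges]


def frequent_edges(documents, rep, min_count):
    """Characteristic structure extraction: edges found in >= min_count documents."""
    counts = Counter()
    for edges in documents:
        # Count each distinct joined edge once per document.
        counts.update(set(join_association_nodes(edges, rep)))
    return {edge: n for edge, n in counts.items() if n >= min_count}


# Each document is a list of (dependent, head) pairs from a dependency parse.
documents = [
    [("car", "broken")],
    [("automobile", "faulty")],
    [("car", "fast")],
]

rep = build_representative_map(SYNONYM_GROUPS)
result = frequent_edges(documents, rep, min_count=2)
unjoined = frequent_edges(documents, {}, min_count=2)
```

With joining, the first two documents contribute the same edge, so it reaches the frequency threshold; with an empty representative map, no edge does. This is the effect the abstract describes for inputs that use different words for the same content.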
38 Citations
34 Claims
1-16. (canceled)
17. A data processing device generating a graph which expresses an input data structure by a plurality of nodes having a single word as content thereof and by a dependency branch connecting two nodes in a dependent relationship within the plurality of nodes, and extracting a characteristic structure characterizing the input data from the graph, the device comprising:
an association node extraction unit for extracting association nodes, which are nodes semantically associated with each other, from the nodes; an association node joint unit for transforming the graph by joint of a part of or a whole of the association nodes; and a characteristic structure extraction unit for extracting the characteristic structure from a transformed graph by the association node joint unit.
Dependent claims: 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28
29. A data processing means for generating a graph which expresses an input data structure by a plurality of nodes having a single word as content thereof and by a dependency branch connecting two nodes in a dependent relationship within the plurality of nodes, and extracting a characteristic structure characterizing the input data from the graph, the means comprising:
an association node extraction means for extracting association nodes, which are nodes semantically associated with each other, from the nodes; an association node joint means for transforming the graph by joint of a part of or a whole of the association nodes; and a characteristic structure extraction means for extracting the characteristic structure from a transformed graph by the association node joint means.
30. A data processing method generating a graph which expresses an input data structure by a plurality of nodes having a single word as content thereof and by a dependency branch which connects two nodes in a dependent relationship within the plurality of nodes, and extracting a characteristic structure characterizing the input data from the graph, the method comprising:
extracting association nodes, which are nodes semantically associated, from the nodes; transforming the graph by joint of a part of or a whole of the association nodes; and extracting the characteristic structure from the transformed graph.
31. A data processing program making a computer execute the functions of:
generating a graph which expresses an input data structure by a plurality of nodes having a single word as content thereof and by a dependency branch connecting two nodes in a dependent relationship within the plurality of nodes; extracting a characteristic structure characterizing the input data from the graph; association node extraction for extracting association nodes, which are nodes semantically associated, from the nodes; association node joint for transforming the graph by joint of a part of or a whole of the association nodes; and characteristic structure extraction for extracting the characteristic structure from the transformed graph.
32. A data processing device,
expressing a dependent relationship between words in a text by a first type of branch; expressing a relationship of semantic similarity between words by a second type of branch; and determining a characteristic part of the text by analysis of a graph structure including the first type of branch and the second type of branch, distinguishing the first and the second type of branches.
33. A data processing means for:
expressing a dependent relationship between words in a text by a first type of branch; expressing a relationship of semantic similarity between words by a second type of branch; and determining a characteristic part of the text by analysis of a graph structure including the first type of branch and the second type of branch, distinguishing the first and the second type of branches.
34. A data processing method by which a data processing device determines a characteristic part of a text by analysis of a dependency relationship between words in the text, the method comprising:
determining the characteristic part in the text, when there are a plurality of semantically similar words, by joint of dependency on the plurality of semantically similar words into any of the plurality of words.
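As an illustration of claims 32 to 34, the following Python sketch keeps dependency branches and similarity branches in one graph, distinguishes them by a `kind` field, and then redirects the dependencies of similar words onto a single representative word. The `Branch` type, the union-find grouping, and the choice of the alphabetically first word as representative are assumptions made for this example; the claims do not prescribe them.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Branch:
    source: str
    target: str
    kind: str  # "dependency" (first type) or "similarity" (second type)


def merge_similar(branches):
    """Join dependencies on similar words into one representative word."""
    parent = {}

    def find(word):
        # Union-find lookup with path halving.
        parent.setdefault(word, word)
        while parent[word] != word:
            parent[word] = parent[parent[word]]
            word = parent[word]
        return word

    # Only similarity branches decide which words are grouped together.
    for b in branches:
        if b.kind == "similarity":
            root_a, root_b = find(b.source), find(b.target)
            if root_a != root_b:
                keep, drop = sorted((root_a, root_b))
                parent[drop] = keep  # alphabetically first word is kept

    # Only dependency branches are rewritten and returned.
    merged = {
        (find(b.source), find(b.target))
        for b in branches
        if b.kind == "dependency"
    }
    return sorted(merged)


branches = [
    Branch("phone", "crashes", "dependency"),
    Branch("handset", "crashes", "dependency"),
    Branch("phone", "heavy", "dependency"),
    Branch("phone", "handset", "similarity"),
]

merged_edges = merge_similar(branches)
```

In this example the two dependencies on "crashes" collapse into one edge from the representative word, so a later frequency analysis would treat "phone" and "handset" as the same node.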
Specification