Text Mining Device, Method Thereof, and Program
First Claim
1. A text mining apparatus comprising:
- means for generating a sentence structure from an input document;
means for generating a similar structure of patterns having a similar meaning of a partial structure of the sentence structure by performing predetermined conversion operation, including at least change in connection of branches in a graph structure, of the partial structure; and
means for determining the patterns having the similar meaning as the identical pattern and detecting the pattern.
2 Assignments
0 Petitions
Accused Products
Abstract
Language analysis means 21 analyzes texts read from a text DB 11, and generates a sentence structure as the analysis result. Similar-structure generation adjustment means 25 generates, from an input of an input device, a determination item for determining whether or not the structures are identical every type of differences between the sentence structures. Similar-structure determination adjustment means 26 generates, from an input of the input device 6, a determination item for determining whether or not the difference between attribute values is ignored every type of attribute values. Similar-structure generating means 22 generates a similar structure of a partial structure forming the sentence structure obtained by language analysis means 21 in accordance with the determination item from the similar-structure generation adjustment means 25, and sets the generated similar structure as an equivalent class of the partial structure on the generation source. Frequent-similar-pattern detection means 24 ignores the attribute value in accordance with the determination item given from the similar-structure determination adjustment means 26, detects the frequent pattern on the basis of a set of equivalent classes from the similar-structure generating means 22, and outputs the frequent pattern to an output device 3.
25 Citations
25 Claims
-
1. A text mining apparatus comprising:
-
means for generating a sentence structure from an input document;
means for generating a similar structure of patterns having a similar meaning of a partial structure of the sentence structure by performing predetermined conversion operation, including at least change in connection of branches in a graph structure, of the partial structure; and
means for determining the patterns having the similar meaning as the identical pattern and detecting the pattern. - View Dependent Claims (2, 3)
-
-
4. A text mining apparatus comprising:
-
a storage unit that stores a set of documents as a text mining object;
an analyzing unit that reads and analyzes the document from the storage unit and obtains the sentence structure;
a similar-structure generating unit that performs predetermined modification operation, including at least change in connection of branches in a graph structure, of the partial structure of the sentence structure obtained by the analysis of the analyzing unit, and generates a similar structure of patterns having a similar meaning; and
a pattern detecting unit that uses the similar structure generated by the similar-structure generating unit as an equivalent class of the partial structure on the generation source, and detects the pattern. - View Dependent Claims (5, 6, 7)
-
-
8. A text mining apparatus comprising:
-
a storage unit that stores a set of documents as a text mining object;
an analyzing unit that reads and analyzes the document from the storage unit and obtains a sentence structure;
a similar-structure generation adjustment unit that generates a first determination item for determining, from a user input, whether or not the structures are identical one every type of differences between the sentence structures;
a similar-structure determination adjustment unit that generates a second determination item for determining, from a user input, whether or not the structures are identical ones every type of differences between attribute values;
a similar-structure generating unit that performs predetermined conversion operation of a partial structure of the sentence structure obtained by the analyzing unit in accordance with the first determination item generated by the similar-structure generation adjustment unit and generates similar structures having a similar meaning of the partial structure; and
a similar-pattern detecting unit that uses the similar structure generated by the similar-structure generating unit as an equivalent class of the partial structure on the generation source and detects the frequent pattern by ignoring the difference between the attribute values in accordance with the second determination item of the similar-structure determination adjustment unit. - View Dependent Claims (9, 10, 11)
-
-
12. A text mining method comprising:
-
a step of generating a sentence structure from an input document;
a step of generating a similar structure of patterns having a similar meaning of a partial structure of the sentence structure by performing predetermined conversion operation, including at least change in connection of branches in a graph structure, of the partial structure; and
a step of determining the patterns having the similar meaning as the identical pattern and detecting the pattern. - View Dependent Claims (13, 14)
-
-
15. A text mining method comprising:
-
a step of analyzing the document from a storage unit that stores a set of documents as a text mining object and obtaining a sentence structure;
a step of performing predetermined modification operation, including at least change in connection of branches in a graph structure of a partial structure of the sentence structure and generating a similar structure having patterns with a similar meaning;
a step of using the generated similar structures as an equivalent class of the partial structure on the generation source and detecting the pattern. - View Dependent Claims (16, 17, 18)
-
-
19. A text mining method comprising:
-
a step of analyzing a document from a storage unit that stores a set of documents as a text mining object and obtaining the sentence structure;
a step of generating, from a user input, a first determination item for determining whether or not the structures are identical ones every type of differences between sentence structures;
a step of generating, from a user input, a second determination item for determining whether or not the structures are identical ones every type of differences between attribute values;
a step of performing predetermined modification operation of the partial structure of the sentence structure obtained by the analyzing unit and generating a similar structure having a similar meaning of the partial structure in accordance with the generated first determination item; and
a step of using the generated similar structure as an equivalent class of the partial structure on the generation source and detecting the pattern by ignoring the difference between the attribute values in accordance with the second determination item. - View Dependent Claims (20, 21, 22)
-
-
23. A program for enabling a computer forming a text mining apparatus to execute:
-
processing for analyzing a document in a storage unit that stores a set of documents as a text mining object and obtaining a sentence structure;
processing for performing predetermined conversion operation of a partial structure of the sentence structure and generating a similar structure having a similar meaning of the partial structure, including at least change in connection of branches in a graph structure, and processing for using the generated similar structure as an equivalent class of the partial structure on the generation source and detecting a predetermined pattern.
-
-
24. A program for enabling a computer forming a text mining apparatus to execute:
-
processing for analyzing a document in a storage unit that stores a set of documents as a text mining object and obtaining a sentence structure;
processing for performing predetermined conversion operation, including at least change in connection of branches in a graph structure, to a similar structure of the sentence structure and generating the similar structure of patterns having a similar meaning of the partial structure; and
processing for using the generated similar structure as an equivalent class of the partial structure on the generation source and detecting a pattern by ignoring the difference between attribute values.
-
-
25. A program for enabling a computer forming a text mining apparatus to execute:
-
processing for analyzing a document in a storage unit that stores a set of documents as a text mining object and obtaining a sentence structure;
processing for generating, from a user input, a first determination item for determining whether or not structures are identical ones every type of differences between the sentence structure and a second determination item for determining whether or not structures are identical ones every type of differences between the attribute values; and
processing for performing predetermined conversion operation of a partial structure of the sentence structure in accordance with the first determination item for determining whether the structures are identical ones every type of differences between the sentence structures and generating the similar structure of the patterns having the similar meaning; and
processing for using the generated similar structure as an equivalent class of the partial structure on the generation source and detecting the frequent pattern in accordance with the second determination item for determining whether or not the structures are identical ones by ignoring the difference between the attribute values every type of differences between the attribute values.
-
Specification