DOCUMENT ANALYSIS APPARATUS AND PROGRAM
First Claim
1. A document analysis apparatus comprising:
- a document storage unit which stores a plurality of documents each of which includes a text formed from a plurality of words, has a plurality of attributes, and includes attribute values of the attributes;
a pattern storage unit which stores a plurality of patterns each representing presence/absence of a correlation between a word and each of at least two attributes out of the plurality of attributes;
an acquisition unit which acquires a plurality of words by analyzing the text included in each of the plurality of documents stored in the document storage unit;
a first determination unit which determines, for each of the acquired words, the presence/absence of the correlation between the word and at least two attributes designated by a user out of the plurality of attributes of the plurality of documents stored in the document storage unit;
a second determination unit which determines whether a determination result by the first determination unit matches a pattern designated by the user out of the plurality of patterns stored in the pattern storage unit; and
a presentation unit which presents a word whose determination result by the first determination unit is determined to match the pattern designated by the user.
1 Assignment
0 Petitions
Accused Products
Abstract
A document analysis apparatus according to an embodiment an acquisition unit acquires a plurality of words by analyzing a text included in each of a plurality of documents stored in a document storage unit. A first determination unit determines, for each of the acquired words, the presence/absence of a correlation between the word and at least two attributes designated by a user out of a plurality of attributes of the plurality of documents stored in the document storage unit. A second determination unit determines whether a determination result by the first determination unit matches a pattern designated by the user out of a plurality of patterns stored in a pattern storage unit. A presentation unit presents a word whose determination result by the first determination unit is determined to match the pattern designated by the user.
-
Citations
6 Claims
-
1. A document analysis apparatus comprising:
-
a document storage unit which stores a plurality of documents each of which includes a text formed from a plurality of words, has a plurality of attributes, and includes attribute values of the attributes; a pattern storage unit which stores a plurality of patterns each representing presence/absence of a correlation between a word and each of at least two attributes out of the plurality of attributes; an acquisition unit which acquires a plurality of words by analyzing the text included in each of the plurality of documents stored in the document storage unit; a first determination unit which determines, for each of the acquired words, the presence/absence of the correlation between the word and at least two attributes designated by a user out of the plurality of attributes of the plurality of documents stored in the document storage unit; a second determination unit which determines whether a determination result by the first determination unit matches a pattern designated by the user out of the plurality of patterns stored in the pattern storage unit; and a presentation unit which presents a word whose determination result by the first determination unit is determined to match the pattern designated by the user. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A program stored in a non-transitory computer-readable storage medium, the program being executed by a computer of a document analysis apparatus including a document storage unit which stores a plurality of documents each of which includes a text formed from a plurality of words, has a plurality of attributes, and includes attribute values of the attributes, and a pattern storage unit which stores a plurality of patterns each representing presence/absence of a correlation between a word and each of at least two attributes out of the plurality of attributes, the program causing the computer to execute an analysis method, the analysis method comprising:
-
acquiring a plurality of words by analyzing the text included in each of the plurality of documents stored in the document storage unit; determining, for each of the acquired words, the presence/absence of the correlation between the word and at least two attributes designated by a user out of the plurality of attributes of the plurality of documents stored in the document storage unit; determining whether a determination result matches a pattern designated by the user out of the plurality of patterns stored in the pattern storage unit; and presenting a word whose determination result is determined to match the pattern designated by the user.
-
Specification