DOCUMENT CLASSIFICATION SYSTEM, DOCUMENT CLASSIFICATION METHOD, AND DOCUMENT CLASSIFICATION PROGRAM
First Claim
1. A document classification system that acquires digital information recorded in a plurality of computers or servers, analyzes document information included in the acquired digital information, and classifies the document information so as to be readily used in a lawsuit, comprising:
- an extraction unit that extracts a document group, which is a data set including a predetermined number of documents, from the document information;
a classification code receiving unit that receives a classification code;
a selection unit that classifies the extracted document group for each classification code on the basis of the classification code, analyzes a keyword which commonly appears in the classified document group, and selects the keyword;
a search unit that searches for the keyword;
a score calculation unit that calculates a score indicating connection between the classification code and the document, using the search result of the search unit and the analysis result of the selection unit; and
an automatic classification unit that automatically assigns the classification code on the basis of the result of the score.
2 Assignments
0 Petitions
Accused Products
Abstract
A document classification system is provided. The document classification system analyzes digital document information which is collected to be submitted as evidence in a lawsuit and classifies the digital document information. The document classification system includes an extraction unit that extracts documents from the collected document information, a document display unit that displays an extracted document group, a classification code receiving unit that receives a classification code assigned to the displayed document group, a selection unit that classifies the extracted document group for each classification code, analyzes a keyword commonly appearing in the classified document group, and selects the keyword, a database that records the selected keyword, a search unit that searches for the keyword from the document information, a score calculation unit that calculates a score indicating connection between the classification code and the document, and an automatic classification unit that automatically assigns the classification code.
7 Citations
13 Claims
-
1. A document classification system that acquires digital information recorded in a plurality of computers or servers, analyzes document information included in the acquired digital information, and classifies the document information so as to be readily used in a lawsuit, comprising:
-
an extraction unit that extracts a document group, which is a data set including a predetermined number of documents, from the document information; a classification code receiving unit that receives a classification code; a selection unit that classifies the extracted document group for each classification code on the basis of the classification code, analyzes a keyword which commonly appears in the classified document group, and selects the keyword; a search unit that searches for the keyword; a score calculation unit that calculates a score indicating connection between the classification code and the document, using the search result of the search unit and the analysis result of the selection unit; and an automatic classification unit that automatically assigns the classification code on the basis of the result of the score. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 12, 13)
-
-
10. A document classification method that is performed in a document classification system which acquires digital information recorded in a plurality of computers or servers, analyzes document information included in the acquired digital information, and classifies the document information so as to be readily used in a lawsuit, the document classification method comprising:
-
extracting a document group, which is a data set including a predetermined number of documents, from the document information; displaying the extracted document group on a screen; receiving a classification code which is assigned to the displayed document group by a user on the basis of connection to the lawsuit; classifying the extracted document group for each classification code on the basis of the classification code, analyzing a keyword which commonly appears in the classified document group, and selecting the keyword; recording the selected keyword; searching for the recorded keyword from the document information; calculating a score indicating connection between the classification code and the document, using the search result and the analysis result; and automatically assigning the classification code on the basis of the result of the score.
-
-
11. A non-transitory recording medium recording therein a document classification program, which when executed by at least one processor, causes the processor to perform a plurality functions to acquire digital information recorded in a plurality of computers or servers, analyze document information included in the acquired digital information, and classify the document information so as to be readily used in a lawsuit, the plurality of functions comprising:
-
a function of extracting a document group, which is a data set including a predetermined number of documents, from the document information; a function of displaying the extracted document group on a screen; a function of receiving a classification code which is assigned to the displayed document group by a user on the basis of connection to the lawsuit; a function of classifying the extracted document group for each classification code on the basis of the classification code, analyzing a keyword which commonly appears in the classified document group, and selecting the keyword; a function of recording the selected keyword; a function of searching for the recorded keyword from the document information; a function of calculating a score indicating connection between the classification code and the document, using the search result and the analysis result; and a function of automatically assigning the classification code on the basis of the result of the score.
-
Specification