Sentence classification device and method
1 Assignment
0 Petitions
Abstract
A DT matrix generation means (11) generates a DT matrix (11A) from each document (D) in a document set (21) and each term (T) in a term list (22). A DT matrix transformation means (12) generates a transformed DT matrix (11B) by performing DM decomposition of the DT matrix (11A). A document classification means (13) extracts and outputs, for each cluster appearing on the transformed DT matrix (11B), each document (D) belonging to the cluster as one classification (subset).
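The three stages named in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The function names (`build_dt_matrix`, `maximum_matching`, `diagonal_blocks`) and the sample documents and terms are hypothetical, and the rule that a term relates to a document when it occurs as a substring is an assumption. The transformation step covers only the block-forming part of a DM (Dulmage-Mendelsohn) decomposition: a maximum bipartite matching followed by strongly connected components over the matched rows. It omits the coarse split into over- and under-determined parts, so it matches a full DM decomposition only when every document and every term is matched. Unmatched documents are returned separately.

```python
def build_dt_matrix(documents, terms):
    """Rows are documents, columns are terms; 1 marks a related pair.
    Assumption: a term relates to a document if it occurs as a substring."""
    return [[1 if term in doc.lower() else 0 for term in terms]
            for doc in documents]


def maximum_matching(matrix):
    """Kuhn's augmenting-path matching on the document-term bipartite graph.
    Returns row_of[c], the row matched to column c (-1 if unmatched)."""
    n_cols = len(matrix[0])
    row_of = [-1] * n_cols

    def augment(r, seen):
        for c in range(n_cols):
            if matrix[r][c] and c not in seen:
                seen.add(c)
                if row_of[c] == -1 or augment(row_of[c], seen):
                    row_of[c] = r
                    return True
        return False

    for r in range(len(matrix)):
        augment(r, set())
    return row_of


def diagonal_blocks(matrix):
    """Simplified block step: strongly connected components (Kosaraju)
    of the graph r -> s, where row r has an entry in the column matched to s."""
    row_of = maximum_matching(matrix)
    matched = {r for r in row_of if r != -1}
    succ = {r: [row_of[c] for c in range(len(row_of))
                if matrix[r][c] and row_of[c] not in (-1, r)]
            for r in matched}

    order, seen = [], set()

    def visit(r):  # first pass: record finishing order
        seen.add(r)
        for s in succ[r]:
            if s not in seen:
                visit(s)
        order.append(r)

    for r in sorted(succ):
        if r not in seen:
            visit(r)

    pred = {r: [] for r in succ}  # reversed graph
    for r, targets in succ.items():
        for s in targets:
            pred[s].append(r)

    blocks, assigned = [], set()
    for r in reversed(order):  # second pass: collect components
        if r in assigned:
            continue
        stack, block = [r], []
        assigned.add(r)
        while stack:
            u = stack.pop()
            block.append(u)
            for v in pred[u]:
                if v not in assigned:
                    assigned.add(v)
                    stack.append(v)
        blocks.append(sorted(block))
    unmatched = [r for r in range(len(matrix)) if r not in matched]
    return blocks, unmatched


# Hypothetical sample data, for illustration only.
documents = [
    "Router firmware update fixes wifi dropouts",
    "wifi dropouts after firmware update",
    "Invoice shows wrong tax amount",
    "tax amount missing from invoice",
]
terms = ["firmware", "wifi", "invoice", "tax amount"]

blocks, unmatched = diagonal_blocks(build_dt_matrix(documents, terms))
# Each block of document indices is output as one classification (subset).
classifications = [[documents[i] for i in block] for block in blocks]
```

With this sample data, the two firmware documents and the two invoice documents fall into separate blocks, corresponding to the clusters that the document classification means (13) outputs as subsets.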
53 Citations
18 Claims
1. A sentence classification device characterized by comprising:
a term list having a plurality of terms each comprising not less than one word;
DT matrix generation means for generating a DT matrix two-dimensionally expressing a relationship between each document contained in a document set and said each term;
DT matrix transformation means for generating a transformed DT matrix having clusters having blocks of associated documents by transforming the DT matrix obtained by said DT matrix generation means on the basis of a DM decomposition method used in a graph theory; and
classification generation means for generating classifications associated with the document set on the basis of a relationship between each cluster on the transformed DT matrix obtained by said DT matrix transformation means and said each document classified according to the clusters.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
10. A sentence classification method characterized by comprising:
the DT matrix generation step of generating a DT matrix two-dimensionally expressing a relationship between each document contained in a document set and each term of a term list having a plurality of terms each comprising not less than one word;
the DT matrix transformation step of generating a transformed DT matrix having clusters having blocks of associated documents by transforming the DT matrix on the basis of a DM decomposition method used in a graph theory; and
the classification generation step of generating classifications associated with the document set on the basis of a relationship between each cluster on the transformed DT matrix and said each document classified according to the clusters.
Dependent claims: 11, 12, 13, 14, 15, 16, 17, 18
Specification