Document classification apparatus
First Claim
1. A document classification apparatus comprising:
- a 5W1H keyword extraction means for extracting keywords, each of which has a 5W1H attribute, from a document entered through an input apparatus;
a classification key setting means for setting a 5W1H attribute entered through the input apparatus as a classification key;
a classification item selection means for selecting keywords as classification items, each of which has the 5W1H attribute set by the classification key setting means as a classification key, out of keywords extracted by the 5W1H keyword extraction means; and
a document distribution means for distributing a document including one of the keywords which are selected as classification items by the classification item selection means and have the 5W1H attribute set as classification keys by the classification key setting means, into a cell corresponding to one of the classification items.
1 Assignment
0 Petitions
Accused Products
Abstract
The objective of the present invention is to provide a document classification apparatus, which utilizes extracted keywords from the bodies of documents to classify the documents. The document classification apparatus is mainly made up of the following apparatus:
A 5W1H keyword extraction apparatus that extracts keywords with 5W1H attributes in an entered document. The classification key setting apparatus that sets the 5W1H attributes entered by a user as classification keys. The classification item selection apparatus that selects keywords as classification items from the extracted keywords with the 5W1H attributes set as the classification keys. The document distribution apparatus that distributes a document including the previously selected classification items with the 5W1H attributes set as classification keys, into a cell corresponding to the classification items. Enabling documents to be classified from meaningful 5W1H viewpoints.
85 Citations
22 Claims
-
1. A document classification apparatus comprising:
-
a 5W1H keyword extraction means for extracting keywords, each of which has a 5W1H attribute, from a document entered through an input apparatus;
a classification key setting means for setting a 5W1H attribute entered through the input apparatus as a classification key;
a classification item selection means for selecting keywords as classification items, each of which has the 5W1H attribute set by the classification key setting means as a classification key, out of keywords extracted by the 5W1H keyword extraction means; and
a document distribution means for distributing a document including one of the keywords which are selected as classification items by the classification item selection means and have the 5W1H attribute set as classification keys by the classification key setting means, into a cell corresponding to one of the classification items. - View Dependent Claims (2, 3)
-
-
4. A document classification apparatus comprising:
-
a 5W1H keyword extraction means for extracting keywords with 5W1H attributes from a document entered through an input apparatus;
a classification key setting means for setting two 5W1H attributes entered through the input apparatus as a vertical line classification key and a horizontally axial classification key, respectively;
a vertical line classification item selection means for selecting keywords with 5W1H attributes as vertical line classification items, extracted by the 5W1H keyword extraction means and then set by the classification key setting means as vertical line classification keys;
a horizontally axial classification item selection means for selecting keywords with 5W1H attributes as horizontally axial classification items, extracted by the 5W1H keyword extraction means and then set by the classification key setting means as horizontally axial keys; and
a document distribution means for distributing a document that includes the keyword selected as vertical line classification item by the vertical line classification item selection means with the 5W1H attribute set as the vertical line classification key by the classification key setting means and further includes the keyword selected as horizontally axial classification item by the horizontally axial classification item selection means with the 5W1H attribute set as the horizontally axial classification key by the classification key setting means, into a cell located on the intersection of both the row corresponding to the vertical line classification items and the column corresponding to the horizontally axial classification items in a 2-dimensional matrix. - View Dependent Claims (5, 6, 7, 8, 9)
-
-
10. A document classification apparatus comprising:
-
a 5W1H keyword extraction means for extracting keywords, each of which has a 5W1H attribute, from a document entered through an input apparatus;
a classification key setting means for setting a 5W1H attribute entered through the input apparatus as a classification key;
a classification level setting means for setting a level entered through the input apparatus as a classification level;
a concept data base means for classifying words which may appear in a document as keywords into hierarchic layers in accordance with an upper or lower notion level to each word, assigning a different level to each hierarchic layer, and storing them;
a classification item selection means for selecting broader concept words of the keywords extracted by the 5W1H keyword extraction means as classification items, each of which has the 5W1H attribute set by the classification key setting means as a classification key, from words in the classification level set by the classification level setting means in the concept data base means; and
a document distribution means for distributing a document including one of the keywords, whose broader concepts contain one of the words selected as classification items by the classification item selection means in the level set by the classification level setting means in the concept data base means and which has the 5W1H attribute set as the classification key by the classification key setting means, into a cell corresponding to one of the classification items. - View Dependent Claims (11, 12)
-
-
13. A document classification apparatus comprising:
-
a 5W1H keyword extraction means for extracting keywords with 5W1H attributes from a document entered through an input apparatus;
a classification key setting means for setting two 5W1H attributes entered through the input apparatus as a vertical line classification key and a horizontally axial classification key, respectively;
a classification level setting means for setting two levels entered through the input apparatus as a vertical line classification level and a horizontally axial classification level, respectively;
a concept data base means for classifying words which may appear as keywords in a document into hierarchic layers in accordance with the upper or lower notion level to each word, assigning a different level to each hierarchic layer, and storing them;
a vertical line classification item selection means for selecting as vertical line classification item a broader concept of the keyword extracted by the 5W1H keyword extraction means and being with the 5W1H attributes set as vertical line classification keys by the classification key setting means, from words in a vertical line classification level set by the classification level setting means in the concept data base means;
a horizontally axial classification item selection means for selecting as horizontally axial classification item a broader concept of the keyword extracted by the 5W1H keyword extraction means and being with the 5W1H attributes set as horizontally axial classification keys by the classification key setting means, from words in a horizontally axial classification level set by the classification level setting means in the concept data base means; and
a document distribution means for distributing a document including a first keyword word with the 5W1H attribute set as the vertical line classification key by the classification key setting means and a second keyword with the 5W1H attribute set as the horizontally axial classification key by the classification key setting means, into a cell located on the intersection of both the row corresponding to the vertical line classification items and the column corresponding to the horizontally axial classification items in a 2-dimensional matrix;
broader concepts of the first keyword containing the word selected as the vertical line classification item by the vertical line classification item selection means in the vertical line level set by the classification level setting means in the concept data base means, and broader concepts of the second keyword containing the word selected as the horizontally axial classification item by the horizontally axial classification item selection means in the horizontally axial level set by the classification level setting means in the concept data base means.- View Dependent Claims (14, 15, 16, 17, 18)
-
-
19. A document classification method comprising the steps of:
-
extracting keywords, each of which has a 5W1H attribute, from a document entered through an input apparatus;
setting a 5W1H attribute entered through the input apparatus as a classification key;
selecting keywords as classification items, each of which has the 5W1H attribute set in the setting step as a classification key, out of keywords extracted in the extracting keyword step; and
distributing a document including one of the keywords which are selected as the classification items and have the 5W1H attribute set as the classification key, into a cell corresponding to one of the classification items.
-
-
20. A document classification method comprising the steps of:
-
extracting keywords, each of which has a 5W1H attribute, from a document entered through an input apparatus;
setting a 5W1H attribute entered through the input apparatus as a classification key;
setting a level entered through the input apparatus as a classification level;
classifying words which may appear as keywords in a document, into the upper or lower notion hierarchic layer to the words, assigning a specific level to each hierarchic layer, and storing them as a concept data base;
selecting upper notion category words as classification items, each of which has the 5W1H attribute set in the setting a 5W1H attribute step as a classification key, out of the keywords extracted from words in the classification level; and
distributing a document including one of the keywords with the 5W1H attribute set as the classification key into a cell corresponding to one of the classification items;
the one of the keywords corresponding to a word selected as the classification item in the level under the upper notion category in the concept data base.
-
-
21. A computer-readable medium, comprising:
-
a computer-readable data storage device;
a program stored on said device, said program causing a computer;
to extract keywords, each of which has a 5W1H attribute, from a document entered through an input apparatus;
to set a 5W1H attribute entered through the input apparatus as a classification key;
to select keywords as classification items, each of which has the 5W1H attribute set in the to set step as a classification key, out of keywords extracted in the to extract keywords step; and
to distribute a document including one of the keywords which are selected as the classification items and have the 5W1H attribute set as the classification key into a cell corresponding to one of the classification items.
-
-
22. A computer-readable medium, comprising:
-
a computer-readable data storage device;
a program stored on said device, said program causing a computer;
to extract keywords, each of which has a 5W1H attribute, from a document entered through an input apparatus;
to set a 5W1H attribute entered through the input apparatus as a classification key;
to set a level entered through the input apparatus as a classification level;
to classify words, which may appear in a document as a keyword, into hierarchic notion layers in accordance with the upper or lower notion to each word, to assign a different level to each hierarchic notion layer, and to store them as a concept data base;
to select broader concept words as classification items to the extracted keywords, each classification item has the 5W1H attribute set in the to set a 5W1H attribute step as a classification key, from words in the classification level in terms of the concept data base; and
to distribute a document including one of the keywords with the 5W1H attribute set as the classification key, into a cell corresponding to one of the classification items;
the one of the keywords corresponding to a word selected as the classification item in the level under the broader concept in terms of the concept data base.
-
Specification