Document information processing method, document information processing apparatus, communication system and memory product
First Claim
1. A document information processing method for processing document information containing character information, comprising the steps of generating intermediate information containing same character information as in the document information, based on the document information;
- extracting word information representing words from the document information or the intermediate information; and
generating summary information by adding the extracted word information to the intermediate information.
1 Assignment
0 Petitions
Accused Products
Abstract
In a document information processing apparatus, intermediate information, which contains the same character information as in document information created by a document creation application and is used for reduction of the amount of the document information, is generated based on the document information, word information contained in the document information or in the intermediate information is extracted, and summary information is generated by adding the extracted word information to the intermediate information which was subjected to a reduction of amount of information according to the need. The generated summary information not only has a small data volume but also contains all the word information, and is therefore usable for a searching process using character information, such as full-text searching.
26 Citations
20 Claims
-
1. A document information processing method for processing document information containing character information, comprising the steps of
generating intermediate information containing same character information as in the document information, based on the document information; -
extracting word information representing words from the document information or the intermediate information; and
generating summary information by adding the extracted word information to the intermediate information.
-
-
2. A document information processing apparatus for processing document information containing character information, comprising:
-
a first generating unit for generating intermediate information containing same character information as in the document information, based on the document information;
an extracting unit for extracting word information representing words from the character information contained in the document information or in the generated intermediate information; and
a second generating unit for generating summary information by adding the extracted word information to the intermediate information. - View Dependent Claims (3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 14, 16, 18)
-
-
13. A document information processing apparatus for processing document information containing character information, comprising:
-
a fourth generating unit for generating image information by irreversibly compressing the document information;
an extracting unit for extracting word information representing words from the character information contained in the document information; and
a fifth generating unit for generating summary information by adding the extracted word information to the generated image information. - View Dependent Claims (15, 17, 19)
-
-
20. A computer readable memory product storing a computer program for causing a computer to process document information containing character information,
wherein said memory product stores a computer program comprising the steps of: -
causing the computer to generate intermediate information containing same character information as in the document information, based on the document information;
causing the computer to extract word information representing words from the document information or the intermediate information; and
causing the computer to generate summary information by adding the extracted word information to the intermediate information.
-
Specification