Document management system
First Claim
1. A document management system, comprising:
- an image input unit to input a document as an electronic image;
a character extraction unit to extract character information from the input electronic image;
a word extraction unit to extract words from the character information;
a document search unit to normalize the extracted words, to register the normalized words in an index, and to search electronic images using the index;
an attribute information generation unit to generate attribute information of the input electronic image, where the attribute information includes the extracted words, positions and sizes of the extracted words in the input electronic image, and the normalized words referring to the positions and sizes of corresponding extracted words;
a search condition input unit to input a search keyword for use by the document search unit when searching for a target electronic image; and
a word highlighting unit to highlight the search keyword in the target electronic image found by the document search unit based on the attribute information.
1 Assignment
0 Petitions
Accused Products
Abstract
A document management system includes an image input unit that inputs a document as an electronic image; a character extraction unit that extracts character information from the input electronic image; a word extraction unit that extracts words from the character information; a document search unit that normalizes the extracted words, registers the normalized words in an index, and searches electronic images using the index; an attribute information generation unit that generates attribute information including the extracted words, positions and sizes of the extracted words, and the normalized words referring to the positions and sizes of corresponding extracted words; a search condition input unit that inputs a search keyword that is used by the document search unit when searching for a target electronic image; and a word highlighting unit that highlights the search keyword in the target electronic image found by the document search unit based on the attribute information.
26 Citations
5 Claims
-
1. A document management system, comprising:
-
an image input unit to input a document as an electronic image; a character extraction unit to extract character information from the input electronic image; a word extraction unit to extract words from the character information; a document search unit to normalize the extracted words, to register the normalized words in an index, and to search electronic images using the index; an attribute information generation unit to generate attribute information of the input electronic image, where the attribute information includes the extracted words, positions and sizes of the extracted words in the input electronic image, and the normalized words referring to the positions and sizes of corresponding extracted words; a search condition input unit to input a search keyword for use by the document search unit when searching for a target electronic image; and a word highlighting unit to highlight the search keyword in the target electronic image found by the document search unit based on the attribute information. - View Dependent Claims (2, 3, 4, 5)
-
Specification