IMAGE RETRIEVAL APPARATUS, METHOD FOR RETRIEVING IMAGE, AND CONTROL PROGRAM FOR IMAGE RETRIEVAL APPARATUS
First Claim
1. An apparatus comprising:
an inputting unit configured to input document image data;
a layout analysis unit configured to divide the document image data into a plurality of regions according to attribute thereof to generate layout information for the plurality of regions in a unit of a page;
a processing unit configured to classify each page of document image data into one of a plurality of groups, based on corresponding layout information generated by the layout analysis unit;
a specification unit configured to specify one of the plurality of groups according to which a user requests to retrieve one or more pages of document image data; and
a retrieval unit configured to retrieve one or more pages of document image data belonging to the group, which is specified by the specification unit, from among a plurality of pages of document image data.
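The units recited in claim 1 can be read as a small pipeline: per-page layout information, a classification into groups, a group specification, and a retrieval. The following is a minimal sketch of that flow, not the patented implementation. The names `Region`, `PageLayout`, `layout_key`, `classify`, and `retrieve` are hypothetical, the attribute labels are illustrative, and grouping pages by the count of regions per attribute is an assumption chosen for brevity; the claim does not say how groups are formed.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Region:
    """One region produced by layout analysis (hypothetical structure)."""
    attribute: str  # illustrative labels such as "text", "image", "table"
    x: int
    y: int
    width: int
    height: int


@dataclass(frozen=True)
class PageLayout:
    """Layout information for a single page."""
    page_id: str
    regions: tuple


def layout_key(layout):
    """Assumed grouping key: sorted (attribute, region count) pairs."""
    counts = defaultdict(int)
    for region in layout.regions:
        counts[region.attribute] += 1
    return tuple(sorted(counts.items()))


def classify(layouts):
    """Place every page into exactly one group, keyed by its layout key."""
    groups = defaultdict(list)
    for layout in layouts:
        groups[layout_key(layout)].append(layout.page_id)
    return dict(groups)


def retrieve(groups, specified_key):
    """Return the page ids belonging to the specified group."""
    return list(groups.get(specified_key, []))
```

In this sketch, a page with one text region and one image region lands in the same group as any other page with that composition, regardless of region order, and `retrieve` returns all pages in whichever group is specified.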
Abstract
An apparatus divides the document image of each page that is input and stored into a plurality of regions according to the image attributes contained in the document image, and generates layout analysis data for each region. The image data of each page is then classified into one of a plurality of clusters based on the analysis data. When the document image of a page is to be retrieved, representative layout images of each cluster are displayed. The user selects the representative layout image that is closest to the layout of the page that the user remembers and wants to retrieve. The cluster is thereby specified, and the image data of a page belonging to that cluster is retrieved and output.
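The cluster-and-select flow described in the abstract can be illustrated with a short sketch. The abstract does not name a clustering method, so the code below swaps in simple leader clustering as an assumption: each page is reduced to a vector of area shares per attribute, a page joins the first cluster whose representative lies within a distance threshold, and otherwise founds a new cluster with itself as the representative. The names `ATTRIBUTES`, `layout_vector`, `cluster_pages`, and `retrieve_by_representative`, the attribute set, and the default threshold are all hypothetical.

```python
import math

# Hypothetical attribute set; regions with other labels are not handled here.
ATTRIBUTES = ("text", "image", "table")


def layout_vector(regions, page_area):
    """Summarize a page as the share of page area covered by each attribute.

    regions is a list of (attribute, width, height) tuples.
    """
    shares = dict.fromkeys(ATTRIBUTES, 0.0)
    for attribute, width, height in regions:
        shares[attribute] += (width * height) / page_area
    return tuple(shares[a] for a in ATTRIBUTES)


def cluster_pages(vectors, threshold=0.25):
    """Leader clustering over a dict mapping page_id to layout vector.

    The first page of each cluster serves as its representative layout.
    """
    clusters = []
    for page_id, vector in vectors.items():
        for cluster in clusters:
            leader = vectors[cluster["representative"]]
            if math.dist(vector, leader) <= threshold:
                cluster["members"].append(page_id)
                break
        else:
            # No existing representative is close enough: start a new cluster.
            clusters.append({"representative": page_id, "members": [page_id]})
    return clusters


def retrieve_by_representative(clusters, representative_id):
    """Return the pages of the cluster whose representative the user selected."""
    for cluster in clusters:
        if cluster["representative"] == representative_id:
            return list(cluster["members"])
    return []
```

A user interface would display one thumbnail per `representative` and pass the selected id to `retrieve_by_representative`. Leader clustering depends on input order and on the threshold, so a production system would likely use a more robust method; it is used here only because it is short and deterministic.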
11 Claims
7. A method comprising:
inputting document image data;
dividing each page of the document image data into a plurality of regions according to image attribute thereof to generate layout information for the plurality of regions;
classifying each page of the document image data into one of a plurality of groups, based on corresponding layout information;
specifying one of the plurality of groups according to which a user requests to retrieve one or more pages of document image data; and
retrieving one or more pages of document image data belonging to the specified group, from among a plurality of pages of the document image data.
Specification