Document information processing apparatus and document information processing program
First Claim
1. A document information processing apparatus comprising:
a document input and output section that is able to at least input or output a document as image data, said document input and output section including at least one of an image reader that scans a document and generates image data from the scanned document or an image communication section that sends and receives image data of a document via fax or e-mail;
an operation section that designates a file format of image data generated by said document input and output section;
a format conversion section that converts the image data into the file format designated by the operation section;
an operation timing detection section that detects predetermined operation timing for said document;
a metadata acquisition section that acquires metadata for searching or editing of said document based on said operation timing;
a metadata description section that describes said metadata in a predetermined format based on instance data of said document at predetermined timing with respect to the input or output of said document;
a processing unit that controls the operation of said document input and output section, said operation timing detection section, said metadata acquisition section, and said metadata description section, wherein said processing unit controls said metadata description section such that the metadata acquired by said metadata acquisition section is described in a file converted by said format conversion section; and
an image analysis section that acquires layout information by analyzing an image file, performs the reading of text information in areas that are recognized as text or character areas by using an optical character reader, and uses the data obtained as metadata.
Abstract
A document information processing apparatus is provided that does not need to maintain managerial consistency between document instances and their metadata. Because no inconsistency in management can arise, the system is spared the load that enforcing such consistency would otherwise impose, and versatility is improved. The apparatus includes a document input and output section that is able to at least input or output a document as image data, an operation timing detection section that detects predetermined operation timing for the document, a metadata acquisition section that acquires metadata of the document based on the operation timing, and a metadata description section that describes the metadata in a predetermined format based on instance data of the document at predetermined timing with respect to the input or output of the document.
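As a rough illustration of why describing metadata inside the converted file removes the need for separate consistency management, the following Python sketch appends an XML metadata record to the document's own bytes. It is only one possible reading of the abstract, not the patented implementation: the `MARKER` delimiter, the function names, the XML element names, and the use of a SHA-256 digest to tie the record to the instance data are all assumptions made for this example.

```python
import hashlib
import xml.etree.ElementTree as ET
from datetime import datetime, timezone

# Hypothetical delimiter separating instance data from the metadata record.
MARKER = b"\n%%METADATA%%\n"


def describe_metadata(instance: bytes, operation: str,
                      file_format: str, when: datetime) -> bytes:
    """Serialize metadata as XML, derived in part from the instance data."""
    root = ET.Element("metadata")
    ET.SubElement(root, "operation").text = operation      # e.g. "scan", "fax"
    ET.SubElement(root, "format").text = file_format       # designated format
    ET.SubElement(root, "timestamp").text = when.isoformat()
    # Assumption: a digest stands in for "based on instance data".
    ET.SubElement(root, "instanceDigest").text = hashlib.sha256(instance).hexdigest()
    return ET.tostring(root, encoding="utf-8")


def embed(instance: bytes, metadata_xml: bytes) -> bytes:
    """Store the metadata in the same file as the document instance."""
    return instance + MARKER + metadata_xml


def extract(blob: bytes):
    """Split a file back into instance data and its parsed metadata."""
    instance, _, metadata_xml = blob.rpartition(MARKER)
    return instance, ET.fromstring(metadata_xml)


if __name__ == "__main__":
    image = b"fake image bytes"
    now = datetime.now(timezone.utc)
    blob = embed(image, describe_metadata(image, "scan", "pdf", now))
    instance, meta = extract(blob)
    print(meta.findtext("operation"), meta.findtext("timestamp"))
```

Because the record travels with the file, copying or moving the file moves its metadata too, which is the property the abstract attributes to the apparatus.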
20 Claims
1. A document information processing apparatus, as recited in full under First Claim above.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11
12. A computer readable medium storing a document information processing program to make a computer execute:
designating a file format of image data;
converting the image data into the file format designated by the designating step;
detecting the timing of an operation performed on at least one of an input and an output of a document as image data;
acquiring metadata for searching or editing of said document based on said operation timing;
describing said metadata in a predetermined format based on instance data of said document at predetermined timing with respect to the input or output of said document; and
controlling the operation of said designating step, said converting step, said detecting step, said acquiring step and said describing step, wherein the describing step is controlled such that the metadata acquired in said acquiring step is described in a file converted in said converting step; and
acquiring layout information by analyzing an image file, performing reading of text information in areas that are recognized as text or character areas by using an optical character reader, and using the data obtained as metadata.
Dependent claims: 13, 14, 15, 16, 17, 18, 19, 20
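The final step of the claim (layout analysis, then character reading only in areas recognized as text, with the result used as metadata) can be sketched as follows. This is a minimal illustration under stated assumptions: the `Region` type, the `kind` labels, and the `text_metadata` function are hypothetical, and the OCR engine is passed in as a plain callable rather than tied to any real library.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Iterable, List, Tuple


@dataclass(frozen=True)
class Region:
    """One area found by a (hypothetical) layout analyser."""
    kind: str                        # "text" or "picture"
    bbox: Tuple[int, int, int, int]  # left, top, right, bottom in pixels
    pixels: bytes                    # raw image data for this area


def text_metadata(regions: Iterable[Region],
                  ocr: Callable[[bytes], str]) -> List[Dict]:
    """Run OCR only on text areas and return the results as metadata entries."""
    return [
        {"bbox": region.bbox, "text": ocr(region.pixels).strip()}
        for region in regions
        if region.kind == "text"     # picture areas are skipped
    ]


if __name__ == "__main__":
    # Stand-in OCR for demonstration: treats the bytes as ASCII text.
    fake_ocr = lambda data: data.decode("ascii")
    page = [
        Region("text", (0, 0, 200, 40), b" Quarterly report "),
        Region("picture", (0, 50, 200, 300), b"\x00\x01\x02"),
    ]
    print(text_metadata(page, fake_ocr))
```

Passing the OCR engine as a parameter keeps the sketch independent of any particular character reader; a real system would supply its own recognizer and layout analyser.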
Specification