Apparatus, system, and method for improved portable document format (“PDF”) document archiving
First Claim
1. A method for improved Portable Document Format (“PDF”) document archiving, the method comprising:
scanning by use of a processor, a source Portable Document Format (“PDF”) document for a shared resource, the source PDF document comprising a plurality of records, the shared resource comprising a common resource referenced by way of a resource pointer associated with a record of the source PDF document;
copying the shared resource to a resource group associated with the source PDF document;
short-circuiting a link between content for the shared resource and the resource pointer in each record that points to the shared resource, wherein short-circuiting a link further comprises modifying the resource pointer to point to the copied shared resource in the resource group and wherein short-circuiting a link between the shared resource and the resource pointer further comprises removing content for the shared resource from the source PDF document; and
extracting a record from the source PDF document, the extracted record void of the content for the shared resource in response to the short-circuited link.
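The steps recited in the claim can be illustrated with a minimal sketch. It is not the patented implementation and does not use a real PDF library: the document is modeled as a plain dictionary, and the names `scan_shared_resources`, `short_circuit`, `extract_record`, the `("document", key)` / `("group", key)` pointer tuples, and the sample data are all hypothetical. The sketch also assumes that "shared" means referenced by two or more records.

```python
from copy import deepcopy


def scan_shared_resources(document):
    """Return names of resources referenced by two or more records
    (an assumed reading of 'shared resource')."""
    counts = {}
    for record in document["records"]:
        for name in record["pointers"]:
            counts[name] = counts.get(name, 0) + 1
    return sorted(name for name, n in counts.items() if n >= 2)


def short_circuit(document):
    """Copy shared resources to a resource group, redirect pointers,
    and remove the shared content from the document.

    Returns (stripped_document, resource_group); the input is not modified.
    """
    doc = deepcopy(document)
    resource_group = {}
    for name in scan_shared_resources(doc):
        # Copy the shared resource to the resource group.
        resource_group[name] = doc["resources"][name]
        # Redirect every pointer that targets this resource.
        for record in doc["records"]:
            if name in record["pointers"]:
                record["pointers"][name] = ("group", name)
        # Remove the resource content from the source document.
        del doc["resources"][name]
    return doc, resource_group


def extract_record(doc, index):
    """Extract one record, embedding only resources still stored in the document."""
    record = deepcopy(doc["records"][index])
    record["embedded"] = {
        name: doc["resources"][key]
        for name, (where, key) in record["pointers"].items()
        if where == "document"
    }
    return record


# Hypothetical two-record document: font "F1" is shared, "logo" is not.
source = {
    "resources": {"F1": b"font-bytes", "logo": b"image-bytes"},
    "records": [
        {"id": 0, "pointers": {"F1": ("document", "F1"),
                               "logo": ("document", "logo")}},
        {"id": 1, "pointers": {"F1": ("document", "F1")}},
    ],
}
stripped, group = short_circuit(source)
first = extract_record(stripped, 0)
second = extract_record(stripped, 1)
```

In this toy model the extracted records carry no copy of the shared font; each keeps only a pointer into the resource group, while the unshared image still travels with the record that uses it.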
Abstract
An apparatus, system, and method are disclosed for improved Portable Document Format (“PDF”) document archiving. The method includes scanning a source PDF document for a shared resource. The source PDF document includes a plurality of records. The shared resource includes a common resource referenced by way of a resource pointer associated with a record of the source PDF document. The method includes copying the shared resource to a resource group associated with the source PDF document. The method also includes short-circuiting a link between content for the shared resource and the resource pointer in each record that points to the shared resource. The method includes extracting a record from the source PDF document. The extracted record is void of content for the shared resource in response to the short-circuited link. Thus, records may be stored in a standalone format without excessive storage space requirements.
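The storage claim in the last sentence can be made concrete with a small worked calculation. The figures below (1,000 records of 2,000 bytes each and one 50,000-byte shared resource) are hypothetical, and the function `archive_size` is an illustrative helper, not something defined by the patent.

```python
def archive_size(num_records, record_bytes, resource_bytes, short_circuited):
    """Total bytes needed to store every record as a standalone item."""
    if short_circuited:
        # One copy of the shared resource lives in the resource group.
        return num_records * record_bytes + resource_bytes
    # Each extracted record carries its own copy of the shared resource.
    return num_records * (record_bytes + resource_bytes)


# Hypothetical figures chosen only for illustration.
embedded = archive_size(1000, 2_000, 50_000, short_circuited=False)
grouped = archive_size(1000, 2_000, 50_000, short_circuited=True)
```

Under these assumed sizes, embedding the resource in every record costs 52,000,000 bytes, while keeping a single copy in the resource group costs 2,050,000 bytes.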
16 Claims
1. A method for improved Portable Document Format (“PDF”) document archiving, the method comprising:
scanning by use of a processor, a source Portable Document Format (“PDF”) document for a shared resource, the source PDF document comprising a plurality of records, the shared resource comprising a common resource referenced by way of a resource pointer associated with a record of the source PDF document;
copying the shared resource to a resource group associated with the source PDF document;
short-circuiting a link between content for the shared resource and the resource pointer in each record that points to the shared resource, wherein short-circuiting a link further comprises modifying the resource pointer to point to the copied shared resource in the resource group and wherein short-circuiting a link between the shared resource and the resource pointer further comprises removing content for the shared resource from the source PDF document; and
extracting a record from the source PDF document, the extracted record void of the content for the shared resource in response to the short-circuited link.
Dependent claims: 2, 3, 4, 5, 6, 7

8. An apparatus for improved Portable Document Format (“PDF”) document archiving, the apparatus comprising:
a non-transitory computer readable storage medium storing computer readable program code executable by a processor, the computer readable program code comprising:
a scanning module configured to scan a source Portable Document Format (“PDF”) document for a shared resource, the source PDF document comprising a plurality of records, the shared resource comprising a common resource referenced by way of a resource pointer associated with a record of the source PDF document;
a copying module configured to copy the shared resource to a resource group associated with the source PDF document;
a short-circuiting module configured to short-circuit a link between the shared resource and the resource pointer in each record that points to the shared resource, wherein the short-circuiting module further comprises a modification module configured to modify the resource pointer to point to the copied shared resource in the resource group and further comprises a removal module configured to remove content for the shared resource from the source PDF document; and
an extraction module configured to extract a record from the source PDF document, the extracted record void of content for the shared resource in response to the short-circuited link.
Dependent claims: 9, 10, 11, 12

13. A computer program product comprising a non-transitory computer readable storage medium having computer usable program code executable by a processor to perform operations for improved Portable Document Format (“PDF”) document archiving, the operations of the computer program product comprising:
scanning a source Portable Document Format (“PDF”) document for a shared resource, the source PDF document comprising a plurality of records, the shared resource comprising a common resource referenced by way of a resource pointer associated with a record of the source PDF document;
copying the shared resource to a resource group associated with the source PDF document;
modifying the resource pointer to point to the copied shared resource in the resource group such that a link between the shared resource and the resource pointer in each record that points to the shared resource is short-circuited;
removing content for the shared resource from the source PDF document; and
extracting a record from the source PDF document using a PDF Application Programming Interface (“API”), the extracted record void of content for the shared resource in response to the short-circuited link.
Dependent claims: 14, 15, 16
Specification