SYSTEM AND METHOD FOR RELATING UNSTRUCTURED DATA IN PORTABLE DOCUMENT FORMAT TO EXTERNAL STRUCTURED DATA
First Claim
1. A computer program product comprising computer readable instruction code executing in a tangible memory medium of a computer, said computer readable instruction code configured to:
- accept metadata input information that describes a pattern to match associated with a PDF file;
- search said PDF file for said pattern;
- generate a hotspot corresponding to said pattern in said PDF file; and
- store hotspot information comprising said hotspot wherein said hotspot is not stored as a hyperlink in said PDF file.
Abstract
A system and method for relating unstructured data in portable document format to external structured data. A software component is layered on top of an existing PDF document to bridge static information in the document to dynamic information in an external IT system. A PDF document may be parsed and “hotspotted” to provide clickable areas that open windows showing structured data, without adding hyperlinks to the PDF document. Input information describes the items of interest to be used as hotspots, which are located in the document and optionally visually marked. The input information may take the form of a general regular expression, for example. Types of unstructured PDF files include manuals, brochures, etc. Types of structured data include material, business process, finance, or any other type of data, including enterprise data. Dynamic data is thus obtained for a static PDF document. The system may also seamlessly mine PDF or other document files stored in a data repository without presenting them to the user in the form of a view.
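As an illustration of the hotspotting idea in the abstract, the following is a minimal sketch, not the patented implementation. It assumes the page text has already been extracted from the PDF by some other tool, and it records hotspots as character offsets rather than page coordinates. The names `Hotspot`, `find_hotspots`, and `serialize_hotspots`, the sample text, and the part-number pattern are all hypothetical.

```python
import json
import re
from dataclasses import dataclass, asdict


@dataclass
class Hotspot:
    page: int   # zero-based page index (assumed convention)
    start: int  # character offset where the match begins
    end: int    # character offset just past the match
    text: str   # the matched text, e.g. a part number


def find_hotspots(pages, pattern):
    """Scan already-extracted page text and return one Hotspot per regex match."""
    regex = re.compile(pattern)
    return [
        Hotspot(page=i, start=m.start(), end=m.end(), text=m.group())
        for i, page_text in enumerate(pages)
        for m in regex.finditer(page_text)
    ]


def serialize_hotspots(hotspots):
    """Serialize hotspots to JSON held outside the PDF; the PDF itself is untouched."""
    return json.dumps([asdict(h) for h in hotspots])


# Hypothetical sample: two pages of extracted text and a part-number pattern.
pages = [
    "Replace pump P-1001 before servicing.",
    "See valve V-20 and pump P-1002.",
]
hotspots = find_hotspots(pages, r"P-\d{4}")
sidecar = serialize_hotspots(hotspots)
```

Keeping the hotspot records in a separate store, rather than writing link annotations into the file, mirrors the abstract's point that no hyperlinks are added to the PDF document.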
104 Citations
21 Claims
1. A computer program product comprising computer readable instruction code executing in a tangible memory medium of a computer, said computer readable instruction code configured to:
- accept metadata input information that describes a pattern to match associated with a PDF file;
- search said PDF file for said pattern;
- generate a hotspot corresponding to said pattern in said PDF file; and
- store hotspot information comprising said hotspot wherein said hotspot is not stored as a hyperlink in said PDF file.
(Dependent claims: 2, 3, 4, 5, 6, 7)
8. A computer program product comprising computer readable instruction code executing in a tangible memory medium of a computer, said computer readable instruction code configured to:
- obtain a PDF file to display;
- accept a metadata pattern;
- search at least one PDF file in a repository for said metadata pattern;
- generate at least one hotspot associated with said PDF file; and
- store hotspot information associated with said PDF file.
(Dependent claims: 9, 10, 11, 12, 13, 14, 15)
16. A computer program product comprising computer readable instruction code executing in a tangible memory medium of a computer, said computer readable instruction code configured to:
- accept metadata input information that describes a pattern to match associated with a PDF file;
- search said PDF file for said pattern;
- generate a hotspot corresponding to said pattern in said PDF file;
- store hotspot information comprising said hotspot wherein said hotspot is not stored as a hyperlink in said PDF file;
- obtain said PDF file to display;
- display a PDF document as a visual instance of said PDF file;
- obtain said hotspot information;
- accept a user gesture;
- access external information associated with said hotspot information; and
- present external structured data in a user interface component wherein said external structured data is associated with said hotspot information and said metadata input information.
(Dependent claims: 17, 18, 19, 20, 21)
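The display-side steps recited in claim 16 (accept a user gesture, access external information, present structured data) can be pictured with a minimal sketch. It is an illustration under stated assumptions, not the claimed implementation: hotspots are assumed to be stored as page rectangles, the user gesture is reduced to a click at page coordinates, and the external system is stood in for by an in-memory dictionary. All names (`Hotspot`, `hit_test`, `on_click`, `external_data`) and the sample values are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Hotspot:
    page: int   # zero-based page index (assumed convention)
    x0: float   # left edge of the clickable rectangle
    y0: float   # bottom edge
    x1: float   # right edge
    y1: float   # top edge
    key: str    # matched text, used as the lookup key in the external system


def hit_test(hotspots, page, x, y) -> Optional[Hotspot]:
    """Return the first hotspot on the page whose rectangle contains (x, y)."""
    for h in hotspots:
        if h.page == page and h.x0 <= x <= h.x1 and h.y0 <= y <= h.y1:
            return h
    return None


def on_click(hotspots, external_data, page, x, y):
    """Map a click to a hotspot, then fetch the structured record for its key."""
    h = hit_test(hotspots, page, x, y)
    if h is None:
        return None  # the click landed outside every hotspot
    return external_data.get(h.key)


# Hypothetical stored hotspot information and external structured data.
hotspots = [Hotspot(0, 100.0, 700.0, 160.0, 712.0, "P-1001")]
external_data = {"P-1001": {"description": "Centrifugal pump", "stock": 4}}

result = on_click(hotspots, external_data, 0, 120.0, 705.0)
miss = on_click(hotspots, external_data, 0, 10.0, 10.0)
```

In a real viewer the returned record would be rendered in a user interface component such as a pop-up window; here it is simply returned to the caller.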
Specification