RETENTION OF CONTENT IN CONVERTED DOCUMENTS
First Claim
Patent Images
1. A method for lossless conversion of a PDF document to searchable PDF document using a processor device, comprising:
- receiving a PDF document having a potential first text layer;
performing an evaluation of quality of the potential first text layer, wherein if the potential first layer does not exist or is not acceptable, a second text layer is generated for searching or copying.
2 Assignments
0 Petitions
Accused Products
Abstract
For lossless conversion of a PDF document to searchable PDF document, the PDF document is received. The PDF document has a potential first text layer. An evaluation of quality of the first text layer is performed. The first text layer is determined to be nonexistent or unacceptable. A text recognition of the document is performed to generate a second text layer. The second text layer is made to be used for searching or copying.
15 Citations
27 Claims
-
1. A method for lossless conversion of a PDF document to searchable PDF document using a processor device, comprising:
-
receiving a PDF document having a potential first text layer; performing an evaluation of quality of the potential first text layer, wherein if the potential first layer does not exist or is not acceptable, a second text layer is generated for searching or copying. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A system for lossless conversion of a PDF document to searchable PDF document, the system comprising:
at least one processor device, wherein the at least one processor device; receives a PDF document having a potential first text layer; performs an evaluation of quality of the potential first text layer, wherein if the potential first text layer does not exist or is not acceptable, a second text layer is generated for searching or copying. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
19. A computer program product lossless conversion of a PDF document to searchable PDF document by a processor device, the computer program product comprising a non-transitory computer-readable storage medium having computer-readable program code portions stored therein, the computer-readable program code portions comprising:
-
a first executable portion that receives a PDF document having a potential first text layer; a second executable portion that performs an evaluation of quality of the potential first text layer, wherein if the potential first text layer does not exist or not acceptable, a second text layer is generated for searching or copying. - View Dependent Claims (20, 21, 22, 23, 24, 25, 26, 27)
-
Specification