INTEGRATED DOCUMENT VIEWER
First Claim
1. A method for converting and integrating non-HTML documents into HTML web pages on a host server while preserving the original appearance and text searchability of the documents, the method including the following steps:
- (a) parsing a document to extract text characters and associated fonts, as well as page layout attributes of the document, each glyph in a font representing the appearance of its associated text character with respect to that font;
(b) integrating the text characters into an HTML web page and generating HTML tags to preserve the document'"'"'s page layout attributes;
(c) generating one or more font files, accessible from the HTML web page, that map the text characters to their associated glyphs; and
(d) storing the HTML web page and font files on the host server for delivery to and rendering within the window of a client web browser, whereby the original appearance and text searchability of the document is preserved.
2 Assignments
0 Petitions
Accused Products
Abstract
In various embodiments of the present invention, documents (eg, PDFs) are converted into HTML 5 (and CSS 3) formats and integrated into existing HTML 5 web pages to preserve the original embedded fonts. The fonts can also be integrated or embedded (e.g., via the standard HTML “iframe” tag) into other web pages. The original appearance of the source document is maintained, the text is preserved as searchable text, and the document is integrated into a web page that can be searched, zoomed, scrolled, and printed utilizing standard web browser controls. A significantly increased “ad inventory” is thereby enabled, wherein advertisements can be integrated between pages, or even within a page. Moreover, the resulting document can be passively shared with members of a user'"'"'s external social networks (including those within the host website), along with other activities and behaviors performed by the user on the hosting website.
-
Citations
19 Claims
-
1. A method for converting and integrating non-HTML documents into HTML web pages on a host server while preserving the original appearance and text searchability of the documents, the method including the following steps:
-
(a) parsing a document to extract text characters and associated fonts, as well as page layout attributes of the document, each glyph in a font representing the appearance of its associated text character with respect to that font; (b) integrating the text characters into an HTML web page and generating HTML tags to preserve the document'"'"'s page layout attributes; (c) generating one or more font files, accessible from the HTML web page, that map the text characters to their associated glyphs; and (d) storing the HTML web page and font files on the host server for delivery to and rendering within the window of a client web browser, whereby the original appearance and text searchability of the document is preserved. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A method for displaying text in a web page using the built-in functionality of a web browser, while inhibiting the use of that functionality to search for and copy the text, the method including the following steps:
-
(a) generating a font file containing, for each text character, a corresponding glyph representing the appearance of that character, and a mismatched character code that does not correspond to the glyph; and (b) generating an HTML document that contains the mismatched character codes and instructions directing the web browser to use the font file for displaying the glyphs, (c) whereby the web browser utilizes the font file to display the text correctly, but cannot search for or copy the text due to the mismatched character codes in the HTML document.
-
-
13. A system that converts and integrates non-HTML documents into HTML web pages on a host server while preserving the original appearance and text searchability of the documents, the system comprising:
-
(a) a document parser that extracts text characters and associated fonts, as well as page layout attributes of the document, each glyph in a font representing the appearance of its associated text character with respect to that font; (b) an HTML converter that integrates the text characters into an HTML web page and generates HTML tags to preserve the document'"'"'s page layout attributes; (c) a font file generator that generates one or more font files, accessible from the HTML web page, that map the text characters to their associated glyphs; and (d) a website host on the host server that stores the HTML web page and font files for delivery to and rendering within the window of a client web browser, whereby the original appearance and text searchability of the document is preserved. - View Dependent Claims (14, 15, 16, 17, 18, 19)
-
Specification