System and method for data publication through web pages
Abstract
A system and a method for publishing a newspaper page or other data through a Web page, such that the information can be made available more easily through a network such as the Internet. The data is automatically converted to the Web page format by first rendering the newspaper page into a digital format; converting the digital format to a basic internal publishing format; and then publishing the data in any one of a number of different possible publishing formats, including but not limited to a mark-up language document such as a Web page. The present invention supports such advanced features as arrangement of the content of the newspaper according to relationships within the information of the content and/or according to the preference(s) of the user by analyzing the newspaper page as a plurality of objects. Each newspaper object may optionally be a title, an article, a picture and/or other graphic advertisement, and so forth.
31 Claims
1. A method for automatically publishing data in a final publication format, wherein the data is in the form of a newspaper having an original, existing format including a plurality of pages, each page having a predetermined layout comprising a plurality of independently standing data blocks, each block having an internal structure, the method comprising:
obtaining a scanned representation of said newspaper, said representation preserving said layout, from said representation automatically analyzing the newspaper to decompose the predetermined layout of each page of the newspaper in the original, existing format into said plurality of blocks, each block representing an object;
said analyzing furthermore applying knowledge of newspaper structure and identifying at least one logical relationship between said blocks;
converting each object to an internal publication format, said internal publication format identifying and preserving said internal structure of said blocks within said objects, said internal publication format furthermore preserving said layout as a relationship between said objects;
said internal publication format furthermore preserving said logical relationship; and
said internal publication format comprising a mark-up language to indicate said objects and features of said internal structure wherein said mark-up language is XML; and
rendering said internal publication format to incorporate said objects, said layout and respective internal structures and said logical relationship in the final publication format;
thereby forming said newspaper into said final publication format from objects in said internal publication format by rendering.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22
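For illustration only, the conversion step of claim 1 can be pictured as serializing analyzed blocks into an XML document that records each object, its internal structure, its position on the page, and the logical relationships between objects. The sketch below is not the patented implementation: the `Block` class, the element and attribute names (`newspaper`, `page`, `object`, `para`, `relation`) and the relationship labels `title-of` and `continued-in` are all hypothetical choices made for this example.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass, field


@dataclass
class Block:
    """One independently standing block from layout analysis (hypothetical model)."""
    block_id: str
    kind: str                      # e.g. "title", "article", "picture"
    page: int
    bbox: tuple                    # (x, y, width, height) on the source page
    paragraphs: list = field(default_factory=list)  # internal structure


def to_internal_xml(blocks, relations):
    """Serialize blocks and (type, source, target) relations to an XML tree."""
    root = ET.Element("newspaper")
    pages = {}
    for b in blocks:
        page_el = pages.get(b.page)
        if page_el is None:
            page_el = ET.SubElement(root, "page", number=str(b.page))
            pages[b.page] = page_el
        x, y, w, h = b.bbox
        # Layout is preserved as position attributes on each object.
        obj = ET.SubElement(
            page_el, "object", id=b.block_id, type=b.kind,
            x=str(x), y=str(y), width=str(w), height=str(h),
        )
        # Internal structure is preserved as child elements.
        for text in b.paragraphs:
            ET.SubElement(obj, "para").text = text
    rels = ET.SubElement(root, "relations")
    for kind, source, target in relations:
        ET.SubElement(rels, "relation", type=kind, source=source, target=target)
    return root


blocks = [
    Block("b1", "title", 1, (10, 10, 400, 40), ["Council approves budget"]),
    Block("b2", "article", 1, (10, 60, 200, 300),
          ["First paragraph.", "Second paragraph."]),
    Block("b3", "article", 2, (10, 10, 200, 150), ["Continued from page 1."]),
]
relations = [("title-of", "b1", "b2"), ("continued-in", "b2", "b3")]

root = to_internal_xml(blocks, relations)
xml_text = ET.tostring(root, encoding="unicode")
print(xml_text)
```

Keeping the relationships in a separate `relations` element, rather than nesting objects, is one possible design; it lets a relationship span pages, as with an article continued on a later page.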
23. An electronic system tangibly embodied in a computer for automatically publishing received data of a pre-existing newspaper, the newspaper in a computerized format, the system comprising:
(a) at least one source of newspaper data, said source preserving an original, structure of the newspaper, the computerized format comprising a plurality of pages, each page having a predetermined layout comprising a plurality of independently standing data blocks, each block having an internal structure;
(b) a mark-up language distiller module implemented on a processor, for converting the newspaper from said original format to a mark-up language format, wherein said mark-up language distiller module is configured to apply knowledge of newspaper structure and automatically analyzes the newspaper data in said original, existing digital format to
(1) decompose the newspaper data into said plurality of blocks, each block with said internal structure representing an independent data object, each object having content and at least one attribute of the data, such that each object is converted to said mark-up language format, said markup language distiller module further analyzing and preserving said structure; and
(2) identify at least one logical relationship between blocks, said markup language distiller module further analyzing and preserving said logical relationship, wherein said mark-up language format is XML; and
(c) a publisher server for rendering the data from said mark-up language format to a final publication format, said final publication format being a rendered electronic format for display incorporating, as objects, said blocks with said internal structure, and further defining interrelationships between said objects as part of said electronic format;
said publication server thereby forming said newspaper into said final publication format from objects in said internal publication format by rendering.
Dependent claims: 24, 25, 26, 27, 28
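As a rough illustration of the rendering role described for the publisher server in element (c), the sketch below turns a small XML document into HTML in which each object keeps its position and its relationship to another object. It is a minimal sketch under stated assumptions, not the claimed system: the XML vocabulary, the `render_html` function, and the use of absolutely positioned `div` elements with `data-` attributes are all hypothetical choices for this example.

```python
import xml.etree.ElementTree as ET
from html import escape

# Hypothetical internal-format document; element names are assumptions.
INTERNAL_XML = """
<newspaper>
  <page number="1">
    <object id="b1" type="title" x="10" y="10" width="400" height="40">
      <para>Council approves budget</para>
    </object>
    <object id="b2" type="article" x="10" y="60" width="200" height="300">
      <para>First paragraph.</para>
      <para>Costs &amp; savings.</para>
    </object>
  </page>
  <relations>
    <relation type="title-of" source="b1" target="b2"/>
  </relations>
</newspaper>
"""


def render_html(root):
    """Render every page as a positioned container of object <div>s."""
    # Map each source object id to its (relation type, target id).
    related = {
        rel.get("source"): (rel.get("type"), rel.get("target"))
        for rel in root.findall("./relations/relation")
    }
    out = []
    for page in root.findall("page"):
        out.append('<div class="page" style="position:relative">')
        for obj in page.findall("object"):
            style = "position:absolute;left:{}px;top:{}px;width:{}px;height:{}px".format(
                obj.get("x"), obj.get("y"), obj.get("width"), obj.get("height"))
            attrs = 'id="{}" class="{}" style="{}"'.format(
                escape(obj.get("id")), escape(obj.get("type")), style)
            if obj.get("id") in related:
                kind, target = related[obj.get("id")]
                # Expose the logical relationship in the rendered output.
                attrs += ' data-{}="{}"'.format(escape(kind), escape(target))
            out.append("<div {}>".format(attrs))
            tag = "h2" if obj.get("type") == "title" else "p"
            for para in obj.findall("para"):
                out.append("<{0}>{1}</{0}>".format(tag, escape(para.text or "")))
            out.append("</div>")
        out.append("</div>")
    return "\n".join(out)


html_text = render_html(ET.fromstring(INTERNAL_XML.strip()))
print(html_text)
```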
29. A method for automatically publishing data of a preexisting document in a final publication format, wherein the data is received in an original, existing format comprising a plurality of pages, each page having a predetermined layout comprising a plurality of independently standing data blocks, each block having an internal structure the method comprising:
obtaining a representation of said preexisting document, said representation preserving said layout;
from said representation, automatically analyzing the preexisting document to decompose the data received in the original, existing format into a plurality of objects, each object corresponding to one of said blocks;
said analyzing furthermore applying knowledge of document structure and identifying at least one logical relationship between said blocks;
preparing a list of text and/or graphic elements for each object;
determining properties of each element, including determining visibility and overlap characteristics for each graphic element within said object;
recognizing structural layout properties of the data in an original format;
converting each object to an internal publication format;
said internal publication format furthermore preserving said logical relationship; and
said internal publication format comprising a mark-up language to indicate said objects and features of said internal structure, wherein said mark-up language format is XML and
rendering said internal publication format in the final publication format, said final publication format presenting said blocks as said independently standing objects incorporating said internal structure and said layout and said logical relationship;
thereby forming said newspaper into said final publication format from objects in said internal publication format by rendering.
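The claim does not say how visibility and overlap characteristics are determined. One simple way to picture the step, assuming axis-aligned rectangular elements listed in drawing order and treated as opaque, is sketched below. The `Rect` class, the `classify` function and the rule "an element is hidden if a single later element fully covers it" are assumptions made for this example only.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rect:
    x: float
    y: float
    w: float
    h: float


def overlap_area(a, b):
    """Area of the intersection of two rectangles, 0.0 if they do not overlap."""
    dx = min(a.x + a.w, b.x + b.w) - max(a.x, b.x)
    dy = min(a.y + a.h, b.y + b.h) - max(a.y, b.y)
    return dx * dy if dx > 0 and dy > 0 else 0.0


def covers(top, bottom):
    """True if `top` fully contains `bottom`."""
    return (top.x <= bottom.x and top.y <= bottom.y
            and top.x + top.w >= bottom.x + bottom.w
            and top.y + top.h >= bottom.y + bottom.h)


def classify(elements):
    """elements: list of (name, Rect) in drawing order, earliest first.

    Assumption: every element is opaque, so an element is hidden when any
    single element drawn after it covers it completely.
    """
    result = {}
    for i, (name, rect) in enumerate(elements):
        hidden = any(covers(later, rect) for _, later in elements[i + 1:])
        overlaps = [other_name
                    for j, (other_name, other) in enumerate(elements)
                    if j != i and overlap_area(rect, other) > 0]
        result[name] = {"visible": not hidden, "overlaps": overlaps}
    return result


elements = [
    ("background", Rect(0, 0, 100, 100)),
    ("photo", Rect(10, 10, 30, 30)),
    ("mask", Rect(5, 5, 50, 50)),      # drawn after "photo" and covers it
    ("logo", Rect(200, 200, 10, 10)),
]
info = classify(elements)
for name, props in info.items():
    print(name, props)
```

A fuller treatment would also handle elements hidden by the union of several later elements, transparency, and clipping paths; those cases are deliberately left out here.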
30. A method for automatically publishing received data in a final publication format, the data having an original, existing format comprising a plurality of pages, each page having a predetermined layout comprising a plurality of independently standing data blocks, the method comprising:
obtaining a representation of said received data having an original existing format, said representation preserving said layout;
from said representation, automatically analyzing the pages to decompose the pages of the data received in the original, existing format into a plurality of objects, said objects corresponding to said blocks;
said analyzing furthermore applying knowledge of page structure and identifying at least one logical relationship between said blocks;
preparing a list of text and/or graphic elements for each object;
determining properties of each element, including determining a special characteristic for each text element;
recognizing structural layout properties of the data in an original format;
converting each object to an internal publication format;
said internal publication format identifying and preserving said internal structure of said blocks within said objects, said internal publication format furthermore preserving said layout as a relationship between said objects;
said internal publication format furthermore preserving said logical relationship; and
said internal publication format comprising a mark-up language to indicate said objects and features of said internal structure, wherein said mark-up language format is XML, and
rendering said internal publication format in the final publication format such as to include said recognized structure in said objects and said logical relationship and to include said layout;
thereby forming said received data into said final publication format from objects in said internal publication format by rendering.
31. A method for automatically publishing received data in a final publication format, wherein the data is in the form of a newspaper, the newspaper having an original, existing format comprising a plurality of pages, each page having a predetermined layout comprising a plurality of independently standing data blocks, each block having structural layout properties, the method comprising:
obtaining a representation of said newspaper, said representation preserving said layout;
from said representation, automatically analyzing the newspaper to decompose the data received in the original, existing format into a plurality of objects, said objects corresponding to said independently standing blocks; and
said analyzing furthermore applying knowledge of newspaper structure and identifying at least one logical relationship between said blocks;
preparing a list of text and/or graphic elements for each object;
determining properties of each element;
recognizing said structural layout properties of the data in an original format;
determining each text segment for each object;
building a text block from a plurality of aligned text segments;
converting each object to an internal publication format;
said internal publication format furthermore preserving said logical relationship; and
said internal publication format comprising a mark-up language to indicate said objects and features of said internal structure, wherein said mark-up language format is XML and
rendering said internal publication format in the final publication format to comprise said blocks as objects incorporating said structural layout properties, and to include said layout and said logical relationship;
thereby forming said newspaper into said final publication format from objects in said internal publication format by rendering.
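The step "building a text block from a plurality of aligned text segments" can be pictured, for example, as grouping line-level segments that share a left edge and follow one another with a small vertical gap. The sketch below shows one such grouping; the `Segment` class, the `build_text_blocks` function and the tolerance parameters `x_tol` and `max_gap` are hypothetical, and the claim itself does not specify the alignment criterion.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """One line-level text segment with its position (hypothetical model)."""
    x: float        # left edge
    y: float        # top edge
    height: float
    text: str


def build_text_blocks(segments, x_tol=2.0, max_gap=4.0):
    """Group segments into blocks of left-aligned, vertically adjacent lines.

    A segment joins an existing block when its left edge is within `x_tol`
    of the block's last segment and it starts at most `max_gap` below it.
    """
    blocks = []
    for seg in sorted(segments, key=lambda s: (s.y, s.x)):
        for block in blocks:
            last = block[-1]
            gap = seg.y - (last.y + last.height)
            if abs(seg.x - last.x) <= x_tol and 0 <= gap <= max_gap:
                block.append(seg)
                break
        else:
            # No aligned block found: this segment starts a new block.
            blocks.append([seg])
    return blocks


segments = [
    Segment(10, 0, 10, "The council voted"),
    Segment(200, 0, 10, "Rain is expected"),
    Segment(10, 12, 10, "late on Tuesday"),
    Segment(200, 12, 10, "through Friday."),
    Segment(10, 24, 10, "to pass the budget."),
    Segment(10.5, 60, 10, "Photo: staff"),   # too far below to join column one
]
text_blocks = build_text_blocks(segments)
texts = [" ".join(s.text for s in block) for block in text_blocks]
print(texts)
```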
Specification