Generator for document with HTML tagged table having data elements which preserve layout relationships of information in bitmap image of original document
Abstract
Automatic generation of hypertext markup language (HTML) files based on bitmap image data, which faithfully preserves layout information of an original document from which the bitmap data was obtained. Generally, multi-column document layouts result in automatic generation of HTML files that use HTML "table tags" to display each of the different columns. More particularly, a bitmap image is obtained such as by scanning or retrieval of a pre-existing image, and the bitmap image is segmented into blocks. The location of each block is determined, each block is analyzed in preparation for insertion of appropriate data into an HTML file, and layout analysis is performed to identify layout relationships between the blocks based on the relative locations of the blocks in the bitmap image. Based on the layout relationships, a block type is determined for each block, column span and row span data for each block is determined, blocks are re-ordered if needed, and an HTML file is generated in which blocks are tagged as data elements in a row of an HTML "table tag" based on block type and based on column and row span information for the block.
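The final step described in the abstract, emitting blocks as data elements of an HTML table, can be illustrated with a minimal Python sketch. It assumes the earlier stages (segmentation, recognition, layout analysis) have already produced a row, column, and span for each block. The `Block` class, its field names, and the use of an `<img>` tag as the link to the image file are illustrative assumptions, not details taken from the patent.

```python
from dataclasses import dataclass
from html import escape


@dataclass
class Block:
    """Hypothetical record for one analyzed block (names are assumptions)."""
    kind: str        # "text" or "picture"
    content: str     # recognized text, or the path of the saved image file
    row: int         # table row assigned by layout analysis
    col: int         # table column assigned by layout analysis
    rowspan: int = 1
    colspan: int = 1


def cell_html(block: Block) -> str:
    """Render one block as a <td> element, adding span attributes if needed."""
    attrs = ""
    if block.rowspan > 1:
        attrs += f' rowspan="{block.rowspan}"'
    if block.colspan > 1:
        attrs += f' colspan="{block.colspan}"'
    if block.kind == "picture":
        # Picture blocks reference the separately generated image file.
        inner = f'<img src="{escape(block.content, quote=True)}">'
    else:
        inner = escape(block.content)
    return f"<td{attrs}>{inner}</td>"


def blocks_to_table(blocks: list[Block]) -> str:
    """Group blocks by row, order them by column, and emit an HTML table."""
    rows: dict[int, list[str]] = {}
    for block in sorted(blocks, key=lambda b: (b.row, b.col)):
        rows.setdefault(block.row, []).append(cell_html(block))
    lines = ["<table>"]
    for row_index in sorted(rows):
        lines.append("<tr>" + "".join(rows[row_index]) + "</tr>")
    lines.append("</table>")
    return "\n".join(lines)
```

For a two-column page with text on the left and a picture on the right, `blocks_to_table([Block("text", "Hello", 0, 0), Block("picture", "fig1.gif", 0, 1)])` yields a single table row with two cells, the second of which points at the image file.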
169 Citations
66 Claims
1. A method for generating a hypertext markup language (HTML) file based on a bitmap image, comprising:
obtaining at least two horizontally adjacent image blocks in separate vertical columns of the bitmap image, including an image block for plural character images and an image block for a bitmap image;
generating an image file which stores the bitmap image; and
automatically generating, from the image blocks, an HTML file in which blocks corresponding to the two horizontally adjacent image blocks obtained from the bitmap image are tagged as data elements in a row of an HTML tagged table, wherein the block in the HTML file corresponding to the image block for the bitmap image includes an HTML link to the generated image file.
Dependent claims: 2, 3, 4

5. A method for generating a hypertext markup language (HTML) file based on a bitmap image, comprising:
a segmenting step of segmenting the bitmap image to obtain image blocks, including an image block for plural character images and an image block for a bitmap image;
a determining step of determining where in the bitmap image each of the image blocks is located;
an identifying step of identifying positional relationships between the image blocks based on their relative locations;
a first generating step of generating an image file which stores the bitmap image; and
a second step of automatically generating an HTML file in which blocks corresponding to the image blocks obtained from the bitmap image are tagged as being data elements in an HTML tagged table, the tags being determined in accordance with the identified positional relationships, wherein the block in the HTML file corresponding to the image block for the bitmap image includes an HTML link to the generated image file.
Dependent claims: 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16

17. A method for generating a hypertext markup language (HTML) file based on bitmap image data, comprising the steps of:
segmenting the bitmap image into image blocks, including an image block for plural character images and an image block for a bitmap image;
analyzing layout relationships between the image blocks based on the relative locations of the image blocks in the bitmap image;
a block-type determination step of determining block type of each image block based on the layout relationships obtained in said analyzing step;
a span determination step of determining column span and row span information for each image block that spans more than one column and/or row;
a first generating step of generating an image file which stores the bitmap image; and
a second generating step of automatically generating an HTML file in which blocks corresponding to the image blocks obtained from the bitmap image are tagged as being data elements in an HTML tagged table, the tags being determined based on the block type obtained in said block-type determination step and based on the column span and row span information obtained in said span determination step, wherein the block in the HTML file corresponding to the image block for the bitmap image includes an HTML link to the generated image file.
Dependent claims: 18, 19, 20, 21, 22

23. An apparatus for generating a hypertext markup language (HTML) file based on a bitmap image, comprising:
a memory which stores process steps that are executable by a processor; and
a processor which executes the process steps stored in the memory to
(1) obtain at least two horizontally adjacent image blocks in separate vertical columns of the bitmap image, including an image block for plural character images and an image block for a bitmap image,
(2) generate an image file which stores the bitmap image, and
(3) automatically generate an HTML file in which blocks corresponding to the two horizontally adjacent image blocks obtained by the obtaining means are tagged as data elements in a row of an HTML tagged table, wherein the block in the HTML file corresponding to the image block for the bitmap image includes an HTML link to the generated image file.
Dependent claims: 24, 25, 26

27. An apparatus for generating a hypertext markup language (HTML) file based on a bitmap image, comprising:
a memory which stores process steps that are executable by a processor; and
a processor which executes the process steps stored in the memory
(1) to segment the bitmap image to obtain image blocks, including an image block for plural character images and an image block for a bitmap image,
(2) to determine where in the bitmap image each of the image blocks is located,
(3) to identify positional relationships between the image blocks based on their relative locations,
(4) to generate an image file which stores the bitmap image, and
(5) to automatically generate an HTML file in which blocks corresponding to the image blocks obtained from the bitmap image are tagged as being data elements in an HTML tagged table, the tags being determined in accordance with the identified positional relationships, wherein the block in the HTML file corresponding to the image block for the bitmap image includes an HTML link to the generated image file.
Dependent claims: 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38

39. An apparatus for generating a hypertext markup language (HTML) file based on bitmap image data, comprising:
a memory for storing the bitmap image data and process steps executable by a processor; and
a processor which executes the process steps stored in the memory
(1) to segment the bitmap image into image blocks, including an image block for plural character images and an image block for a bitmap image,
(2) to analyze layout relationships between the image blocks based on the relative locations of the image blocks in the bitmap image,
(3) to determine a block type of each image block based on the layout relationships obtained by the processor,
(4) to determine column span and row span information for each image block that spans more than one column and/or row,
(5) to generate an image file which stores the bitmap image; and
(6) to automatically generate an HTML file in which blocks corresponding to the image blocks obtained from the bitmap image are tagged as being data elements in an HTML tagged table, the tags being determined based on the block type determined by the processor and the column span and row span information determined by the processor, wherein the block in the HTML file corresponding to the image block for the bitmap image includes an HTML link to the generated image file.
Dependent claims: 40, 41, 42, 43, 44

45. Computer-executable process steps stored on a computer-readable medium, the computer executable process steps to generate a hypertext markup language (HTML) file based on a bitmap image, the process steps comprising:
an obtaining step to obtain at least two horizontally adjacent image blocks in separate vertical columns of the bitmap image, including an image block for plural characters and an image block for a bitmap image;
a generating step to generate an image file which stores the bitmap image; and
an automatic generating step to automatically generate an HTML file in which blocks corresponding to the two horizontally adjacent image blocks obtained from the bitmap image are tagged as data elements in a row of an HTML tagged table, wherein the block in the HTML file corresponding to the image block for the bitmap image includes an HTML link to the generated image file.
Dependent claims: 46, 47, 48

49. Computer-executable process steps stored on a computer-readable medium, the computer executable process steps to generate a hypertext markup language (HTML) file based on a bitmap image, the computer-executable process steps comprising:
a segmenting step to segment the bitmap image to obtain image blocks, including an image block for plural character images and an image block for a bitmap image;
a determining step to determine where in the bitmap image each of the image blocks is located;
an identifying step to identify positional relationships between the image blocks based on their relative locations;
a first generating step to generate an image file which stores the bitmap image; and
a second generating step to automatically generate an HTML file in which blocks corresponding to the image blocks obtained from the bitmap image are tagged as being data elements in an HTML tagged table, the tags being determined in accordance with the identified positional relationships, wherein the block in the HTML file corresponding to the image block for the bitmap image includes an HTML link to the generated image file.
Dependent claims: 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60

61. Computer-executable process steps stored on a computer-readable medium, the computer executable process steps to generate a hypertext markup language (HTML) file based on bitmap image data, the computer-executable process steps comprising:
a segmenting step to segment the bitmap image into image blocks, including an image block for plural character images and an image block for a bitmap image;
an analyzing step to analyze layout relationships between the image blocks based on the relative locations of the image blocks in the bitmap image;
a block-type determination step to determine a block type of each image block based on the layout relationships obtained in said analyzing step;
a span determination step to determine column span and row span information for each image block that spans more than one column and/or row;
a first generating step to generate an image file which stores the bitmap image; and
a second generating step to automatically generate an HTML file in which blocks corresponding to the image blocks obtained from the bitmap image are tagged as being data elements in an HTML tagged table, the tags being determined based on the block type obtained in said block-type determination step and based on the column span and row span information obtained in said span determination step, wherein the block in the HTML file corresponding to the image block for the bitmap image includes an HTML link to the generated image file.
Dependent claims: 62, 63, 64, 65, 66
-
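The span determination recited in claims 17, 39, and 61 can be pictured with a small Python sketch that derives a row, column, row span, and column span for each block from its bounding box. This is only one plausible approach, not the method of the specification: the `Box` type and the `grid_positions` function are hypothetical names, and the sketch assumes that blocks in the same column share an exact left edge and blocks in the same row share an exact top edge. Real scanned pages would need some tolerance when comparing edges.

```python
from typing import NamedTuple


class Box(NamedTuple):
    """Hypothetical bounding box of a block, in pixel coordinates."""
    left: int
    top: int
    right: int
    bottom: int


def grid_positions(boxes: list[Box]) -> list[dict[str, int]]:
    """Assign each box a table row/column and its row/column span.

    Column starts are the distinct left edges and row starts are the
    distinct top edges. A box spans every column (row) whose start lies
    within its horizontal (vertical) extent.
    """
    col_starts = sorted({b.left for b in boxes})
    row_starts = sorted({b.top for b in boxes})
    positions = []
    for b in boxes:
        cols = [i for i, x in enumerate(col_starts) if b.left <= x < b.right]
        rows = [i for i, y in enumerate(row_starts) if b.top <= y < b.bottom]
        positions.append({
            "row": rows[0],
            "col": cols[0],
            "rowspan": len(rows),
            "colspan": len(cols),
        })
    return positions


# Example layout: a title across the page, then text beside a picture.
layout = [
    Box(0, 0, 200, 20),      # title spanning both columns
    Box(0, 30, 100, 130),    # text block, left column
    Box(100, 30, 200, 130),  # picture block, right column
]
positions = grid_positions(layout)
```

In this example the title block covers both column starts and therefore receives a column span of 2, while the text and picture blocks each occupy a single cell in the second row.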
Specification