System, method, and computer program product for generating documents using pagination information
First Claim
Patent Images
1. A method of generating a new document from a source text document and a source image document, comprising the steps of:
- (1) accessing said source text document and said source image document;
(2) at least partially paginating said source text document with said source image document to produce at least partial pagination information; and
(3) generating said new document using said at least partial pagination information, wherein said new document is an equivalent text file comprising one or more of (A)-(M);
(A) information representing an approximate arrangement of at least some bibliographic data as represented in said source image document,(B) information that effectively provides an association between at least a portion of said source text document and at least a portion of said source image document,(C) special character information specifying at least one mapping of a group of characters in said source text document to at least one special character in said source image document,(D) column information representing at least an approximate arrangement of text in columns,(E) line information representing at least an approximate arrangement of text in lines,(F) line number information representing approximate line numbers of lines,(G) section information representing at least approximate positions of sections,(H) font information representing font styles of characters,(I) font size information representing font sizes of characters,(J) superscript information indicating characters that are represented using superscripts,(K) subscript information indicating characters that are represented using subscripts,(L) bold attribute information indicating characters that are bolded, and(M) italicized attribute information indicating characters that are italicized.
8 Assignments
0 Petitions
Accused Products
Abstract
Generating a new document from a source text document and a source image document. The new document is generated by accessing the source text document and the source image document, paginating the source text document with the source image document to produce pagination information, and generating the new document using the pagination information.
203 Citations
26 Claims
-
1. A method of generating a new document from a source text document and a source image document, comprising the steps of:
-
(1) accessing said source text document and said source image document; (2) at least partially paginating said source text document with said source image document to produce at least partial pagination information; and (3) generating said new document using said at least partial pagination information, wherein said new document is an equivalent text file comprising one or more of (A)-(M); (A) information representing an approximate arrangement of at least some bibliographic data as represented in said source image document, (B) information that effectively provides an association between at least a portion of said source text document and at least a portion of said source image document, (C) special character information specifying at least one mapping of a group of characters in said source text document to at least one special character in said source image document, (D) column information representing at least an approximate arrangement of text in columns, (E) line information representing at least an approximate arrangement of text in lines, (F) line number information representing approximate line numbers of lines, (G) section information representing at least approximate positions of sections, (H) font information representing font styles of characters, (I) font size information representing font sizes of characters, (J) superscript information indicating characters that are represented using superscripts, (K) subscript information indicating characters that are represented using subscripts, (L) bold attribute information indicating characters that are bolded, and (M) italicized attribute information indicating characters that are italicized. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A system of generating a new document from a source text document and a source image document, comprising:
-
accessing means for accessing said source text document and said source image document; paginating means for at least partially paginating said source text document with said source image document to produce at least partial pagination information; and document generating means for generating said new document using said at least partial pagination information, wherein said new document is an equivalent text file comprising one or more of (A)-(M); (A) information representing an approximate arrangement of at least some bibliographic data as represented in said source image document, (B) information that effectively provides an association between at least a portion of said source text document and at least a portion of said source image document, (C) special character information specifying at least one mapping of a group of characters in said source text document to at least one special character in said source image document, (D) column information representing at least an approximate arrangement of text in columns, (E) line information representing at least an approximate arrangement of text in lines, (F) line number information representing approximate line numbers of lines, (G) section information representing at least approximate positions of sections, (H) font information representing font styles of characters, (I) font size information representing font sizes of characters, (J) superscript information indicating characters that are represented using superscripts, (K) subscript information indicating characters that are represented using subscripts, (L) bold attribute information indicating characters that are bolded, and (M) italicized attribute information indicating characters that are italicized. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
-
17. A computer program product having control logic stored therein, said control logic, when executed, enabling a computer to generate a new document from a source text document and a source image document, said control logic comprising:
-
accessing means for enabling the computer to access said source text document and said source image document; paginating means for enabling the computer to at least partially paginate said source text document with said source image document to produce at least partial pagination information; and document generating means for enabling the computer to generate said new document using said at least partial pagination information, wherein said new document is an equivalent text file comprising one or more of (A)-(M); (A) information representing an approximate arrangement of at least some bibliographic data as represented in said source image document, (B) information that effectively provides an association between at least a portion of said source text document and at least a portion of said source image document, (C) special character information specifying at least one mapping of a group of characters in said source text document to at least one special character in said source image document, (D) column information representing at least an approximate arrangement of text in columns, (E) line information representing at least an approximate arrangement of text in lines, (F) line number information representing approximate line numbers of lines, (G) section information representing at least approximate positions of sections, (H) font information representing font styles of characters, (I) font size information representing font sizes of characters, (J) superscript information indicating characters that are represented using superscripts, (K) subscript information indicating characters that are represented using subscripts, (L) bold attribute information indicating characters that are bolded, and (M) italicized attribute information indicating characters that are italicized. - View Dependent Claims (18, 19, 20, 21, 22, 23, 24)
-
-
25. A method of generating a new patent document, comprising the steps of:
-
(1) receiving an electronic patent text file and an electronic patent image file originating from a source, said electronic patent text file and said electronic patent image file corresponding to a patent; (2) at least partially paginating said electronic patent text file with said electronic patent image file to produce at least partial pagination information; and (3) generating said new patent document using said at least partial pagination information, wherein said new patent document is an equivalent text file comprising one or more of (A)-(M); (A) information representing an approximate arrangement of at least some bibliographic data as represented in said source image document, (B) information that effectively provides an association between at least a portion of said source text document and at least a portion of said source image document, (C) special character information specifying at least one mapping of a group of characters in said source text document to at least one special character in said source image document, (D) column information representing at least an approximate arrangement of text in columns, (E) line information representing at least an approximate arrangement of text in lines, (F) line number information representing approximate line numbers of lines, (G) section information representing at least approximate positions of sections, (H) font information representing font styles of characters, (I) font size information representing font sizes of characters, (J) superscript information indicating characters that are represented using superscripts, (K) subscript information indicating characters that are represented using subscripts, (L) bold attribute information indicating characters that are bolded, and (M) italicized attribute information indicating characters that are italicized. - View Dependent Claims (26)
-
Specification