Electronic book pagination
Abstract
An electronic book can be paginated by reference to a print version of the same book. Pages of the print version are scanned to obtain text strings and page labels corresponding to each of the pages. The text strings are then compared to the electronic book to find the best matching positions within the electronic book. The matching positions within the electronic book are then associated with the page numbers of the pages from which the matching text strings were obtained. Autocorrelation can be used to determine matching positions.
29 Claims
1. A computer-implemented method of paginating an electronic book, the method comprising:
under control of one or more processors configured with executable instructions,
capturing images of pages of a print version of the electronic book;
performing optical character recognition on the images;
based at least in part on the optical character recognition, identifying page text strings and associated page labels corresponding to individual pages;
for an individual page text string, autocorrelating with at least a portion of the electronic book to determine a plurality of candidate matching positions within the electronic book;
at each respective candidate matching position within the electronic book, comparing the individual page text string to a respective text string of the electronic book corresponding to the respective candidate matching position determined from the autocorrelating;
selecting a page position from the plurality of candidate matching positions based at least in part on the comparing; and
mapping between the selected page position and the page label associated with the individual page text string.
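The two-stage matching recited in claim 1 (a correlation pass that proposes candidate positions, followed by a comparison at each candidate) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and is not taken from the patent: the correlation score is simply the count of agreeing characters on a short probe, the comparison uses the standard-library `difflib.SequenceMatcher`, and the names `candidate_positions`, `select_position`, `paginate`, `probe_len`, and `top_k` are hypothetical.

```python
from difflib import SequenceMatcher


def candidate_positions(page_text, book_text, probe_len=12, top_k=3):
    """First pass (assumed scoring): slide a short probe from the page over
    the book text, count agreeing characters at each offset, and keep the
    top_k best-scoring offsets as candidate matching positions."""
    probe = page_text[:probe_len]
    scored = []
    for pos in range(len(book_text) - len(probe) + 1):
        window = book_text[pos:pos + len(probe)]
        score = sum(a == b for a, b in zip(probe, window))
        scored.append((score, pos))
    # Highest score first; ties are broken by the earlier position.
    scored.sort(key=lambda item: (-item[0], item[1]))
    return [pos for _, pos in scored[:top_k]]


def select_position(page_text, book_text, candidates):
    """Second pass: compare the full page text with the book text at each
    candidate position and return the most similar candidate."""
    def similarity(pos):
        window = book_text[pos:pos + len(page_text)]
        return SequenceMatcher(None, page_text, window).ratio()
    return max(candidates, key=similarity)


def paginate(pages, book_text):
    """pages is a list of (page_label, page_text) pairs as might come out of
    OCR. Returns a dict mapping each page label to the selected position."""
    mapping = {}
    for label, text in pages:
        candidates = candidate_positions(text, book_text)
        mapping[label] = select_position(text, book_text, candidates)
    return mapping


if __name__ == "__main__":
    book = ("It was the best of times, it was the worst of times, "
            "it was the age of wisdom.")
    # Page text with simulated OCR errors ("bcst", "0f").
    ocr_pages = [("1", "It was the bcst of times, "),
                 ("2", "it was the worst 0f times, ")]
    print(paginate(ocr_pages, book))
```

The short probe keeps the first pass cheap, while the full-page comparison in the second pass tolerates OCR errors that an exact substring search would not.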
7. A computer-implemented method comprising:
under control of one or more processors configured with executable instructions,
performing machine-based optical character recognition on individual pages of a print version of a book to obtain page labels and page text from the individual pages;
finding positions within an electronic version of the book that correspond to individual pages of the print version, wherein the finding is based at least in part on:
autocorrelation between the page text and the electronic version to identify a plurality of candidate positions for respective ones of the individual pages, and
comparing the page text and text of the electronic version at the plurality of candidate positions to select found positions from the plurality of candidate positions determined from the autocorrelation; and
associating the found positions within the electronic version with the page labels of the corresponding individual pages of the print version.
16. A system comprising:
one or more processors;
one or more computer-readable media containing instructions that are executable by the one or more processors to perform actions comprising:
obtaining page text and corresponding page labels from individual pages of a print version of a book;
determining a page-to-label mapping between the individual pages and the page labels obtained from the print version;
comparing the page text of the individual pages to an electronic version of the book to determine the positions of the individual pages within the electronic version;
creating, based at least in part on the comparing and the page-to-label mapping, a page/position map as a data object separate from content of the electronic version, wherein the page/position map indicates correspondences between the page labels obtained from the print version and positions within the electronic version; and
paginating the electronic version in accordance with the comparing.
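As an illustration of a page/position map kept as a data object separate from the book content, the following Python sketch joins a page-to-label mapping with per-page positions and answers "which print page is this offset on?". The JSON-friendly layout, the field names `entries`, `label`, and `position`, and the functions `build_page_map` and `label_at` are assumptions made for this example; the claim does not prescribe any particular format.

```python
import json
from bisect import bisect_right


def build_page_map(page_to_label, page_positions):
    """Join the two inputs on the scanned-page index.

    page_to_label:  {scan_index: page_label}, e.g. {0: "i", 2: "1"}
    page_positions: {scan_index: character offset in the electronic text}
    Pages without a label are skipped; entries are sorted by position.
    """
    entries = [
        {"label": page_to_label[index], "position": position}
        for index, position in page_positions.items()
        if index in page_to_label
    ]
    entries.sort(key=lambda entry: entry["position"])
    return {"entries": entries}


def label_at(page_map, offset):
    """Return the label of the print page containing the given offset,
    or None if the offset precedes the first mapped page."""
    starts = [entry["position"] for entry in page_map["entries"]]
    i = bisect_right(starts, offset) - 1
    return page_map["entries"][i]["label"] if i >= 0 else None


if __name__ == "__main__":
    labels = {0: "i", 1: "ii", 2: "1"}
    positions = {2: 400, 0: 0, 1: 120}
    page_map = build_page_map(labels, positions)
    # The map serializes on its own, independent of the book content.
    print(json.dumps(page_map))
    print(label_at(page_map, 250))
```

Because the map only stores labels and offsets, it can be shipped or updated without touching the electronic text itself.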
21. A computer-implemented method comprising:
under control of one or more processors configured with executable instructions,
autocorrelating a first character string against substrings at different positions within a second character string to produce autocorrelation scores corresponding to the substrings;
selecting a matching position within the second character string based at least in part on the autocorrelation scores corresponding to the substrings, the matching position corresponding to a first substring of the substrings; and
subsequent to autocorrelating the first character string, autocorrelating a third character string against the substrings of the second character string, wherein at least one of the substrings is at a position within the second character string that is prior to the matching position of the first substring.
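The sequence in claim 21 can be sketched in Python as follows: one string is scored against the substring at every position of a second string, a matching position is selected from the scores, and a later string is then scored against the same substrings, including those that lie before the earlier match. The scoring rule (count of agreeing characters) and the names `correlation_scores` and `best_position` are assumptions for illustration only.

```python
def correlation_scores(needle, haystack):
    """Score needle against the substring of haystack at every offset.
    The score (an assumption of this sketch) is the number of character
    positions at which the two strings agree."""
    n = len(needle)
    return [
        sum(a == b for a, b in zip(needle, haystack[pos:pos + n]))
        for pos in range(len(haystack) - n + 1)
    ]


def best_position(needle, haystack):
    """Return the offset with the highest score (earliest offset on ties)."""
    scores = correlation_scores(needle, haystack)
    return max(range(len(scores)), key=scores.__getitem__)


if __name__ == "__main__":
    second = "chapter one begins here. chapter two follows."
    first = "chapter two"
    third = "chapter one"

    first_match = best_position(first, second)
    # The later search is not restricted to offsets after first_match, so
    # a string that belongs earlier in the text can still be found.
    third_match = best_position(third, second)
    print(first_match, third_match)
```

Searching the full range on each pass is the simplest way to include positions prior to the earlier match; a bounded window around the previous match would be a cheaper variant with the same property.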
27. A method comprising:
determining, by one or more processors, page text corresponding to individual pages of a print version of a book;
determining, by the one or more processors, page labels included on the individual pages of the print version;
determining, by the one or more processors, a page-to-label mapping between the individual pages and the page labels from the print version;
determining, by the one or more processors, positions within an electronic version of the book that correspond to the individual pages of the print version, wherein the determining is based at least in part on comparing the page text of the individual pages of the print version to text of the electronic version; and
generating, by the one or more processors, based at least in part on the page-to-label mapping and the comparing, a page/position map, wherein the page/position map indicates correspondences between the page labels from the print version of the book and the positions within the electronic version of the book.
Specification