Method and system for determining the legibility of text in an image
Abstract
Legibility of text in an image of a page is determined by comparing a measure of the text in the page image with a measure of the page image itself. In one aspect, the measure of the text may be the height of a line of text, while the measure derived from the page image may be the height of the page image. A text-to-page height ratio is computed and compared to one or more thresholds to determine legibility. In another aspect of the invention, the measure of the text is the word density of the page image, while the measure derived from the page image is the size of the compressed image file produced by compressing the page image. Legibility is then determined by comparing the word density with the compressed image file size.
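The second aspect of the abstract can be illustrated with a minimal sketch. The abstract does not state how word density and compressed file size are compared, so everything about the comparison here is an assumption: the word count of the recognized text stands in for word density, `zlib` stands in for whatever image compression is used, and the rule "too many words per compressed kilobyte means illegible" along with the names `measure_page`, `is_legible_by_density`, and `max_words_per_kb` are hypothetical.

```python
import zlib
from dataclasses import dataclass


@dataclass
class DensityMeasures:
    word_count: int        # stand-in for word density (assumption)
    compressed_bytes: int  # size of the compressed page image

    @property
    def words_per_kilobyte(self) -> float:
        # Hypothetical comparison value: words per compressed kilobyte.
        return self.word_count / (self.compressed_bytes / 1024)


def measure_page(recognized_text: str, image_bytes: bytes) -> DensityMeasures:
    """Collect the two measures named in the abstract's second aspect."""
    word_count = len(recognized_text.split())
    # zlib is used only for illustration; the source does not name a codec.
    compressed_bytes = len(zlib.compress(image_bytes))
    return DensityMeasures(word_count, compressed_bytes)


def is_legible_by_density(m: DensityMeasures, max_words_per_kb: float) -> bool:
    # ASSUMPTION: the direction of this rule is not given in the source.
    # Here, many words packed into a small compressed file is treated as
    # a sign of small, hard-to-read text.
    return m.words_per_kilobyte <= max_words_per_kb
```

The threshold is left to the caller because the source gives no value for it.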
49 Claims
1. A computer-implemented method for displaying a page image based on a determined legibility of text in the page image, the method comprising:
(a) obtaining an image of a page at a base resolution;
(b) obtaining a measure of the page image height at the base resolution;
(c) performing text recognition on text in the page image to obtain a measure of text height of the text of the page image at the base resolution;
(d) dividing the text height by the page image height to produce a text-to-page height ratio; and
(e) obtaining a new image of the page at a resolution higher than the base resolution when the text-to-page height ratio is below a threshold that indicates legibility.
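Steps (a) through (e) of claim 1 can be sketched as follows. This is a minimal illustration rather than the patented implementation: the `PageImage` type, the `fetch` callable that returns a page image at a requested resolution, and the threshold value are all hypothetical, and the text height is assumed to come from a separate text recognition step.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class PageImage:
    resolution_dpi: int
    height_px: int          # page image height at this resolution
    text_height_px: float   # text height reported by text recognition


def text_to_page_ratio(page: PageImage) -> float:
    """Step (d): divide the text height by the page image height."""
    if page.height_px <= 0:
        raise ValueError("page image height must be positive")
    return page.text_height_px / page.height_px


def image_for_display(
    fetch: Callable[[int], PageImage],  # hypothetical image source
    base_dpi: int,
    higher_dpi: int,
    threshold: float,                   # hypothetical legibility threshold
) -> PageImage:
    # Steps (a)-(c): obtain the base-resolution image and its measurements.
    page = fetch(base_dpi)
    # Step (e): a ratio below the threshold triggers a higher-resolution image.
    if text_to_page_ratio(page) < threshold:
        return fetch(higher_dpi)
    return page
```

Passing the image source in as a callable keeps the sketch independent of any particular scanner or rendering pipeline.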
2. A computer-implemented method for displaying a page image based on a determined legibility of text in the page image, the method comprising:
(a) obtaining an image of a page at a base resolution;
(b) analyzing the legibility of text in the page image and determining that text in the page image is not legible; and
(c) as a result of determining that text in the page image is not legible, obtaining a new image of the page at a resolution higher than the base resolution.
Dependent claims: 3-20
21. A computer-implemented multistage method for automated determination of legibility of text in an image of a page, comprising, under the control of instructions executed by one or more computer processors:
(a) obtaining an image of a page having text therein;
(b) measuring the legibility of the text by applying a first test of legibility to the text in the page image, wherein the first test of legibility includes determining a height of one or more lines of text in the page image;
(c) measuring the legibility of the text by applying a second test of legibility to the text in the page image; and
(d) storing the page image for display if the text in the page image is determined to be legible.
Dependent claims: 22-25
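The multistage structure of claim 21 can be sketched as two tests applied to the same page, followed by a conditional store. Several details are assumptions, since the claim does not fix them: the page is represented as a plain dictionary with a hypothetical `line_heights_px` field, the first test compares the median line height to a hypothetical minimum, the second test is left as a caller-supplied function, and a page counts as legible only when both tests pass.

```python
from statistics import median
from typing import Callable, Dict

LegibilityTest = Callable[[dict], bool]


def line_height_test(min_line_height_px: float) -> LegibilityTest:
    """First test: based on the height of one or more lines of text."""
    def test(page: dict) -> bool:
        heights = page.get("line_heights_px", [])
        if not heights:
            return False  # no measurable lines of text
        # ASSUMPTION: the median line height is the value compared.
        return median(heights) >= min_line_height_px
    return test


def process_page(
    page_id: str,
    page: dict,
    first_test: LegibilityTest,
    second_test: LegibilityTest,
    store: Dict[str, dict],
) -> bool:
    # Steps (b) and (c): apply both tests to the page image.
    first_ok = first_test(page)
    second_ok = second_test(page)
    # ASSUMPTION: the page is legible only if both tests pass.
    legible = first_ok and second_ok
    # Step (d): store the page image for display only when legible.
    if legible:
        store[page_id] = page
    return legible
```

Both tests are evaluated before the results are combined, so the second test runs even when the first one fails, which matches the claim's wording that both tests are applied.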
26. A computer-readable storage medium having stored thereon computer-executable instructions that, if executed by one or more computer processors, cause the one or more processors to execute a method for displaying a page image based on a determined legibility of text in the page image, the method comprising:
(a) obtaining an image of a page at a base resolution;
(b) analyzing the legibility of text in the page image and determining that text in the page image is not legible; and
(c) as a result of determining that text in the page image is not legible, obtaining a new image of the page obtained at a resolution higher than the base resolution.
Dependent claims: 27-44
45. A computer-readable storage medium having computer-executable instructions stored thereon that, if executed by one or more processors of a computing device, cause the computing device to execute a multistage method for automated determination of legibility of text in an image of a page, the method comprising:
(a) obtaining an image of a page having text therein;
(b) measuring the legibility of the text by applying a first test of legibility to the text in the page image, wherein the first test of legibility includes determining a height of one or more lines of text in the page image;
(c) measuring the legibility of the text by applying a second test of legibility to the text in the page image; and
(d) storing the page image for display if the text in the page image is determined to be legible.
Dependent claims: 46-49
Specification