Table data recovering in case of image distortion
First Claim
1. A method comprising:
- identifying edges of at least a section of a document in a source image, wherein the document comprises at least one first row of text;
identifying a plurality of characters in the document;
identifying ones of the characters as a plurality of word portions;
generating a plurality of first polynomials that approximate points of the characters within the word portions;
generating a second polynomial that approximates the points of the characters of at least a first word portion and a second word portion among the word portions and corresponds to the first row of text;
identifying at least one stretching coefficient of the first row of text based on a maximum length of the section between the edges and a length of the second polynomial; and
mapping, by at least one processing device, portions of the source image along the first row of text to new positions in a corrected image based on the second polynomial and the stretching coefficient.
3 Assignments
0 Petitions
Accused Products
Abstract
The subject matter of this specification can be implemented in, among other things, a method that includes identifying edges of a section of a document in a source image that includes at least one row of text. The method includes identifying characters in the document. The method includes identifying word portions. The method includes generating polynomials that approximate points of the characters within the word portions. The method includes generating a second polynomial that approximates the points of the characters of word portions. The method includes identifying a stretching coefficient of the row of text based on a length of the section between the edges relative to a length of the second polynomial. The method includes mapping portions of the source image along the row of text to new positions in a corrected image based on the second polynomial and the stretching coefficient.
5 Citations
20 Claims
-
1. A method comprising:
-
identifying edges of at least a section of a document in a source image, wherein the document comprises at least one first row of text; identifying a plurality of characters in the document; identifying ones of the characters as a plurality of word portions; generating a plurality of first polynomials that approximate points of the characters within the word portions; generating a second polynomial that approximates the points of the characters of at least a first word portion and a second word portion among the word portions and corresponds to the first row of text; identifying at least one stretching coefficient of the first row of text based on a maximum length of the section between the edges and a length of the second polynomial; and mapping, by at least one processing device, portions of the source image along the first row of text to new positions in a corrected image based on the second polynomial and the stretching coefficient. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A non-transitory computer-readable medium having instructions stored thereon that, when executed by at least one processing device, cause the processing device to:
-
identify edges of at least a section of a document in a source image, wherein the document comprises at least one first row of text; identify a plurality of characters in the document; identify ones of the characters as a plurality of word portions; generate a plurality of first polynomials that approximate points of the characters within the word portions; generate a second polynomial that approximates the points of the characters of at least a first word portion and a second word portion among the word portions and corresponds to the first row of text; identify at least one stretching coefficient of the first row of text based on a maximum length of the section between the edges and a length of the second polynomial; and map, by the processing device, portions of the source image along the first row of text to new positions in a corrected image based on the second polynomial and the stretching coefficient.
-
-
13. A system comprising:
-
at least one memory that stores instructions; and at least one processing device to execute the instructions to; identify edges of at least a section of a document in a source image, wherein the document comprises at least one first row of text; identify a plurality of characters in the document; identify ones of the characters as a plurality of word portions; generate a plurality of first polynomials that approximate points of the characters within the word portions; generate a second polynomial that approximates the points of the characters of at least a first word portion and a second word portion among the word portions and corresponds to the first row of text; identify at least one stretching coefficient of the first row of text based on a maximum length of the section between the edges and a length of the second polynomial; and map portions of the source image along the first row of text to new positions in a corrected image based on the second polynomial and the stretching coefficient. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20)
-
Specification