Path score calculating method for intelligent character recognition
First Claim
1. A method for handwriting recognition, comprising:
- detecting connected regions in an input image comprising handwritten characters;
determining a plurality of segmentation positions of the input image;
obtaining multiple alternative paths of the input image, each path containing one or more segments, each path covering all of the connected regions of the input image, the multiple alternative paths being different from each other;
for each path;
obtaining a plurality of recognition results for the segments of the path, wherein each recognition result represents a character candidate for a corresponding segment;
obtaining a plurality of scores corresponding to the recognition results of the segments; and
calculating a path score for the path based on segment lengths and the scores corresponding to the recognition results of the segments using the following equation;
Spath=a*(S11*Len1+S12*Len2+ . . . +S1m*Lenm)+b*min(S11, S12, . . . S1m), wherein Len1, Len2 . . . Lenm represent respective segment lengths for Seg1, Seg2, . . . , Segm of the path, (S11, S12, . . . , S1m) represent scores corresponding to recognition results (R11, R12, . . . R1m) of a decoding path, min means the minimization function, and a and b are pre-defined constants; and
identifying the path with the highest path score and using that path to interpret the handwritten characters in the input image.
1 Assignment
0 Petitions
Accused Products
Abstract
Disclosed herein is a method that improves the performance of handwriting recognition by calculating path scores so as to identify the path with the highest score as the basis for interpreting handwritten characters. Specifically, the method comprises the following steps: detecting connected regions in an input image comprising handwritten characters; determining a plurality of segmentation positions of the input image; obtaining a plurality of recognition results for each segment of each path in the input image, wherein each recognition result represents a character candidate for the segment and each path comprises one or more segments; obtaining a plurality of scores corresponding to the recognition results; calculating scores for each path in the input image based on segment lengths and the scores corresponding to the recognition results; and using the path with the highest score to interpret the handwritten characters in the input image.
-
Citations
19 Claims
-
1. A method for handwriting recognition, comprising:
-
detecting connected regions in an input image comprising handwritten characters; determining a plurality of segmentation positions of the input image; obtaining multiple alternative paths of the input image, each path containing one or more segments, each path covering all of the connected regions of the input image, the multiple alternative paths being different from each other; for each path; obtaining a plurality of recognition results for the segments of the path, wherein each recognition result represents a character candidate for a corresponding segment; obtaining a plurality of scores corresponding to the recognition results of the segments; and calculating a path score for the path based on segment lengths and the scores corresponding to the recognition results of the segments using the following equation;
Spath=a*(S11*Len1+S12*Len2+ . . . +S1m*Lenm)+b*min(S11, S12, . . . S1m), wherein Len1, Len2 . . . Lenm represent respective segment lengths for Seg1, Seg2, . . . , Segm of the path, (S11, S12, . . . , S1m) represent scores corresponding to recognition results (R11, R12, . . . R1m) of a decoding path, min means the minimization function, and a and b are pre-defined constants; andidentifying the path with the highest path score and using that path to interpret the handwritten characters in the input image. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A computer program product comprising a computer usable non-transitory medium having a computer readable program code embedded therein for controlling a data processing apparatus, the computer readable program code configured to cause the data processing apparatus to execute a process for handwriting recognition, the process comprising:
-
detecting connected regions in an input image comprising handwritten characters; determining a plurality of segmentation positions of the input image; obtaining multiple alternative paths of the input image, each path containing one or more segments, each path covering all of the connected regions of the input image, the multiple alternative paths being different from each other; for each path; obtaining a plurality of recognition results for the segments of the path, wherein each recognition result represents a character candidate for a corresponding segment; obtaining a plurality of scores corresponding to the recognition results of the segments; and calculating a path score for the path based on segment lengths and the scores corresponding to the recognition results of the segments using the following equation;
Spath=a*(S11*Len1+S12*Len2+ . . . +S1m*Lenm)+b*min(S11, S12, . . . S1m), wherein Len1, Len2 . . . Lenm represent respective segment lengths for Seg1, Seg2, . . . , Segm of the path, (S11, S12, . . . , S1m) represent scores corresponding to recognition results (R11, R12 . . . R1m) of a decoding path, min means the minimization function, and a and b are pre-defined constants; andidentifying the path with the highest path score and using that path to interpret the handwritten characters in the input image. - View Dependent Claims (12, 13, 14, 15)
-
-
16. A computer program product comprising a computer usable non-transitory medium having a computer readable program code embedded therein for controlling a data processing apparatus, the computer readable program code configured to cause the data processing apparatus to execute a process for handwriting recognition, the process comprising:
-
detecting connected regions in an input image comprising handwritten characters; estimating an upper baseline and a lower baseline for the input image based on the connected regions, wherein the upper and lower baselines are estimated by; detecting top and bottom positions of each connected region of the input image; clustering the top positions into a first higher center and a first lower center, wherein the first lower center is selected as the upper baseline; and clustering the bottom positions into a second higher center and a second lower center, wherein the second higher center is selected as the lower baseline; estimating a character height based on the upper and lower baselines; determining a plurality of segmentation positions of the input image; obtaining multiple alternative paths of the input image, each path containing one or more segments, each path covering all of the connected regions of the input image, the multiple alternative paths being different from each other; for each path; obtaining a plurality of recognition results for the segments of the path, wherein each recognition result represents a character candidate for a corresponding segment; obtaining a plurality of scores corresponding to the recognition results of the segments; and calculating a path score for the path based on segment lengths and the scores corresponding to the recognition results of the segments; and identifying the path with the highest path score and using that path to interpret the handwritten characters in the input image.
-
-
17. A computer program product comprising a computer usable non-transitory medium having a computer readable program code embedded therein for controlling a data processing apparatus, the computer readable program code configured to cause the data processing apparatus to execute a process for handwriting recognition, the process comprising:
-
detecting connected regions in an input image comprising handwritten characters; determining a plurality of segmentation positions of the input image, wherein the segmentation positions are determined by; obtaining a top contour and a bottom contour of the input image; selecting a plurality of extremum points in the top and bottom contours as potential segmentation positions; and for each of the plurality of potential segmentation positions, drawing a vertical line at the potential segmentation position, determining whether the vertical line crosses a foreground of the input image more than two times and if so, deleting the potential segmentation position, and determining whether the vertical line crosses a circle in the foreground and if so, deleting the potential segmentation position; obtaining multiple alternative paths of the input image, each path containing one or more segments, each path covering all of the connected regions of the input image, the multiple alternative paths being different from each other; for each path; obtaining a plurality of recognition results for the segments of the path, wherein each recognition result represents a character candidate for a corresponding segment; obtaining a plurality of scores corresponding to the recognition results of the segments; and calculating a path score for the path based on segment lengths and the scores corresponding to the recognition results of the segments; and identifying the path with the highest path score and using that path to interpret the handwritten characters in the input image. - View Dependent Claims (18, 19)
-
Specification