Data normalization for handwriting recognition
First Claim
Patent Images
1. A method of optimizing normalization for handwriting recognition, comprising:
- obtaining an image comprising handwriting data corresponding to at least one word;
pre-processing the obtained image to produce a pre-processed image comprising multiple pixels;
normalizing a height of the pre-processed image to generate a normalized image by;
calculating a vertical histogram for the pre-processed image,detecting a middle height of the vertical histogram,increasing a major component height from the middle height until a pre-determined count of pixels out of the multiple pixels of the pre-processed image are covered,calculating a ratio between a pre-determined fixed height and the major component height, andzooming in or out of the pre-processed image at the calculated ratio to generate the normalized image;
sending the normalized image and a normalization setting to RNN (Recurrent Neural Network); and
obtaining a recognition result from the RNN for the normalized image based on the normalization setting.
1 Assignment
0 Petitions
Accused Products
Abstract
Disclosed herein is a method of optimizing data normalization by selecting the best height normalization setting from training RNN (Recurrent Neural Network) with one or more datasets comprising multiple sample images of handwriting data, which comprises estimating a few top place ratios for normalization by minimizing a cost function for any given sample image in the training dataset, and further, determining the best ratio from the top place ratios by validating the recognition results of sample images with each top place ratio.
-
Citations
27 Claims
-
1. A method of optimizing normalization for handwriting recognition, comprising:
-
obtaining an image comprising handwriting data corresponding to at least one word; pre-processing the obtained image to produce a pre-processed image comprising multiple pixels; normalizing a height of the pre-processed image to generate a normalized image by; calculating a vertical histogram for the pre-processed image, detecting a middle height of the vertical histogram, increasing a major component height from the middle height until a pre-determined count of pixels out of the multiple pixels of the pre-processed image are covered, calculating a ratio between a pre-determined fixed height and the major component height, and zooming in or out of the pre-processed image at the calculated ratio to generate the normalized image; sending the normalized image and a normalization setting to RNN (Recurrent Neural Network); and obtaining a recognition result from the RNN for the normalized image based on the normalization setting. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A computer program product comprising a computer usable non-transitory medium having a computer readable program code embedded therein for controlling a data processing apparatus, the computer readable program code configured to cause the data processing apparatus to execute a process for optimizing data normalization for handwriting recognition, the process comprising:
-
obtaining an image comprising handwriting data corresponding to at least one word; pre-processing the obtained image to produce a pre-processed image comprising multiple pixels; normalizing a height of the pre-processed image to generate a normalized image by; calculating a vertical histogram for the pre-processed image, detecting a middle height of the vertical histogram, increasing a major component height from the middle height until a pre-determined count of pixels out of the multiple pixels of the pre-processed image are covered, calculating a ratio between a pre-determined fixed height and the major component height, and zooming in or out of the pre-processed image at the calculated ratio to generate the normalized image; sending the normalized image and a normalization setting to RNN (Recurrent Neural Network); and obtaining a recognition result from the RNN for the normalized image based on the normalization setting. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. An apparatus for optimizing normalization for handwriting recognition, comprising:
-
a processor configured for obtaining an image comprising handwriting data corresponding to at least one word; a pre-processor configured for pre-processing the obtained image to produce a pre-processed image comprising multiple pixels; and a normalizer configured for normalizing a height of the pre-processed image to generate a normalized image by; calculating a vertical histogram for the pre-processed image, detecting a middle height of the vertical histogram, increasing a major component height from the middle height until a pre-determined count of pixels out of the multiple pixels of the pre-processed image are covered, calculating a ratio between a pre-determined fixed height and the major component height, and zooming in or out of the pre-processed image at the calculated ratio to generate the normalized image; wherein the processor is further configured for sending the normalized image and a normalization setting to RNN (Recurrent Neural Network) and obtaining a recognition result from the RNN for the normalized image based on the normalization setting. - View Dependent Claims (20, 21, 22, 23, 24, 25, 26, 27)
-
Specification