Method of and apparatus for segmenting foreground and background information for optical character recognition of labels employing single layer recurrent neural network
First Claim
1. A method of segmenting foreground areas from background areas of an input image including input pixels having grey scale input pixel values, comprising the steps of:
- defining an output image comprising output pixels having binary output pixel values;
assigning each input pixel and each output pixel a position in a coordinate system so that the input image and the output image correspond to the physical configuration of the substrate;
assigning a scaling factor to each input pixel so that a scaling factor assigned to an input pixel represents an estimate of the background input pixel values in a neighborhood about the input pixel;
defining an energy function that varies as a function of the input pixels values, the output pixels values, and the scaling factors comprisinga first component expressing a cost associated with an output pixel value not being in agreement with the value of a corresponding input pixel taking into account the value of the scaling factor assigned to the corresponding input pixel,a second component expressing a cost associated with the value of an output pixel not being in agreement with the values of pixels in a neighborhood around the output pixel, anda third component expressing a cost associated with the gradient of the input pixel values so as to define boundaries between regions of the input image having contrasting input pixel values; and
segmenting foreground areas of the output image from background areas of the output image by iteratively varying the output pixel values so that the energy function converges toward a minimum value.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and apparatus for processing a greyscale input of an image, particularly of a shipping label, into a binary output image in which foreground information is segmented from the background information and contrasts between adjacent regions having different background densities are obliterated is described. A neuron employing a 5×5 input neighborhood having a unique neuron activation function is shown. No explicit line process is employed. Output is biased toward a particular one of the output values by employing non-linear feedback as a function of both the grey scale value for the pixel corresponding to the label site being updated and the most recent value of the label site. The otherwise strong contribution from a gradient term in the energy function is suppressed by a shunting inhibition when the shunting inhibition function detects that the pixel lies on or near a boundary between adjacent regions of differing background intensities.
95 Citations
12 Claims
-
1. A method of segmenting foreground areas from background areas of an input image including input pixels having grey scale input pixel values, comprising the steps of:
-
defining an output image comprising output pixels having binary output pixel values; assigning each input pixel and each output pixel a position in a coordinate system so that the input image and the output image correspond to the physical configuration of the substrate; assigning a scaling factor to each input pixel so that a scaling factor assigned to an input pixel represents an estimate of the background input pixel values in a neighborhood about the input pixel; defining an energy function that varies as a function of the input pixels values, the output pixels values, and the scaling factors comprising a first component expressing a cost associated with an output pixel value not being in agreement with the value of a corresponding input pixel taking into account the value of the scaling factor assigned to the corresponding input pixel, a second component expressing a cost associated with the value of an output pixel not being in agreement with the values of pixels in a neighborhood around the output pixel, and a third component expressing a cost associated with the gradient of the input pixel values so as to define boundaries between regions of the input image having contrasting input pixel values; and segmenting foreground areas of the output image from background areas of the output image by iteratively varying the output pixel values so that the energy function converges toward a minimum value. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. An apparatus for segmenting foreground areas from background areas of an input image including input pixels having grey scale input pixel values, comprising:
-
a computer storage medium for defining an output image comprising output pixels having binary output pixel values; means for assigning each input pixel and each output pixel a position in a coordinate system so that the input image and the output image correspond to the physical configuration of the substrate; means for assigning a scaling factor to each input pixel so that a scaling factor assigned to an input pixel represents an estimate of the background input pixel values in a neighborhood about the input pixel; means for defining an energy function that varies as a function of the input pixels values, the output pixels values, and the scaling factors comprising, a first component expressing a cost associated with an output pixel value not being in agreement the value of a corresponding input pixel taking into account the value of the scaling factor assigned to the corresponding input pixel, a second component expressing a cost associated with the value of an output pixel not being in agreement with the values of its neighboring output pixels, and a third component expressing a cost associated with the gradient of the input pixel values so as to define boundaries between regions of the input image having contrasting input pixel values; and means for segmenting foreground areas of the output image from background areas of the output image by iteratively varying the output pixel values so that the energy function converges toward a minimum value. - View Dependent Claims (10, 11, 12)
-
Specification