Graphical system for automated segmentation and recognition for image recognition systems
First Claim
1. In symbol recognition apparatus comprising an image sensor, an image segmenter, a memory, a recognizer including a library of known images and a comparison means for generating a measure of probability that a sensed image is a one of said known images, the improvement in recognizing one or more unknown alphanumeric symbols, comprising:
- means for sensing an image of said symbol string with said image sensor;
means for segmenting the image of said symbol string with said segmenter to construct candidate cuts through said string, thereby creating a multiplicity of image segments potentially greater in number than the symbols present in said string and sufficient in number to include the correct segmentation;
means for arranging in said memory said segments as a sequence of nodes, the nodes at the beginning point of said sequence corresponding to the left-hand side of said string, and the nodes at the end-point in said sequence corresponding to the right-hand side of said string;
means for identifying all nodes corresponding to segments which are legal neighbors and connecting with arcs from left to right all such nodes, thereby creating multiple paths across said sensed image, each said path corresponding to a particular legal segmentation of said image;
means for comparing said segments to predetermined data in a definite cut database thereby to identify all nodes which straddle a definite cut;
means for eliminating from said memory said last-named nodes as well as all arcs connected to each said node;
means for identifying nodes which because of the preceding step no longer have a parent-child connection, and recursively eliminating all said nodes and associated arcs from said memory;
means for generating in said recognizer a measure of probability for each remaining node returning said measure to said memory for association with the respective node; and
means for combining said probability measures associated with all of the nodes in each said remaining node path, thereby to produce outputs consisting of a symbol identification and a total probability measure associated with each particular node path.
2 Assignments
0 Petitions
Accused Products
Abstract
Apparatus and processes are described for the automatic recognition of alphanumeric images. A set of cuts are made to the image which include incorrect segmentations. The resulting "cells" comprising in their totality the created segments of the image are then analyzed to determine which cells are legal neighbors and which are not. All cells which are legal neighbors are then presented as connected nodes. A pruning of nodes which are related to certain predetermined image cuts is effected. Each set of remaining connected nodes is then presented to a recognizer which identifies the image and assigns a specified probability to the output. Many cells which are not legal neighbors are thereby not presented to the recognizer, thus saving substantially on computations per recognized image.
-
Citations
29 Claims
-
1. In symbol recognition apparatus comprising an image sensor, an image segmenter, a memory, a recognizer including a library of known images and a comparison means for generating a measure of probability that a sensed image is a one of said known images, the improvement in recognizing one or more unknown alphanumeric symbols, comprising:
-
means for sensing an image of said symbol string with said image sensor; means for segmenting the image of said symbol string with said segmenter to construct candidate cuts through said string, thereby creating a multiplicity of image segments potentially greater in number than the symbols present in said string and sufficient in number to include the correct segmentation; means for arranging in said memory said segments as a sequence of nodes, the nodes at the beginning point of said sequence corresponding to the left-hand side of said string, and the nodes at the end-point in said sequence corresponding to the right-hand side of said string; means for identifying all nodes corresponding to segments which are legal neighbors and connecting with arcs from left to right all such nodes, thereby creating multiple paths across said sensed image, each said path corresponding to a particular legal segmentation of said image; means for comparing said segments to predetermined data in a definite cut database thereby to identify all nodes which straddle a definite cut; means for eliminating from said memory said last-named nodes as well as all arcs connected to each said node; means for identifying nodes which because of the preceding step no longer have a parent-child connection, and recursively eliminating all said nodes and associated arcs from said memory; means for generating in said recognizer a measure of probability for each remaining node returning said measure to said memory for association with the respective node; and means for combining said probability measures associated with all of the nodes in each said remaining node path, thereby to produce outputs consisting of a symbol identification and a total probability measure associated with each particular node path. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. In symbol recognition apparatus comprising an image sensor, an image segmenter, a memory, a neural network recognizer including a library of known images and a comparison means for generating a measure of probability that a sensed image is a one of said known images, the improvement in recognizing one or more unknown alphanumeric symbols, comprising:
-
means for sensing an image of said one or more symbols with said image sensor and converting said sensed image to pixel format;
means for segmenting said pixel image of said symbols with said segmenter to construct candidate cuts through said symbols, thereby creating a multiplicity of image cells greater in number than the symbols present, and sufficient in number to include the correct segmentation;means for arranging in said memory said segments as a graphical array of several node sequences, the nodes at the beginning point of said sequence corresponding to the left-hand side of said graph, and the nodes at the end-point in said sequence corresponding to the right-hand side of said graph; means for identifying all nodes corresponding to segments which are legal neighbors and connecting with graphed arcs from left to right all such nodes, thereby creating multiple paths across said sensed image, each said path corresponding to a particular legal segmentation of said image; means for providing a predetermined database of definite cuts information and comparing said cuts to said database information thereby to identify all nodes which straddle a definite cut; means for eliminating from said memory said last-named nodes as well as all arcs connected to each said node; means for generating in said neural network recognizer a said measure of probability for each remaining node returning said measure to said memory for association with the respective node; and means for combining said probability measures associated with all of the nodes in each said remaining node path, thereby to produce outputs consisting of a symbol identification and a total probability measure associated with each particular node path. - View Dependent Claims (8, 9, 10, 11, 12, 13)
-
-
14. In an alphanumeric symbol recognition engine comprising an image sensor, an image segmenter, a memory, a neural network recognizer including a library for receiving known images and a comparison means for generating multiple measures of probability for a known inputted image with respect to all possible alphanumeric members of the class to which said inputted image belongs, apparatus for automatically generating individual character training images using input images and their corresponding "truth" information, comprising:
-
means for sensing one or more images of said known alphanumeric symbols with said image sensor and converting said sensed images to pixel format; means for segmenting said pixel images of said alphanumeric symbols with said segmenter to construct candidate cuts through said images, thereby creating a multiplicity of sequential image cells possibly greater in number than the alphanumeric symbols presented, and sufficient in number to include the correct segmentation of each said image; means for arranging in said memory said cells as a graphical array of several node sequences, the nodes at the beginning point of said sequence corresponding to the left-hand side of said graph, and the nodes at the end-point in said sequence corresponding to the right-hand side of said graph; means for identifying all nodes corresponding to segments which are legal neighbors and connecting with graphed arcs from left to right all such nodes, thereby creating multiple paths across said sensed images, each said path corresponding to a particular legal segmentation of said images in one-to-one correspondence; means for providing a predetermined database of definite cuts information and comparing said cuts to said database information thereby to identify all nodes which straddle a definite cut; means for eliminating from said memory said last-named nodes as well as all arcs connected to each said node; means for generating in said neural network recognizer a said measure of probability for each remaining node and returning said measure to said memory for association with the respective node; and means for combining and storing all said probability measures associated with all of the nodes in all said remaining node paths, thereby to produce alphanumeric identification data for said library consisting of a symbol identification and a total probability measure associated with all particular node paths; whereby segmentation of said image is substantially automatic. - View Dependent Claims (15, 16, 17, 18)
-
-
19. A method for machine-reading a string of unknown alphanumeric symbols on a graphical record, comprising
a) with an image sensor, scanning the graphical record, thereby to produce a pixel image of the symbol string; -
b) segmenting the pixel image into a plurality of cells equal in number to at least the expected number of alphanumeric symbols in the string; c) storing each resulting cell in a memory device; d) selecting at least some of the cells; e) recalling each of the selected cells from the memory device and presenting said cell as input to a recognition engine; and f) receiving from the recognition engine a probability measure which indicates the likelihood that the cell belongs to the same class as some member of a library of known images, wherein; g) the storing step comprises arranging in said memory said cells, or combinations thereof, as a sequence of nodes, the nodes at the beginning point of said sequence corresponding to the left-hand side of said string, and the nodes at the end-point in said sequence corresponding to the right-hand side of said string; h) the selecting step comprises identifying all nodes corresponding to segments which are legal neighbors and connecting with arcs from left to right all such nodes, thereby creating multiple paths across said sensed image, each said path corresponding to a particular legal segmentation of said image; i) the selecting step further comprises comparing said segments to predetermined data in a definite cut database thereby to identify all nodes which straddle a definite cut of said string image; j) the selecting step further comprises eliminating from said memory said last-named nodes as well as all arcs leading from or terminating on each said node; k) the receiving step is performed such that a probability measure is thereby associated with each of the nodes that remain after step (j); and l) the method further comprises combining said probability measures associated with all of the nodes in each said remaining node path, thereby to produce outputs consisting of a symbol identification and a total probability measure associated with each particular node path. - View Dependent Claims (20, 21, 22, 23, 24)
-
-
25. A method for training a neural network recognition engine, including a reference library for recording therein known images, to recognize alphanumeric symbols, the method comprising:
-
a) with an image sensor, scanning at least one graphical record, thereby to produce a pixel image of a symbol string borne by said record; b) segmenting the pixel image into a plurality of cells equal in number to at least the expected number of alphanumeric symbols in the string; c) storing each resulting cell in a memory device; d) selecting at least some of the cells; e) recalling each of the selected cells from the memory device and presenting said cell as input to the recognition engine; and f) receiving from the recognition engine a probability measure which indicates the likelihood that the cell belongs to the same class as some member of a library of known images, wherein; g) the storing step comprises arranging in said memory said cells, or combinations thereof, as a sequence of nodes, the nodes at the beginning point of said sequence corresponding to the left-hand side of said string, and the nodes at the end-point in said sequence corresponding to the right-hand side of said string; h) the selecting step comprises identifying all nodes corresponding to segments which are legal neighbors and connecting with arcs from left to right all such nodes, thereby creating multiple paths across said sensed image, each said path corresponding to a particular legal segmentation of said image; i) the selecting step further comprises comparing said segments to predetermined data in a definite cut database thereby to identify all nodes which straddle a definite cut of said string image; j) the selecting step further comprises eliminating from said memory said last-named nodes as well as all arcs leading from or terminating on each said node; k) the receiving step is performed such that a probability measure is thereby associated with each of the nodes that remain after the eliminating step; l) the method further comprises combining said probability measures associated with all of the nodes in each said remaining node path, thereby to produce alphanumeric identification dam comprising a symbol identification and a total probability measure associated with each particular node path; and m) recording said alphanumeric identification data in the library. - View Dependent Claims (26, 27, 28, 29)
-
Specification