Character recognition system which cuts between touching characters
First Claim
1. A method for cutting individual characters out of character image data which includes at least two touching characters, said method comprising the steps of:
- detecting a first pair of vertical bars in the character image data;
checking for a presence of image data in each of plural discrete horizontal bands located between the first pair of vertical bars;
classifying the first pair of vertical bars into one of plural classes based on which of the plural discrete horizontal bands contain image data;
measuring a distance between the first pair of vertical bars;
comparing the distance between the first pair of vertical bars to one of plural predetermined distances, the one of the plural predetermined distances corresponding to the class into which the first pair of vertical bars has been classified; and
determining whether and where to cut between the first pair of vertical bars based on at least one of the classification and the comparison between the first pair of vertical bars and the one of the plural predetermined distances, wherein, for all classifications of the first pair of vertical bars, when it is determined to cut, the determining step determines to cut at one of only two locations between the first pair of vertical bars.
1 Assignment
0 Petitions
Accused Products
Abstract
An optical character recognition system cuts between touching characters. A first cut is made between touching characters based on white spaces, and the cut characters are subjected to character recognition processing. All characters not recognized are then cut again. A pair of adjacent vertical bars is detected in a vertical histogram of character image data of unrecognized characters, the vertical bars having a vertical component in the histogram that exceeds a predetermined vertical threshold. Horizontal crossings are detected in each of three discrete horizontal bands between the vertical bars. The vertical bars are classified according to the detected horizontal crossings, and, based on the classification, a decision is made whether or not to cut between the vertical bars, and where to cut between the vertical bars.
-
Citations
120 Claims
-
1. A method for cutting individual characters out of character image data which includes at least two touching characters, said method comprising the steps of:
-
detecting a first pair of vertical bars in the character image data; checking for a presence of image data in each of plural discrete horizontal bands located between the first pair of vertical bars; classifying the first pair of vertical bars into one of plural classes based on which of the plural discrete horizontal bands contain image data; measuring a distance between the first pair of vertical bars; comparing the distance between the first pair of vertical bars to one of plural predetermined distances, the one of the plural predetermined distances corresponding to the class into which the first pair of vertical bars has been classified; and determining whether and where to cut between the first pair of vertical bars based on at least one of the classification and the comparison between the first pair of vertical bars and the one of the plural predetermined distances, wherein, for all classifications of the first pair of vertical bars, when it is determined to cut, the determining step determines to cut at one of only two locations between the first pair of vertical bars. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15)
-
-
16. An apparatus for cutting individual characters out of character image data which includes at least two touching characters, said apparatus comprising:
-
detecting means for detecting a first pair of vertical bars in the character image data; checking means for checking for a presence of image data in each of plural discrete horizontal bands located between the first pair of vertical bars; classifying means for classifying the first pair of vertical bars into one of plural classes based on which of the plural discrete horizontal bands contain image data; measuring means for measuring a distance between the first pair of vertical bars; comparing means for comparing the distance between the first pair of vertical bars to one of plural predetermined distances, the one of the plural predetermined distances corresponding to the class into which the first pair of vertical bars has been classified; and determining means for determining whether and where to cut between the first pair of vertical bars based on at least one of the classification and the comparison between the first pair of vertical bars and the one of the plural predetermined distances, wherein, for all classifications of the first pair of vertical bars, when it is determined to cut, the determining means determines to cut at one of only two locations between the first pair of vertical bars. - View Dependent Claims (17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30)
-
-
31. An optical character recognition method for recognizing characters in input image data, comprising the steps of:
-
inputting image data including character image data which represents images of characters; cutting character image data out of the image data; performing optical character recognition processing on the cut character image data; selecting unrecognized cut character image data; detecting a first pair of vertical bars in the unrecognized cut character image data; checking for a presence of character image data in each of plural discrete horizontal bands located between the first pair of vertical bars; classifying the first pair of vertical bars into one of plural classes based on which of the plural discrete horizontal bands contain character image data; measuring a distance between the first pair of vertical bars; comparing the distance between the first pair of vertical bars to one of plural predetermined distances, the one of the plural predetermined distances corresponding to the class into which the first pair of vertical bars has been classified; determining whether and where to cut between the first pair of vertical bars in the unrecognized cut character image data based on at least one of the classification and the comparison between the first pair of vertical bars and the one of the plural predetermined distances, wherein, for all classifications of the first pair of vertical bars, when it is determined to cut, the determining step determines to cut at one of only two locations between the first pair of vertical bars; cutting between the first pair of vertical bars in the unrecognized character image data when it is determined to cut between the vertical bars; performing optical character recognition processing on newly-cut character image data; and outputting recognized characters. - View Dependent Claims (32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45)
-
-
46. An optical character recognition apparatus for recognizing characters in input image data, comprising:
-
inputting means for inputting image data including character image data which represents images of characters; cutting means for cutting character image data in the image data; optical character recognition processing means for performing optical character recognition processing on the cut character image data; selecting means for selecting unrecognized cut character image data; detecting means for detecting a first pair of vertical bars in the unrecognized cut character image data; checking means for checking for a presence of character image data in each of plural discrete horizontal bands located between the first pair of vertical bars; classifying means for classifying the first pair of vertical bars into one of plural classes based on which of the plural discrete horizontal bands contain character image data; measuring means for measuring a distance between the first pair of vertical bars; comparing means for comparing the distance between the first pair of vertical bars to one of plural predetermined distances, the one of the plural predetermined distances corresponding to the class into which the first pair of vertical bars has been classified; determining means for determining whether and where to cut between the first pair of vertical bars in the unrecognized cut character image data based on at least one of the classification and the-comparison between the first pair of vertical bars and the one of the plural predetermined distances, wherein, for all classifications of the first pair of vertical bars, when it is determined to cut, the determining means determines to cut at one of only two locations between the first pair of vertical bars; bar cutting means for cutting between the first pair of vertical bars in the unrecognized character image data when it is determined to cut between the first pair of vertical bars; optical character recognition processing means for performing optical character recognition processing on newly-cut character image data; and outputting means for outputting recognized characters. - View Dependent Claims (47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60)
-
-
61. Computer-executable process steps stored in a computer-readable medium, the computer-executable process steps to cut individual characters out of character image data which includes at least two touching characters, the computer-executable process steps comprising:
-
code to detect a first pair of vertical bars in the character image data; code to check for a presence of image data in each of plural discrete horizontal bands located between the first pair of vertical bars; code to classify the first pair of vertical bars into one of plural classes based on which of the plural discrete horizontal bands contain image data; code to measure a distance between the first pair of vertical bars; code to compare the distance between the first pair of vertical bars to one of plural predetermined distances, the one of the plural predetermined distances corresponding to the class into which the first pair of vertical bars has been classified; and code to determine whether and where to cut between the first pair of vertical bars based on at least one of the classification and the comparison between the first pair of vertical bars and the one of the plural predetermined distances, wherein, for all classifications of the first Pair of vertical bars, when it is determined to cut, the code to determine determines to cut at one of only two locations between the first pair of vertical bars. - View Dependent Claims (62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75)
-
-
76. Computer-executable process steps stored in a computer-readable medium, the computer-executable process steps comprising an optical character recognition method to recognize characters in input image data, the computer-executable process steps comprising:
-
code to input image data including character image data which represents images of characters; code to cut character image data out of the image data; code to perform optical character recognition processing on the cut character image data; code to select unrecognized cut character image data; code to detect a first pair of vertical bars in the unrecognized cut character image data; code to check for a presence of character image data in each of plural discrete horizontal bands located between the first pair of vertical bars; code to classify the first pair of vertical bars into one of plural classes based on which of the plural discrete horizontal bands contain character image data; code to measure a distance between the first pair of vertical bars; code to compare the distance between the first pair of vertical bars to one of plural predetermined distances, the one of the plural predetermined distances corresponding to the class into which the first pair of vertical bars has been classified; code to determine whether and where to cut between the first pair of vertical bars in the unrecognized cut character image data based on at least one of the classification and the comparison between the first pair of vertical bars and the one of the plural predetermined distances, wherein, for all classifications of the first pair of vertical bars, the code to determine determines to cut at one of only two locations between the first pair of vertical bars; code to cut between the first pair of vertical bars in the unrecognized character image data when it is determined to cut between the vertical bars; code to perform optical character recognition processing on newly-cut character image data; and code to output recognized characters. - View Dependent Claims (77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90)
-
-
91. A computer-readable memory medium which stores computer-executable process steps to cut individual characters out of character image data which includes at least two touching characters, the computer-executable process steps comprising:
-
a detecting step to detect a first pair of vertical bars in the character image data; a checking step to check for a presence of image data in each of plural discrete horizontal bands located between the first pair of vertical bars; a classifying step to classify the first pair of vertical bars into one of plural classes based on which of the plural discrete horizontal bands contain image data; a measuring step to measure a distance between the first pair of vertical bars; a comparing step to compare the distance between the first pair of vertical bars to one of plural predetermined distances, the one of the plural predetermined distances corresponding to the class into which the first pair of vertical bars has been classified; and a determining step to determine whether and where to cut between the first pair of vertical bars based on at least one of the classification and the comparison between the first pair of vertical bars and the one of the plural predetermined distances, wherein, for all classifications of the first pair of vertical bars, when it is determined to cut, the determining step determines to cut at one of only two locations between the first pair of vertical bars. - View Dependent Claims (92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105)
-
-
106. A computer-readable memory medium which stores computer-executable process steps to execute an optical character recognition method to recognize characters in input image data, the computer-executable process steps comprising:
-
an inputting step to input image data including character image data which represents images of characters; a first cutting step to cut character image data out of the image data; a first optical character recognition processing step to perform optical character recognition processing on the cut character image data; a selecting step to select unrecognized cut character image data; a detecting step to detect a first pair of vertical bars in the unrecognized cut character image data; a checking step to check for a presence of character image data in each of plural discrete horizontal bands located between the first pair of vertical bars; a classifying step to classify the first pair of vertical bars into one of plural classes based on which of the plural discrete horizontal bands contain character image data; a measuring step to measure a distance between the first pair of vertical bars; a comparing step to compare the distance between the first pair of vertical bars to one of plural predetermined distances, the one of the plural predetermined distances corresponding to the class into which the first pair of vertical bars has been classified; a determining step to determine whether and where to cut between the first pair of vertical bars in the unrecognized cut character image data based on at least one of the classification and the comparison between the first pair of vertical bars and the one of the plural predetermined distances, wherein, for all classifications of the first pair of vertical bars, when it is determined to cut, the determining step determines to cut at one of only two locations between the first pair of vertical bars; a second cutting step to cut between the first pair of vertical bars in the unrecognized character image data when it is determined to cut between the vertical bars; a second optical character recognition processing step to perform optical character recognition processing on newly-cut character image data; and an outputting step to output recognized characters. - View Dependent Claims (107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120)
-
Specification