Apparatus and method of fusing the outputs of multiple intelligent character recognition (ICR) systems to reduce error rate
First Claim
1. A method of fusing outputs from at least two intelligent character recognition (ICR) systems that operate on image component strings, said ICR outputs including sets of candidate character strings having associated confidence values, the permutations of said outputs defining a decision space, the method comprising:
- generating a set of rules that partition said decision space into a plurality of subspaces, each of said rules including a test for determining whether said ICR outputs lie within its subspace, a membership function for selecting a fused character string from the sets of candidate character strings, and a confidence equation for computing a fused confidence value, wherein said set of rules is generated to reduce an average error rate between said fused character string and said image component strings, and further wherein generating the set of rules comprises:
(a) selecting a first one of the rules for said set;
(b) applying said rule to known training data to produce said fused character strings and their confidence values;
(c) comparing said fused character strings to the known training data to determine the average error rate;
(d) adjusting the rule's confidence equation to reduce said average error rate; and
(e) selecting the next one of the rules from said set until the set is exhausted and repeating steps b through e;
in response to successive outputs from said ICR systems, performing said tests until one is satisfied to select the applicable rule;
executing the membership function for the applicable rule to select the fused character string;
computing the confidence equation for the applicable rule to assign the fused confidence value; and
outputting said fused character string and its confidence value.
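The rule-generation steps (a) through (e) and the run-time steps of claim 1 can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the `Rule` class, the single tunable `gain` in the confidence equation, the grid of trial gains, and the accept/reject error measure at a fixed threshold are all hypothetical choices made for the example.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional, Sequence, Tuple

Candidates = Dict[str, float]   # candidate string -> confidence in [0, 1]
Outputs = List[Candidates]      # one Candidates dict per ICR system
Sample = Tuple[Outputs, str]    # (ICR outputs, known true string)


@dataclass
class Rule:
    test: Callable[[Outputs], bool]       # do the outputs lie in this rule's subspace?
    membership: Callable[[Outputs], str]  # selects the fused string
    gain: float = 1.0                     # hypothetical tunable parameter

    def confidence(self, outputs: Outputs, fused: str) -> float:
        # Hypothetical confidence equation: scale the best confidence
        # any ICR system assigned to the fused string, capped at 1.0.
        best = max(o.get(fused, 0.0) for o in outputs)
        return min(1.0, self.gain * best)


def error_rate(rule: Rule, samples: Sequence[Sample], threshold: float = 0.5) -> float:
    """Assumed error measure: accept/reject decision disagrees with ground truth."""
    errors = total = 0
    for outputs, truth in samples:
        if not rule.test(outputs):
            continue  # sample lies outside this rule's subspace
        fused = rule.membership(outputs)
        accepted = rule.confidence(outputs, fused) >= threshold
        errors += accepted != (fused == truth)
        total += 1
    return errors / total if total else 0.0


def train(rules: Sequence[Rule], samples: Sequence[Sample],
          gains: Sequence[float] = (0.5, 0.75, 1.0, 1.25, 1.5)) -> None:
    for rule in rules:                                   # steps (a) and (e)
        best_gain = rule.gain
        best_err = error_rate(rule, samples)             # steps (b) and (c)
        for g in gains:                                  # step (d): try adjustments
            rule.gain = g
            err = error_rate(rule, samples)
            if err < best_err:
                best_gain, best_err = g, err
        rule.gain = best_gain


def fuse(rules: Sequence[Rule], outputs: Outputs) -> Optional[Tuple[str, float]]:
    for rule in rules:                                   # perform tests until one is satisfied
        if rule.test(outputs):
            fused = rule.membership(outputs)             # select the fused string
            return fused, rule.confidence(outputs, fused)
    return None                                          # no rule in this sketch applies
```

Because `fuse` stops at the first satisfied test, the order of `rules` decides which subspace claims an input when tests overlap.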
Abstract
An ICR fuser includes a set of fusion rules for combining the outputs of multiple ICR systems. Each ICR system operates on a digital image to extract image component strings and provide a set of multiple candidate character strings and their associated confidence values. Each rule includes a test for determining whether that rule applies to the current ICR system outputs, a membership function for selecting a fused character string, and a confidence equation for assigning it a fused confidence value. The rules partition a decision space into subspaces to reduce the error rate between the fused character string and the image component string.
37 Citations
17 Claims
1. A method of fusing outputs from at least two intelligent character recognition (ICR) systems that operate on image component strings, said ICR outputs including sets of candidate character strings having associated confidence values, the permutations of said outputs defining a decision space, the method comprising:
- generating a set of rules that partition said decision space into a plurality of subspaces, each of said rules including a test for determining whether said ICR outputs lie within its subspace, a membership function for selecting a fused character string from the sets of candidate character strings, and a confidence equation for computing a fused confidence value, wherein said set of rules is generated to reduce an average error rate between said fused character string and said image component strings, and further wherein generating the set of rules comprises:
(a) selecting a first one of the rules for said set;
(b) applying said rule to known training data to produce said fused character strings and their confidence values;
(c) comparing said fused character strings to the known training data to determine the average error rate;
(d) adjusting the rule's confidence equation to reduce said average error rate; and
(e) selecting the next one of the rules from said set until the set is exhausted and repeating steps b through e;
in response to successive outputs from said ICR systems, performing said tests until one is satisfied to select the applicable rule;
executing the membership function for the applicable rule to select the fused character string;
computing the confidence equation for the applicable rule to assign the fused confidence value; and
outputting said fused character string and its confidence value.
Dependent Claims (2, 3, 4, 5, 6)
7. A method of recognizing character strings in digital images, comprising:
providing a digital image of a text document, wherein said text document is a standardized form on which multiple text strings from known data bases are printed in known positions on the form, said ICR systems extracting said text strings as successive ones of said image component strings;
providing a plurality of intelligent character recognition (ICR) systems that segment the digital image into image component strings and from them generate respective sets of candidate character strings having associated confidence values where the permutations of said candidate character strings define a decision space, wherein said ICR systems detect individual characters from respective subsets of the ASCII character set to generate the respective sets of candidate character strings and wherein three of said ICR systems detect individual characters from subsets of the ASCII character set including only upper case alpha characters, only lower case alpha characters and mixed upper and lower case alpha characters, respectively, and output their respective sets of candidate character strings in a common case;
generating a set of rules that partition said decision space into a plurality of subspaces to reduce an average recognition error rate, each of said ordered rules including a test for determining whether said candidate character strings lie within its subspace, a membership function for selecting a fused character string from the sets of candidate character strings, and a confidence equation for computing a fused confidence value, and wherein a first one of said rules includes a test that selects the first rule when one and only one string appears in all three sets, a membership function that selects that one string as the fused character string, and a confidence equation that first selects the maximum confidence value from the three sets and then increases it, and wherein a second one of said rules includes a test that selects the second rule when one and only one string appears in the upper and mixed case sets and the lower case set is empty or when one and only one string appears in the lower and mixed case sets and the upper case set is empty, a membership function that selects that one string as the fused character string, and a confidence equation that first selects the maximum confidence value from the sets and then increases it;
in response to successive sets of candidate character strings, performing said tests until one is satisfied to select the applicable rule;
executing the membership function for the applicable rule to select the fused character string;
computing the equation for the applicable rule to assign the fused confidence value; and
when said fused confidence value exceeds a threshold, transmitting said fused character string to a data base, otherwise rejecting said fused character string and accepting a user generated string for transmission to said data base.
Dependent Claims (8, 9, 10, 11)
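The first two rules of claim 7 and its threshold step can be sketched as below. The claim says only that each confidence equation takes the maximum confidence and "then increases it", so the `boost` formula, the boost amounts, and the 0.85 threshold are hypothetical values chosen for illustration. The function names `fuse_case_sets` and `route` are likewise assumptions.

```python
from typing import Dict, Optional, Tuple

Candidates = Dict[str, float]  # candidate string (in a common case) -> confidence


def boost(conf: float, amount: float) -> float:
    # Hypothetical "increase": move conf a fraction of the way toward 1.0.
    return conf + amount * (1.0 - conf)


def fuse_case_sets(upper: Candidates, lower: Candidates,
                   mixed: Candidates) -> Optional[Tuple[str, float]]:
    # Rule 1: one and only one string appears in all three sets.
    common = set(upper) & set(lower) & set(mixed)
    if len(common) == 1:
        s = next(iter(common))
        return s, boost(max(upper[s], lower[s], mixed[s]), 0.5)

    # Rule 2: one and only one string appears in the upper and mixed sets
    # while the lower set is empty, or the mirrored lower/mixed case.
    for full, other in ((upper, lower), (lower, upper)):
        shared = set(full) & set(mixed)
        if not other and len(shared) == 1:
            s = next(iter(shared))
            return s, boost(max(full[s], mixed[s]), 0.25)

    return None  # further rules are outside this sketch


def route(result: Optional[Tuple[str, float]],
          threshold: float = 0.85) -> Tuple[str, Optional[str]]:
    # Transmit to the data base when the fused confidence exceeds the
    # threshold; otherwise reject and fall back to a user-entered string.
    if result is not None and result[1] > threshold:
        return "database", result[0]
    return "manual", None
```

Rule 2 receives a smaller boost than Rule 1 here on the assumption that agreement between two recognizers is weaker evidence than agreement among three. The claim itself does not rank the two increases.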
12. An intelligent character recognition (ICR) fuser for combining the outputs of a plurality of ICR systems, said ICR outputs including sets of candidate character strings having associated confidence values where the permutations of said ICR outputs define a decision space, comprising:
a computer programmed with an ordered set of rules that partition said decision space into a plurality of subspaces to reduce an average recognition error rate, each of said rules including a test for determining whether said ICR outputs lie within its subspace, wherein said tests partition said decision space such that the fused character strings selected from a first one of the subspaces have a low error probability and those selected from a second one of the subspaces have a high error probability,
at least one membership function for selecting a fused character string from the sets of candidate text strings, wherein each of said membership functions selects the candidate character string having the highest confidence value that satisfies the associated test; and
at least two confidence equations for computing a fused confidence value, wherein the confidence equation associated with said first subspace boosts the fused confidence value relative to the selected candidate character string's confidence value, and the confidence equation associated with second subspace reduces the fused confidence value relative to the selected candidate character string's confidence value.
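The two-subspace behavior of claim 12 can be illustrated with a small sketch. It assumes, purely for the example, that the low-error subspace is the one where every ICR system's top candidate agrees and the high-error subspace is everything else. The multipliers 1.2 and 0.5 and the function name `fuse_two_subspaces` are hypothetical.

```python
from typing import Dict, List, Optional, Tuple

Candidates = Dict[str, float]  # candidate string -> confidence in [0, 1]


def fuse_two_subspaces(outputs: List[Candidates]) -> Optional[Tuple[str, float]]:
    # Top candidate (string, confidence) from each ICR system that produced any.
    tops = [max(o.items(), key=lambda kv: kv[1]) for o in outputs if o]
    if not tops:
        return None

    # Membership function: the highest-confidence candidate among the tops.
    fused, conf = max(tops, key=lambda kv: kv[1])

    if len({s for s, _ in tops}) == 1:
        # Assumed low-error subspace (all systems agree): boost the confidence.
        return fused, min(1.0, conf * 1.2)
    # Assumed high-error subspace (systems disagree): reduce the confidence.
    return fused, conf * 0.5
```

For instance, two systems that both rank "AB" first yield a fused confidence above the selected candidate's own value, while a disagreement between "AB" and "AC" yields one below it.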
13. A character recognition system, comprising:
an input storage medium for storing a digital image of a text document, wherein said text document is a standardized form on which multiple text strings from known data bases are printed in known positions on the form, said ICR systems extracting said text strings as successive ones of said image component strings;
a plurality of intelligent character recognition (ICR) systems that segment said digital image to extract successive image component strings and convert them into sets of candidate character strings having associated confidence values where the permutations of said candidate strings define a decision space, and wherein said ICR systems detect individual characters from respective subsets of the ASCII character set to generate the respective sets of candidate character strings and wherein three of said ICRs detect individual characters from subsets of the ASCII character set including only upper case alpha characters, only lower case alpha characters and mixed upper and lower case alpha characters, respectively, and output their respective sets of candidate character strings in a common case;
a computer programmed with an ordered set of rules that partition said decision space into a plurality of subspaces to reduce an average recognition error rate, each of said ordered rules including a test for determining whether said sets of candidate character strings lie within its subspace, a membership function for selecting a fused character string from those sets, and a confidence equation for computing a fused confidence value, wherein a first one of said rules includes a test that selects the first rule when one and only one string appears in all three sets, a membership function that selects that one string as the fused character string, and a confidence equation that first selects the maximum confidence value from the three sets and then increases it, wherein a second one of said rules includes a test that selects the second rule when one and only one string appears in the upper and mixed case sets and the lower case set is empty or when one and only one string appears in the lower and mixed case sets and the upper case set is empty, a membership function that selects that one string as the fused character string, and a confidence equation that first selects the maximum confidence value from the sets and then increases it;
a manual input device for replacing said fused character string with a user generated string when said fused confidence value is less than a threshold value; and
an output storage medium for storing said fused strings and said user generated strings.
Dependent Claims (14, 15, 16, 17)
Specification