Key character extraction and lexicon reduction for cursive text recognition
First Claim
1. A method for lexicon reduction, comprising:
- ascertaining, for each of a plurality of images of features in a line of cursive text, a number indicative of a feature;
inputting the number indicative of each of the features into a feedforward neural network;
computing with the neural network an estimate of a maximum length (lenmax), and a minimum length (lenmin) of the line of cursive text;
comparing lenmax and lenmin to a lexicon entry;
eliminating the lexicon entry if it has more characters than lenmax; and
eliminating the lexicon entry if it has less characters than lenmin, wherein said lenmin and said lenmax comprise a dynamic range that changes from image to image.
0 Assignments
0 Petitions
Accused Products
Abstract
A method, apparatus, and article of manufacture employing lexicon reduction using key characters and a neural network, for recognizing a line of cursive text. Unambiguous parts of a cursive image, referred to as “key characters,” are identified. If the level of confidence that a segment of a line of cursive text is a particular character is higher than a threshold, and is also sufficiently higher than the level of confidence of neighboring segments, then the character is designated as a key character candidate. Key character candidates are then screened using geometric information. The key character candidates that pass the screening are designated key characters. Two-stages of lexicon reduction are employed. The first stage of lexicon reduction uses a neural network to estimate a lower bound and an upper bound of the number of characters in a line of cursive text. Lexicon entries having a total number of characters outside of the bounds are eliminated. For the second stage of lexicon reduction, the lexicon is fitter reduced by comparing character strings using the key characters, with lexicon entries. For each of the key characters in the character strings, it is determined whether there is a mismatch between the key character and characters in a corresponding search range in the lexicon entry. If the number of mismatches for all of the key characters in a search string is greater than (1+(the number of key characters in the search string/4)), then the lexicon entry is eliminated. Accordingly, the invention advantageously accomplishes lexicon reduction, thereby decreasing the time required to recognize a line of cursive text, without reducing accuracy.
-
Citations
12 Claims
-
1. A method for lexicon reduction, comprising:
-
ascertaining, for each of a plurality of images of features in a line of cursive text, a number indicative of a feature;
inputting the number indicative of each of the features into a feedforward neural network;
computing with the neural network an estimate of a maximum length (lenmax), and a minimum length (lenmin) of the line of cursive text;
comparing lenmax and lenmin to a lexicon entry;
eliminating the lexicon entry if it has more characters than lenmax; and
eliminating the lexicon entry if it has less characters than lenmin, wherein said lenmin and said lenmax comprise a dynamic range that changes from image to image. - View Dependent Claims (10)
said comparing determines a distance between said lexicon entry and said lenbest.
-
-
2. A method for lexicon reduction, comprising:
-
ascertaining, for each of a plurality of features in a line of cursive text, a number indicative of a feature;
inputting the number indicative of each of the features into a feedforward neural network;
computing with the neural network an estimate of a maximum length (lenmax), and a minimum length (lenmin) of the line of cursive text;
comparing lenmax and lenmin to a lexicon entry;
eliminating the lexicon entry if it has more characters than lenmax; and
eliminating the lexicon entry if it has less characters than lenmin, wherein the plurality of features comprises;
(a) the number of connected components in the line of cursive text;
(b) the number of graphemes in the line of cursive text;
(c) the number of horizontal transitions in the line of cursive text;
(d) the sum of the number of horizontal transitions in each grapheme in the line of cursive text; and
(e) the average height of the graphemes in the line of cursive text.
-
-
3. A method for lexicon reduction, comprising:
-
ascertaining, for each of a plurality of features in a line of cursive text, a number indicative of a feature;
inputting the number indicative of each of the features into a feedforward neural network;
computing with the neural network an estimate of a maximum length (lenmax), and a minimum length (lenmin) of the line of cursive text;
comparing lenmax and lenmin to a lexicon entry;
eliminating the lexicon entry if it has more characters than lenmax; and
eliminating the lexicon entry if it has less characters than lenmin, wherein the neural network has a lenbest output unit, a lenmax output unit, and the lenmin output unit, and further comprising;
using a difference about 8(lentrue−
lenbest) in a back propagation step in training the lenmin output unit when the length is overestimated by lenbest;
using a difference about 0.25(lentrue−
lenbest) in the back propagation step in training the lenmin output unit when the length is underestimated by lenbest;
using a difference about 0.25(lentrue−
lenbest) in a back propagation step in training a lenmax output unit when the length is overestimated by lenbest;
using a difference about 8(lentrue−
lenbest) in the back propagation step in training the lenmax output unit when the length is underestimated by lenbest.
-
-
4. An article of manufacture comprising a data storage medium tangibly embodying a program of machine-readable instructions executable by a digital processing apparatus to perform a method for lexicon reduction, the method comprising:
-
ascertaining, for each of a plurality of images of features in a line of cursive text, a number indicative of a feature;
inputting the number indicative of each of the features into a feedforward neural network;
computing with the neural network an estimate of a maximum length (lenmax), and a minimum length (lenmin) of the line of cursive text;
comparing lenmax and lenmin to a lexicon entry;
eliminating the lexicon entry if it has more characters than lenmax; and
eliminating the lexicon entry if it has less characters than lenmin, wherein said lenmin and said lenmax comprise a dynamic range that changes from image to image. - View Dependent Claims (11)
said comparing determines a distance between said lexicon entry and said lenbest.
-
-
5. An article of manufacture comprising a data storage medium tangibly embodying a program of machine-readable instructions executable by a digital processing apparatus to perform a method for lexicon reduction, the method comprising:
-
ascertaining, for each of a plurality of features in a line of cursive text, a number indicative of a feature;
inputting the number indicative of each of the features into a feedforward neural network;
computing with the neural network an estimate of a maximum length (lenmax), and a minimum length (lenmin) of the line of cursive text;
comparing lenmax and lenmin to a lexicon entry;
eliminating the lexicon entry if it has more characters than lenmax; and
eliminating the lexicon entry if it has less characters than lenmin, wherein the plurality of features comprises;
(a) the number of connected components in the line of cursive text;
(b) the number of graphemes in the line of cursive text;
(c) the number of horizontal transitions in the line of cursive text;
(d) the sum of the number of horizontal transitions in each grapheme in the line of cursive text; and
(e) the average height of the graphemes in the line of cursive text.
-
-
6. An article of manufacture comprising a data storage medium tangibly embodying a program of machine-readable instructions executable by a digital processing apparatus to perform a method for lexicon reduction, the method comprising:
-
ascertaining, for each of a plurality of features in a line of cursive text, a number indicative of a feature;
inputting the number indicative of each of the features into a feedforward neural network;
computing with the neural network an estimate of a maximum length (lenmax), and a minimum length (lenmin) of the line of cursive text;
comparing lenmax and lenmin to a lexicon entry;
eliminating the lexicon entry if it has more characters than lenmax; and
eliminating the lexicon entry if it has less characters than lenmin, wherein the neural network has a lenbest output unit, a lenmax output unit, and a lenmin output unit, the method further comprising;
using a difference about 8(lentrue−
lenbest) in a back propagation step in training the lenmin output unit when the length is overestimated by lenbest;
using a difference about 0.25(lentrue−
lenbest) in the back propagation step in training the lenmin output unit when the length is underestimated by lenbest;
using a difference about 0.25(lentrue−
lenbest in the back propagation step in training the lenmax output unit when the length is overestimated by lenbest; and
using a difference about 8(lentrue−
lenbest, in the back propagation step in training the lenmax output unit when the length is underestimated by lenbest.
-
-
7. A digital data processing apparatus, programmed to reduction, comprising:
-
ascertaining, for each of a plurality of images of features in a line of cursive text, a number indicative of a feature;
inputting the number indicative of each of the features into a feedforward neural network;
computing with the neural network an estimate of a maximum length (lenmax), and a minimum length (lenmin) of the line of cursive text;
comparing lenmax and lenmin to a lexicon entry;
eliminating the lexicon entry if it has more characters than lenmax; and
eliminating the lexicon entry if it has less characters than lenmin, wherein said lenmin and said lenmax comprise a dynamic range that changes from image to image. - View Dependent Claims (12)
said comparing determines a distance between said lexicon entry and said lenbest.
-
-
8. A digital data processing apparatus, programmed to reduction, comprising:
-
ascertaining, for each of a plurality of features in a line of cursive text, a number indicative of a feature;
inputting the number indicative of each of the features into a feedforward neural network;
computing with the neural network an estimate of a maximum length (lenmax), and a minimum length (lenmin) of the line of cursive text;
comparing lenmax and lenmin to a lexicon entry;
eliminating the lexicon entry if it has more characters than lenmax; and
eliminating the lexicon entry if it has less characters than lenmin, wherein the plurality of features comprises;
(a) the number of connected components in the line of cursive text;
(b) the number of graphemes in the line of cursive text;
(c) the number of horizontal transitions in the line of cursive text;
(d) the sum of the number of horizontal transitions in each grapheme in the line of cursive text; and
(e) the average height of the graphemes in the line of cursive text.
-
-
9. A digital data processing apparatus, programmed to reduction, comprising:
-
ascertaining, for each of a plurality of features in a line of cursive text, a number indicative of a feature;
inputting the number indicative of each of the features into a feedforward neural network;
computing with the neural network an estimate of a maximum length (lenmax), and a minimum length (lenmin) of the line of cursive text;
comparing lenmax and lenmin to a lexicon entry;
eliminating the lexicon entry if it has more characters than lenmax; and
eliminating the lexicon entry if it has less characters than lenmin, wherein the neural network has a lenbest output unit, a lenmax output unit, and a lenmin. output unit, the method further comprising;
using a difference about 8(lentrue−
lenbest) in a back propagation step in training the lenmin output unit when the length is overestimated by lenbest;
using a difference about 0.25(lentrue−
lenbest) in the back propagation step in training the lenmin output unit when the length is underestimated by lenbest; and
using a difference about 0.25(lentrue−
lenbest) in the back propagation step in training the lenmax output unit when the length is overestimated by lenbest; and
using a difference about 8(lentrue−
lenbest) in a back propagation step in training the lenmax output unit when the length is underestimated by lenbest.
-
Specification