Real-time recognition of mixed source text
First Claim
1. An optical character recognition (OCR) system for the real-time classification of text from a region of interest within an image sample, comprising:
- a feature extractor that extracts feature data associated with a plurality of region features from the region of interest;
a neural network preclassifier that selects one of a plurality of associated source classes for the region of interest according to the extracted feature data; and
a plurality of classification systems, each of the plurality of classification systems being associated with one of the plurality of source classes and being operative to classify individual characters within the region of interest when the associated source class of the classification system is selected.
1 Assignment
0 Petitions
Accused Products
Abstract
Methods and computer program products are disclosed for the real-time classification of text from a region of interest within an image sample. A feature extractor extracts feature data associated with a plurality of region features from the region of interest. The plurality of region features are selected as to minimize the time necessary for feature extraction. A neural network preclassifier selects one of a plurality of associated source classes for the region of interest according to the extracted feature data. A plurality of classification systems are each associated with one of the plurality of source classes. Each of the plurality of classification systems are operative to classify individual characters within the region of interest when the associated source class of the classification system is selected.
-
Citations
20 Claims
-
1. An optical character recognition (OCR) system for the real-time classification of text from a region of interest within an image sample, comprising:
-
a feature extractor that extracts feature data associated with a plurality of region features from the region of interest;
a neural network preclassifier that selects one of a plurality of associated source classes for the region of interest according to the extracted feature data; and
a plurality of classification systems, each of the plurality of classification systems being associated with one of the plurality of source classes and being operative to classify individual characters within the region of interest when the associated source class of the classification system is selected. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A computer program product, implemented on a computer readable medium and operative in a data processing system, for the real-time classification of text within a region of interest, comprising:
-
a feature extraction component that extracts feature values associated with a plurality of features relating to the region of interest from an image sample;
a preclassifier that selects one of a plurality of associated source classes for the region of interest according to the extracted feature values; and
a plurality of classifiers, each of the plurality of classifiers being associated with one of the plurality of source classes and being operative to classify individual characters within the region of interest when the associated source class of the classifier is selected;
wherein the feature extractor and preclassifier are configured such that the feature extractor and the preclassifier can operate to select one of the plurality of associated source classes within a predetermined period of time. - View Dependent Claims (10, 11, 12, 13)
-
-
14. A method for classifying text from a region of interest in real-time comprising:
-
identifying a region of interest within a scanned image;
extracting a plurality of feature values, associated with a plurality of region features, from the region of interest;
classifying the region of interest into one of a plurality of source classes at a neural network preclassifier according to the extracted feature values;
selecting one of a plurality of classification systems according to the source class associated with the region of interest; and
classifying individual characters within the region of interest at the selected classification system. - View Dependent Claims (15, 16, 17, 18, 19, 20)
-
Specification