Systems and methods for detecting text
First Claim
1. A system that facilitates detecting text in data, comprising:
- an input component that receives data; and
a classification component that automatically detects text in the data via a trained transductive classifier employed in connection with a trained boosted classifier.
2 Assignments
0 Petitions
Accused Products
Abstract
The subject invention relates to facilitating text detection. The invention employs a boosted classifier and a transductive classifier to provide accurate and efficient text detection systems and/or methods. The boosted classifier is trained through features generated from a set of training connected components and labels. The boosted classifier utilizes the features to classify the training connected components, wherein inferred labels are conveyed to a transductive classifier, which generates additional properties. The initial set of features and the properties are utilized to train the transductive classifier. Upon training, the system and/or methods can be utilized to detect text in data under text detection, wherein unlabeled data is received, and connected components are extracted therefrom and utilized to generate corresponding feature vectors, which are employed to classify the connected components using the initial boosted classifier. Inferred labels are utilized to generate properties, which are utilized along with the initial feature vectors to classify each connected component using the transductive classifier.
150 Citations
20 Claims
-
1. A system that facilitates detecting text in data, comprising:
-
an input component that receives data; and
a classification component that automatically detects text in the data via a trained transductive classifier employed in connection with a trained boosted classifier. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A method for detecting text, comprising:
-
identifying one or more connected components associated with the unlabeled data under text detection;
utilizing the connected components to extract a feature vector for each connected component;
classifying each connected component represented by its respective feature vector;
employing inferred labels to bin the connected components across a plurality of bins;
computing properties for each bin; and
utilizing the feature vectors and corresponding properties to classify the connected components. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. A system for training a text detector, comprising:
-
means for training a boosted classifier and a transductive classifier with connected components and corresponding text and non-text labels for text detection; and
means for employing the boosted classifier in connection with the transductive classifier to detect text within unlabeled data.
-
Specification