Feature density object classification, systems and methods
First Claim
1. An object data processing system comprising:
- at least one processor configured to execute;
at least one implementation of a plurality of recognition algorithms stored on at least one non-transitory computer-readable storage medium, each recognition algorithm having feature density selection criteria; and
data preprocessing code executed by at least one processor, the data preprocessing code comprising an invariant feature identification algorithm and configured to;
obtain a digital representation of a scene, the scene comprising one or more textual media;
generate a set of invariant features by applying the invariant feature identification algorithm to the digital representation;
cluster the set of invariant features into regions of interest in the digital representation of the scene, each region of interest having a region feature density;
classify, by region classifier code, at least one of the regions of interest according to object type as a function of attributes derived from the region feature density and the digital representation, wherein the at least one of the classified regions of interest corresponds to text; and
use a classification result corresponding to the at least one of the regions of interest to classify another of the regions of interest according to object type, wherein the another of the regions of interest corresponds to a region of interest for images.
5 Assignments
0 Petitions
Accused Products
Abstract
A system capable of determining which recognition algorithms should be applied to regions of interest within digital representations is presented. A preprocessing module utilizes one or more feature identification algorithms to determine regions of interest based on feature density. The preprocessing modules leverages the feature density signature for each region to determine which of a plurality of diverse recognition modules should operate on the region of interest. A specific embodiment that focuses on structured documents is also presented. Further, the disclosed approach can be enhanced by addition of an object classifier that classifies types of objects found in the regions of interest.
51 Citations
22 Claims
-
1. An object data processing system comprising:
at least one processor configured to execute; at least one implementation of a plurality of recognition algorithms stored on at least one non-transitory computer-readable storage medium, each recognition algorithm having feature density selection criteria; and data preprocessing code executed by at least one processor, the data preprocessing code comprising an invariant feature identification algorithm and configured to; obtain a digital representation of a scene, the scene comprising one or more textual media; generate a set of invariant features by applying the invariant feature identification algorithm to the digital representation; cluster the set of invariant features into regions of interest in the digital representation of the scene, each region of interest having a region feature density; classify, by region classifier code, at least one of the regions of interest according to object type as a function of attributes derived from the region feature density and the digital representation, wherein the at least one of the classified regions of interest corresponds to text; and use a classification result corresponding to the at least one of the regions of interest to classify another of the regions of interest according to object type, wherein the another of the regions of interest corresponds to a region of interest for images. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20)
-
21. An object data processing system comprising:
at least one processor configured to execute; at least one implementation of a plurality of recognition algorithms stored on at least one non-transitory computer-readable storage medium, each recognition algorithm having feature density selection criteria; and data preprocessing code executed by at least one processor, the data preprocessing code comprising an invariant feature identification algorithm and configured to; obtain a digital representation of a scene, the scene comprising one or more textual media; generate a set of invariant features by applying the invariant feature identification algorithm to the digital representation; cluster the set of invariant features into regions of interest in the digital representation of the scene, each region of interest having a region feature density; classify, by region classifier code, at least one of the regions of interest according to object type as a function of attributes derived from the region feature density and the digital representation;
wherein the at least one of the classified regions of interest corresponds to text; anduse a classification result corresponding to the at least one of the regions of interest to classify another of the regions of interest according to object type, wherein the another of the regions of interest corresponds to a region of interest for images; assign each region of interest at least one recognition algorithm from at least one implementation of a plurality of diverse recognition algorithms as a function of the region feature density of each region of interest and the feature density selection criteria of the at least one implementation of a plurality of diverse recognition algorithms; and configure the assigned recognition algorithms to process their respective regions of interest, wherein preprocessing code, based on the feature density selection criteria, determines that an OCR algorithm is applicable to the text, and that other recognition algorithms are applicable to aspects of the photographs and to logos.
-
22. A device comprising:
at least one processor configured to execute; at least one implementation of a plurality of recognition algorithms stored on at least one non-transitory computer-readable storage medium, each recognition algorithm having feature density selection criteria; and data preprocessing code executed by at least one processor, the data preprocessing code comprising an invariant feature identification algorithm and configured to; obtain a digital representation of a scene, the scene comprising one or more textual media; generate a set of invariant features by applying the invariant feature identification algorithm to the digital representation; cluster the set of invariant features into regions of interest in the digital representation of the scene, each region of interest having a region feature density; and classify, by region classifier code, at least one of the regions of interest according to object type as a function of attributes derived from the region feature density and the digital representation, wherein the at least one of the classified regions of interest corresponds to text; and use a classification result corresponding to the at least one of the regions of interest to classify another of the regions of interest according to object type, wherein the another of the regions of interest corresponds to a region of interest for images.
Specification