Document Classification with Prominent Objects
Abstract
Systems and methods classify whether or not an unknown document belongs to a group with one or more reference documents. Documents are scanned into digital images. Edge detection is applied to the images to detect contours that define pluralities of image objects, and each contour is approximated to a nearest polygon. Prominent objects are extracted from the polygons to derive a collection of features that together identify the reference document(s). Comparing that collection of features to the features of an unknown image determines whether or not the unknown document is included with the reference document(s). Embodiments typify collections of features, classification acceptance or rejection, application of algorithms, and imaging devices with scanners, to name a few.
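The polygon approximation and prominent-object steps described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the method disclosed in the specification: it assumes contours have already been traced by some edge detector and arrive as lists of (x, y) points, it uses Douglas-Peucker simplification as one possible way to reach a "nearest polygon", and it treats "prominent" as "largest enclosed area". The function names and the `epsilon` tolerance are hypothetical.

```python
import math

def perpendicular_distance(p, a, b):
    """Distance from point p to the line through a and b."""
    (px, py), (ax, ay), (bx, by) = p, a, b
    dx, dy = bx - ax, by - ay
    length = math.hypot(dx, dy)
    if length == 0.0:
        # a and b coincide (closed contour): fall back to point distance.
        return math.hypot(px - ax, py - ay)
    return abs(dy * (px - ax) - dx * (py - ay)) / length

def simplify_polyline(points, epsilon):
    """Douglas-Peucker simplification (assumed technique, not from the source)."""
    if len(points) < 3:
        return list(points)
    first, last = points[0], points[-1]
    index, farthest = 0, 0.0
    for i in range(1, len(points) - 1):
        d = perpendicular_distance(points[i], first, last)
        if d > farthest:
            index, farthest = i, d
    if farthest <= epsilon:
        return [first, last]
    left = simplify_polyline(points[: index + 1], epsilon)
    right = simplify_polyline(points[index:], epsilon)
    return left[:-1] + right  # drop the shared split point once

def approximate_closed_contour(contour, epsilon):
    """Approximate a closed contour by a polygon with fewer vertices."""
    closed = list(contour) + [contour[0]]
    return simplify_polyline(closed, epsilon)[:-1]

def polygon_area(vertices):
    """Enclosed area via the shoelace formula."""
    n = len(vertices)
    total = 0.0
    for i in range(n):
        x1, y1 = vertices[i]
        x2, y2 = vertices[(i + 1) % n]
        total += x1 * y2 - x2 * y1
    return abs(total) / 2.0

def prominent_objects(polygons, count):
    """Hypothetical prominence rule: keep the `count` largest polygons."""
    return sorted(polygons, key=polygon_area, reverse=True)[:count]

# A slightly noisy square contour collapses to its four corners.
noisy_square = [(0, 0), (5, 0.1), (10, 0), (10.1, 5),
                (10, 10), (5, 9.9), (0, 10), (-0.1, 5)]
square = approximate_closed_contour(noisy_square, epsilon=0.5)
small_triangle = [(0, 0), (2, 0), (1, 2)]
largest = prominent_objects([small_triangle, square], count=1)
```

In practice a library routine would likely replace the hand-written simplification; the pure-Python version is used here only to keep the sketch self-contained.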
20 Claims
1. In a computing system environment, a method for classifying whether or not an unknown input document belongs to a group with one or more reference documents, wherein digital images correspond to each of the unknown input document and the one or more reference documents, comprising:
applying edge detection to the digital images to detect contours of pluralities of image objects;
approximating the contours of the image objects to a nearest polygon thereby defining pluralities of polygons;
extracting prominent objects from one or more of the polygons to derive a collection of features that together identify the one or more reference documents; and
comparing to the collection of features at least one prominent object from the digital image corresponding to the unknown input document to determine inclusion or not of the unknown input document with the one or more reference documents.
11. The method of claim 1, further including ranking a comparison of the at least one prominent object to more than one said collection of features.
16. In an imaging device having a scanner and a controller for executing instructions responsive thereto, a method for classifying whether or not an unknown input document belongs to a group with one or more reference documents, comprising:
receiving at the controller a digital image from the scanner for each of the unknown input document and the one or more reference documents;
applying edge detection to the digital images to detect contours of pluralities of image objects;
approximating the contours of the image objects to a nearest polygon thereby defining pluralities of polygons; and
extracting prominent objects from one or more of the polygons to derive a collection of features that together identify the one or more reference documents.
18. A method for classifying whether or not an unknown input document belongs to a group with one or more reference documents, wherein digital images correspond to each of the unknown input document and the one or more reference documents, comprising:
applying edge detection to the digital images to detect contours of pluralities of image objects; and
determining features of prominent objects from the pluralities of image objects to derive a collection of features that together identify the one or more reference documents.
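The comparing step of claim 1 and the ranking of claim 11 can be illustrated with a small Python sketch. The claims do not say which features are used or how they are compared, so everything here is an assumption made for illustration: each prominent object is reduced to a hypothetical three-number feature vector (vertex count, fraction of page area, bounding-box aspect ratio), similarity is plain Euclidean distance, and inclusion is decided by a hypothetical `threshold`.

```python
import math

def object_features(vertices, page_area):
    """Hypothetical features: vertex count, area fraction, aspect ratio."""
    xs = [x for x, _ in vertices]
    ys = [y for _, y in vertices]
    width, height = max(xs) - min(xs), max(ys) - min(ys)
    n = len(vertices)
    # Shoelace formula for the enclosed area.
    twice_area = sum(
        vertices[i][0] * vertices[(i + 1) % n][1]
        - vertices[(i + 1) % n][0] * vertices[i][1]
        for i in range(n)
    )
    area = abs(twice_area) / 2.0
    aspect = width / height if height else 0.0
    return (float(n), area / page_area, aspect)

def score(candidate, collection):
    """Smallest distance from the candidate to any vector in a collection."""
    return min(math.dist(candidate, ref) for ref in collection)

def rank_collections(candidate, collections):
    """Return (name, score) pairs sorted from best match to worst."""
    scored = [(name, score(candidate, feats))
              for name, feats in collections.items()]
    return sorted(scored, key=lambda pair: pair[1])

def belongs(candidate, collection, threshold):
    """Hypothetical acceptance rule: close enough to some reference object."""
    return score(candidate, collection) <= threshold

PAGE_AREA = 100 * 100  # assumed page size in pixels

# Two reference groups, each described by one prominent object.
collections = {
    "invoice": [object_features([(0, 0), (40, 0), (40, 10), (0, 10)], PAGE_AREA)],
    "receipt": [object_features([(0, 0), (20, 0), (10, 20)], PAGE_AREA)],
}

# Prominent object taken from the unknown input document.
unknown = object_features([(5, 5), (44, 5), (44, 15), (5, 15)], PAGE_AREA)
ranking = rank_collections(unknown, collections)
```

With these made-up shapes the unknown object is a wide rectangle, so the "invoice" group ranks ahead of "receipt". A real system would need features normalized to comparable scales, since the raw vertex count would otherwise dominate the distance.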
Specification