Creating flexible structure descriptions
First Claim
Patent Images
1. A method for creating a flexible structure description, the method comprising:
- obtaining an image of a document;
detecting image objects on the image;
matching by a processor the image objects to a predetermined document type, wherein image objects distinguish the document type from other document types, and wherein image objects include anchor elements;
generating by the processor a flexible structure description corresponding to the predetermined document type based on the detected image objects, wherein the flexible structure description includes a set of search elements for each data field in the image of the document, each search element having an associated search criterion;
searching, via a search algorithm, additional document images to determine a respective document type of the additional document images, wherein each of the additional document images are of a document type corresponding to the predetermined document type;
modifying the flexible structure description based on said searching of additional document images, wherein the search algorithm is configured to detect data fields based on the flexible structure description, said data fields corresponding to the predetermined document type; and
repeating said searching and modifying of the flexible structure description until a defined level of precision is achieved or exceeded.
5 Assignments
0 Petitions
Accused Products
Abstract
In one embodiment, the invention provides a method, comprising detecting data fields on a scanned document image; generating a flexible document description based on the detected data fields, including creating a set of search elements for each data field, each search element having associated search criteria; and training or modifying the flexible document description using, for example, a search algorithm to detect the data fields on additional training images based on the set of search elements.
61 Citations
24 Claims
-
1. A method for creating a flexible structure description, the method comprising:
-
obtaining an image of a document; detecting image objects on the image; matching by a processor the image objects to a predetermined document type, wherein image objects distinguish the document type from other document types, and wherein image objects include anchor elements; generating by the processor a flexible structure description corresponding to the predetermined document type based on the detected image objects, wherein the flexible structure description includes a set of search elements for each data field in the image of the document, each search element having an associated search criterion; searching, via a search algorithm, additional document images to determine a respective document type of the additional document images, wherein each of the additional document images are of a document type corresponding to the predetermined document type; modifying the flexible structure description based on said searching of additional document images, wherein the search algorithm is configured to detect data fields based on the flexible structure description, said data fields corresponding to the predetermined document type; and repeating said searching and modifying of the flexible structure description until a defined level of precision is achieved or exceeded. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A non-transitory computer-readable medium having stored thereon a sequence of instructions which when executed by a system, causes the system to perform a method, comprising:
-
obtaining an image of a document; detecting image objects on the image; matching by a processor the image objects to a predetermined document type, wherein image objects distinguish the document type from other document types, and wherein image objects include anchor elements; generating by the processor a flexible structure description corresponding to the predetermined document type based on the detected image objects, wherein the flexible structure description includes a set of search elements for each data field in the image of the document, each search element having an associated search criterion; searching, via a search algorithm, additional document images to determine a respective document type of the additional document images, wherein each of the additional document images are of a document type corresponding to the predetermined document type; modifying the flexible structure description based on said searching of additional document images, wherein the search algorithm is configured to detect data fields based on the flexible structure description, said data fields corresponding to the predetermined document type; and repeating said searching and modifying of the flexible structure description until a defined level of precision is achieved or exceeded. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
-
17. A system comprising:
-
a processor; and a memory coupled to the processor, the memory storing instructions which when executed by the processor causes the system to perform a method, the method comprising; obtaining an image of a document; detecting image objects on the image; matching by a processor the image objects to a predetermined document type, wherein image objects distinguish the document type from other document types, and wherein image objects include anchor elements; generating by the processor a flexible structure description corresponding to the predetermined document type based on the detected image objects, wherein the flexible structure description includes a set of search elements for each data field in the image of the document, each search element having an associated search criterion; searching, via a search algorithm, additional document images to determine a respective document type of the additional document images, wherein each of the additional document images are of a document type corresponding to the predetermined document type; modifying the flexible structure description based on said searching of additional document images, wherein the search algorithm is configured to detect data fields based on the flexible structure description, said data fields corresponding to the predetermined document type; and repeating said searching and modifying of the flexible structure description until a defined level of precision is achieved or exceeded. - View Dependent Claims (18, 19, 20, 21, 22, 23, 24)
-
Specification