Creating flexible structure descriptions of documents with repetitive non-regular structures
First Claim
1. A method for creating a flexible structure description, the method comprising:
- receiving an image of a document of a particular document type that contains a table;
receiving an entry describing an item in the table;
searching for title elements based upon the entry;
detecting data fields and anchor elements for the entry;
generating, using a processor, a flexible structure description for the particular document type that includes a set of search elements for each data field in the image of the document and the title elements;
matching the flexible structure description against the image; and
extracting data from the image based upon the matching of the flexible structure description against the image.
4 Assignments
0 Petitions
Accused Products
Abstract
Disclosed are systems, computer-readable mediums, and methods for creating a flexible structure description. To create the flexible structure description an image of a document of a particular document type that contains a table is received. An entry describing an item in the table is received. Title elements within the document are searched for based upon the entry. Data fields and anchor elements are detected for the entry. A flexible structure description for the particular document type is generated that includes a set of search elements for each data field in the image of the document and the title elements. The flexible structure description is matched against the image. Data from the image is extracted based upon the matching of the flexible structure description against the image.
30 Citations
20 Claims
-
1. A method for creating a flexible structure description, the method comprising:
-
receiving an image of a document of a particular document type that contains a table; receiving an entry describing an item in the table; searching for title elements based upon the entry; detecting data fields and anchor elements for the entry; generating, using a processor, a flexible structure description for the particular document type that includes a set of search elements for each data field in the image of the document and the title elements; matching the flexible structure description against the image; and extracting data from the image based upon the matching of the flexible structure description against the image. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A system for creating a flexible structure description, the system comprising:
one or more electronic processors configured to; receive an image of a document of a particular document type that contains a table; receive an entry describing an item in the table; search for title elements based upon the entry; detect data fields and anchor elements for the entry; generate a flexible structure description for the particular document type that includes a set of search elements for each data field in the image of the document and the title elements; match the flexible structure description against the image; and extract data from the image based upon the matching of the flexible structure description against the image. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
17. A non-transitory computer-readable medium having instructions stored thereon to create a flexible structure description, the instructions comprising:
-
instructions to receive an image of a document of a particular document type that contains a table; instructions to receive an entry describing an item in the table; instructions to search for title elements based upon the entry; instructions to detect data fields and anchor elements for the entry; instructions to generate a flexible structure description for the particular document type that includes a set of search elements for each data field in the image of the document and the title elements; instructions to match the flexible structure description against the image; and instructions to extract data from the image based upon the matching of the flexible structure description against the image. - View Dependent Claims (18, 19, 20)
-
Specification