Document classification and labeling using layout graph matching
First Claim
Patent Images
1. A document processing system for use in identifying a segmented document, comprising:
- a data store of layout graph models that are at least one of classified and labeled;
a matching module operable to make a determination of a match between a layout graph sample for the segmented document and a particular layout graph model of said data store, wherein said matching module has a correlator generating an identified, segmented document that is at least one of classified and labeled based on the segmented document, the layout graph model, and the determination of a match.
1 Assignment
0 Petitions
Accused Products
Abstract
A document processing system for use in identifying a segmented document includes a data store of layout graph models that are classified and/or labeled. A matching module makes a determination of a match between a layout graph sample for the segmented document and a particular layout graph model. The matching module uses a correlator to generate an identified, segmented document that is classified and/or labeled based on the segmented document, the layout graph model, and the determination of a match.
81 Citations
33 Claims
-
1. A document processing system for use in identifying a segmented document, comprising:
-
a data store of layout graph models that are at least one of classified and labeled;
a matching module operable to make a determination of a match between a layout graph sample for the segmented document and a particular layout graph model of said data store, wherein said matching module has a correlator generating an identified, segmented document that is at least one of classified and labeled based on the segmented document, the layout graph model, and the determination of a match. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A method of classifying and labeling a segmented document, comprising:
-
receiving a layout graph sample for the segmented document;
making a determination of a match between the layout graph sample and a layout graph model that is at least one of classified and labeled; and
generating an identified, segmented document that is at least one of classified and labeled based on the segmented document, the layout graph model, and the determination of a match. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. A method of building a labeled, layout graph model for a class of documents, comprising:
-
receiving segmentation results of at least one segmentation of at least one document of the class of documents;
instantiating nodes to represent document segments of a page for the class of documents based on the segmentation results, wherein the nodes store information identifying characteristics of the represented document segments; and
instantiating edges relating nodes to one another based on the segmentation results, wherein the edges store information identifying spatial inter-relation of the document segments represented by the nodes. - View Dependent Claims (21, 22, 23, 24, 25, 26)
-
-
27. A method of making a match between layout graph models for use with classifying and labeling documents, comprising:
-
receiving a layout graph sample;
comparing the layout graph sample to at least one layout graph model that is at least one of classified and labeled; and
finding a best match between the layout graph sample and a particular layout graph model. - View Dependent Claims (28, 29, 30, 31, 32, 33)
-
Specification