
Method for graph-based table recognition

  • US 5,841,900 A
  • Filed: 01/11/1996
  • Issued: 11/24/1998
  • Est. Priority Date: 01/11/1996
  • Status: Expired due to Term
First Claim

1. A graph-based method for recognizing tables present in a document represented as digital image data, including the steps of:

  • segmenting the digital image data to identify textual and image entities within the document;

    building a layout graph of the document using the entities;

    automatically tagging each of the text entities with a label from a document node alphabet to produce a labeled graph;

    automatically rewriting the labeled graph, by manipulating the text entities therein, using at least one rewriting rule, to identify the logical structure of the document, wherein the logical structure includes an identified table and the labeled graph is a host graph and where the step of automatically rewriting the labeled graph by manipulating the text entities comprises the steps of

      identifying at least two entities in the layout graph that are isomorphic instances of subgraphs, and

      replacing, in the host graph, the at least two entities that are isomorphic instances of the subgraphs with a nonterminal entity.
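
The rewriting step of the claim can be illustrated with a small sketch. It is not the patented implementation. It is a minimal Python example that assumes the segmentation and tagging steps have already produced labeled entities. The labels (`CELL`, `PARA`, `ROW`, `TABLE`), the relations (`right`, `below`), and the names `LayoutGraph`, `find_matches`, and `rewrite` are all hypothetical, chosen only for illustration. The pattern matched here (two same-label nodes joined by one relation) is a deliberately simple stand-in for the isomorphic subgraph instances the claim refers to.

```python
from dataclasses import dataclass, field


@dataclass
class LayoutGraph:
    """Host graph: labeled nodes plus (src, relation, dst) edges."""
    labels: dict = field(default_factory=dict)  # node id -> label
    edges: set = field(default_factory=set)     # (src, relation, dst)
    counter: int = 0                            # used to mint fresh node ids

    def add_node(self, node, label):
        self.labels[node] = label

    def add_edge(self, src, relation, dst):
        self.edges.add((src, relation, dst))

    def fresh(self, prefix):
        """Return a node id that has not been used before."""
        self.counter += 1
        return f"{prefix}#{self.counter}"


def find_matches(graph, label, relation):
    """Find disjoint pairs (a, b) with a --relation--> b, both carrying `label`."""
    used, matches = set(), []
    for src, rel, dst in sorted(graph.edges):
        if rel != relation or src in used or dst in used:
            continue
        if graph.labels.get(src) == label and graph.labels.get(dst) == label:
            matches.append((src, dst))
            used.update((src, dst))
    return matches


def rewrite(graph, label, relation, nonterminal):
    """Replace every matched pair with one node labeled `nonterminal`.

    Edges that touched either member of the pair are redirected to the
    new node; edges internal to the pair are dropped. Returns the number
    of replacements made.
    """
    matches = find_matches(graph, label, relation)
    for a, b in matches:
        merged = {a, b}
        new = graph.fresh(nonterminal)
        graph.labels[new] = nonterminal
        for node in merged:
            del graph.labels[node]
        redirected = set()
        for src, rel, dst in graph.edges:
            s = new if src in merged else src
            d = new if dst in merged else dst
            if s != d:
                redirected.add((s, rel, d))
        graph.edges = redirected
    return len(matches)


# Hypothetical labeled layout graph: one paragraph above a 2x2 grid of cells.
g = LayoutGraph()
g.add_node("p", "PARA")
for cell in ("c00", "c01", "c10", "c11"):
    g.add_node(cell, "CELL")
g.add_edge("p", "below", "c00")
g.add_edge("c00", "right", "c01")
g.add_edge("c10", "right", "c11")
g.add_edge("c00", "below", "c10")
g.add_edge("c01", "below", "c11")

rows_made = rewrite(g, "CELL", "right", "ROW")     # cell pairs -> ROW nonterminals
tables_made = rewrite(g, "ROW", "below", "TABLE")  # stacked rows -> TABLE nonterminal
print(rows_made, tables_made, sorted(g.labels.values()))
```

In this sketch each rule application shrinks the host graph, so the sequence of rewrites ends with a single `TABLE` node attached below the paragraph. A fuller treatment would need general subgraph matching and rules with larger left-hand sides, which are outside the scope of this illustration.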
