Graphical syntax analysis of tables through tree rewriting
Abstract
To determine a table structure, a spatially ordered sequence of rectangular cells (42) disposed in a two-dimensional region is derived. The ordered sequence of rectangular cells is parsed in accordance with a two-dimensional structural grammar (54) having terminal elements corresponding to cells and non-terminal elements corresponding to structural relationship operators. The parsing produces a grammatical expression (52) with the cells represented by terminal elements and structural relationships represented by non-terminal elements.
13 Claims
1. A storage medium storing software that is executable on a computer or digital device to perform a method for determining a table structure, the method comprising:
  deriving a spatially ordered sequence of rectangular cells disposed in a two-dimensional region; and
  parsing the spatially ordered sequence of rectangular cells in accordance with a two-dimensional structural grammar having terminal elements corresponding to cells and non-terminal elements corresponding to structural relationship operators, the parsing producing a grammatical expression with the cells represented by terminal elements and structural relationships represented by non-terminal elements, wherein the grammatical expression is a table structure comprising a tree in which the cells are represented by leaf nodes of the tree and the structural relationships are represented by non-leaf nodes of the tree, and the parsing includes:
    determining a table structure respective to a first two cells of a spatially ordered sequence of rectangular cells, the table structure including elements indicative of the first two cells and at least one element indicative of a structural relationship between the first two cells,
    defining a minimum rectangular bounding box containing the cells of the table structure,
    updating the table structure with additional structure including an element indicative of a next cell of the spatially ordered sequence of rectangular cells added as a leaf node and at least one element indicative of a structural relationship between the next cell and the minimum rectangular bounding box added as a non-leaf node, and
    repeating the defining and updating until the cells of the ordered sequence of rectangular cells are exhausted.
Dependent claims: 2, 3.

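The incremental parsing recited in claim 1 (seed the tree with the first two cells, take the bounding box, attach the next cell, repeat) can be sketched roughly as follows. This is only an illustration under stated assumptions: the names `Leaf`, `Node`, `bounding_box`, `relate`, `parse`, and `show` are hypothetical, boxes are assumed to be `(left, top, right, bottom)` with y growing downward, and `relate` is a placeholder rule rather than the relationship operators of the grammar.

```python
from dataclasses import dataclass
from typing import List, Tuple, Union

# Assumption: (left, top, right, bottom), with y growing downward.
Box = Tuple[float, float, float, float]


@dataclass
class Leaf:
    """A cell, represented as a leaf node of the tree."""
    name: str
    box: Box


@dataclass
class Node:
    """A structural relationship, represented as a non-leaf node."""
    op: str
    left: Union["Node", Leaf]
    right: Leaf


Tree = Union[Node, Leaf]


def bounding_box(tree: Tree) -> Box:
    """Minimum rectangle containing every cell in the tree."""
    if isinstance(tree, Leaf):
        return tree.box
    a, b = bounding_box(tree.left), bounding_box(tree.right)
    return (min(a[0], b[0]), min(a[1], b[1]), max(a[2], b[2]), max(a[3], b[3]))


def relate(box: Box, cell: Box) -> str:
    """Hypothetical placeholder rule, NOT the operators of the grammar."""
    if box[3] <= cell[1]:
        return "Above"  # existing structure lies entirely above the cell
    if box[2] <= cell[0]:
        return "Left"   # existing structure lies entirely left of the cell
    return "Other"


def parse(cells: List[Leaf]) -> Tree:
    """Build the tree from a spatially ordered list of cells."""
    if len(cells) < 2:
        raise ValueError("need at least two cells")
    first, second, *rest = cells
    tree: Tree = Node(relate(first.box, second.box), first, second)
    for cell in rest:
        box = bounding_box(tree)                        # defining step
        tree = Node(relate(box, cell.box), tree, cell)  # updating step
    return tree


def show(tree: Tree) -> str:
    """Render the tree as a nested expression."""
    if isinstance(tree, Leaf):
        return tree.name
    return f"{tree.op}({show(tree.left)}, {show(tree.right)})"


cells = [
    Leaf("a", (0, 0, 1, 1)),
    Leaf("b", (1, 0, 2, 1)),
    Leaf("c", (0, 1, 2, 2)),  # spans the full width below a and b
]
print(show(parse(cells)))  # Above(Left(a, b), c)
```

Each pass through the loop adds exactly one leaf and one non-leaf node, so a sequence of n cells yields a tree with n leaves and n - 1 relationship nodes.
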
4. A method for determining a table structure, the method comprising:
  deriving a spatially ordered sequence of rectangular cells disposed in a two-dimensional region; and
  parsing the spatially ordered sequence of rectangular cells in accordance with a two-dimensional structural grammar having terminal elements corresponding to cells and non-terminal elements corresponding to structural relationship operators, the parsing producing a grammatical expression with the cells represented by terminal elements and structural relationships represented by non-terminal elements;
  wherein each cell of the derived spatially ordered sequence of rectangular cells is represented by two points corresponding to opposing corners of the cell, and the ordered sequence of rectangular cells employs a spatial ordering based on coordinates of the points wherein each point is represented by two rectilinear coordinates, and the spatial ordering is in accordance with:
    $$\langle p_1, p_2 \rangle \prec \langle p_3, p_4 \rangle \equiv (p_1 \propto p_3) \lor \bigl(p_1 = p_3 \land (p_2 \propto p_4)\bigr)$$
  where $\langle p_1, p_2 \rangle$ and $\langle p_3, p_4 \rangle$ denote two cells, $p_1$, $p_2$, $p_3$, $p_4$ denote points representing said cells, and the spatial ordering of points is in accordance with:
    $$p_1 \propto p_2 = (p_1.\alpha < p_2.\alpha) \lor (p_1.\alpha = p_2.\alpha \land p_1.\beta < p_2.\beta)$$
  where the notation “p.α” denotes an α-coordinate of a point denoted by “p”, and the notation “p.β” denotes a β-coordinate of the point denoted by “p”, the α and β coordinates being mutually orthogonal.
Dependent claims: 5.

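The spatial ordering recited in claim 4 is a lexicographic comparison: points are compared by α and then β, and cells are compared by their first corner and then their second. A minimal sketch follows, assuming hypothetical names (`Point`, `Cell`, `point_before`, `cell_before`, `spatially_ordered`) and assuming, for the example only, that α is the vertical coordinate and β the horizontal one; the claim itself only requires the two coordinates to be mutually orthogonal.

```python
from functools import cmp_to_key
from typing import List, NamedTuple


class Point(NamedTuple):
    alpha: float  # α-coordinate (assumed vertical in the example below)
    beta: float   # β-coordinate (assumed horizontal in the example below)


class Cell(NamedTuple):
    p1: Point  # one corner of the rectangle
    p2: Point  # the opposing corner


def point_before(p: Point, q: Point) -> bool:
    """Point ordering: smaller α first, ties broken by smaller β."""
    return p.alpha < q.alpha or (p.alpha == q.alpha and p.beta < q.beta)


def cell_before(a: Cell, b: Cell) -> bool:
    """Cell ordering: compare first corners, then second corners."""
    return point_before(a.p1, b.p1) or (a.p1 == b.p1 and point_before(a.p2, b.p2))


def compare_cells(a: Cell, b: Cell) -> int:
    """Three-way comparison derived from cell_before, for use with sorted()."""
    if cell_before(a, b):
        return -1
    if cell_before(b, a):
        return 1
    return 0


def spatially_ordered(cells: List[Cell]) -> List[Cell]:
    """Return the cells sorted into the spatial order."""
    return sorted(cells, key=cmp_to_key(compare_cells))


# A 2x2 grid of unit cells, listed out of order.
a = Cell(Point(0, 0), Point(1, 1))
b = Cell(Point(0, 1), Point(1, 2))
c = Cell(Point(1, 0), Point(2, 1))
d = Cell(Point(1, 1), Point(2, 2))
print(spatially_ordered([d, b, c, a]) == [a, b, c, d])  # True
```

Because Python tuples already compare lexicographically, `sorted(cells)` would give the same result for these named tuples; the explicit comparators are spelled out only to mirror the two formulas.
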
6. A method for determining a table structure, the method comprising:
  deriving a spatially ordered sequence of rectangular cells disposed in a two-dimensional region; and
  parsing the spatially ordered sequence of rectangular cells in accordance with a two-dimensional structural grammar having terminal elements corresponding to cells and non-terminal elements corresponding to structural relationship operators, the parsing producing a grammatical expression with the cells represented by terminal elements and structural relationships represented by non-terminal elements;
  wherein the structural relationship operators of the grammar include at least AboveLeft, AboveRight, AboveEq, LeftUpper, LeftBottom, and LeftEq, two-dimensional structural relationship operators.
Dependent claims: 7, 8, 9.

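Claim 6 names six operators but does not define them in the claim text. The sketch below shows one way such operators might be modeled, under an explicitly assumed reading: an Above operator applies when the first box sits directly on top of the second, a Left operator when it sits directly to its left, and the suffix says which edges align (Eq meaning both). The `Op` enum, the `classify` function, and these semantics are illustrative assumptions, not definitions taken from the claims.

```python
from enum import Enum
from typing import Optional, Tuple

# Assumption: (left, top, right, bottom), with y growing downward.
Box = Tuple[float, float, float, float]


class Op(Enum):
    ABOVE_LEFT = "AboveLeft"
    ABOVE_RIGHT = "AboveRight"
    ABOVE_EQ = "AboveEq"
    LEFT_UPPER = "LeftUpper"
    LEFT_BOTTOM = "LeftBottom"
    LEFT_EQ = "LeftEq"


def classify(a: Box, b: Box) -> Optional[Op]:
    """Pick an operator for (a, b) under the ASSUMED semantics above.

    Returns None when none of the six assumed patterns matches.
    """
    a_left, a_top, a_right, a_bottom = a
    b_left, b_top, b_right, b_bottom = b

    if a_bottom == b_top:  # a sits directly on top of b
        if a_left == b_left and a_right == b_right:
            return Op.ABOVE_EQ
        if a_left == b_left:
            return Op.ABOVE_LEFT
        if a_right == b_right:
            return Op.ABOVE_RIGHT

    if a_right == b_left:  # a sits directly to the left of b
        if a_top == b_top and a_bottom == b_bottom:
            return Op.LEFT_EQ
        if a_top == b_top:
            return Op.LEFT_UPPER
        if a_bottom == b_bottom:
            return Op.LEFT_BOTTOM

    return None


print(classify((0, 0, 2, 1), (0, 1, 2, 2)))  # Op.ABOVE_EQ
print(classify((0, 0, 1, 2), (1, 0, 2, 1)))  # Op.LEFT_UPPER
```

Exact equality on coordinates is used here for brevity; cell boundaries recovered from a scanned or rendered page would likely need a tolerance.
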
10. A storage medium storing software that is executable on a computer or digital device to perform a method for determining a table structure, the method comprising:
  deriving a spatially ordered sequence of rectangular cells disposed in a two-dimensional region; and
  parsing the spatially ordered sequence of rectangular cells in accordance with a two-dimensional structural grammar having terminal elements corresponding to cells and non-terminal elements corresponding to structural relationship operators, the parsing producing a table structure comprising a grammatical expression with the cells represented by terminal elements and structural relationships represented by non-terminal elements, wherein the parsing comprises:
    generating an existing grammatical expression including an initial two terminal elements representing a first two cells of the spatially ordered sequence of rectangular cells and a non-terminal element representing a relationship between the initial two terminal elements;
    extending the existing grammatical expression by adding a next terminal element indicative of a next cell of the spatially ordered sequence of rectangular cells and a non-terminal element indicative of a relationship between the next cell and the table structure represented by the existing grammatical expression; and
    repeating the extending until the existing grammatical expression includes terminal elements representing all cells of the spatially ordered sequence of rectangular cells.
Dependent claims: 11, 12, 13.

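The generate-then-extend loop of claim 10 has the shape of a left fold over the ordered terminals. The sketch below shows only that shape: `extend`, `build_expression`, and the caller-supplied `choose_op` callback are hypothetical names, and the expression is kept as a plain string for readability. How the relationship is actually chosen is outside the scope of this illustration.

```python
from functools import reduce
from typing import Callable, Sequence


def extend(expr: str, terminal: str, op: str) -> str:
    """Wrap the existing expression and the next terminal in a non-terminal."""
    return f"{op}({expr}, {terminal})"


def build_expression(
    terminals: Sequence[str],
    choose_op: Callable[[str, str], str],
) -> str:
    """Fold the ordered terminals into one expression, left to right.

    choose_op(existing_expression, next_terminal) is a hypothetical hook
    that returns the name of the relationship to use at each step.
    """
    if len(terminals) < 2:
        raise ValueError("need at least two terminals")
    return reduce(
        lambda expr, t: extend(expr, t, choose_op(expr, t)),
        terminals[1:],
        terminals[0],
    )


# Placeholder relationship "R" at every step, purely to show the nesting.
print(build_expression(["c1", "c2", "c3"], lambda expr, t: "R"))
# R(R(c1, c2), c3)
```

The first fold step produces the initial two-terminal expression, and every later step adds one terminal and one non-terminal, which matches the extending step being repeated until all cells are included.
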
Specification