Mathematical decomposition of table-structured electronic documents
First Claim
1. A method for automatically mathematically decomposing a table-structured document, the method comprising:
- utilizing mathematical relationships, together with textual and positional clues to the mathematical relationships, in a collaborative manner, to derive a mathematical construct of the table-structured document.
1 Assignment
0 Petitions
Accused Products
Abstract
Systems and methods for automatically decomposing table-structured electronic documents are described. The systems and methods of this invention generally comprise utilizing mathematical relationships, together with textual and positional clues to the mathematical relationships, in a collaborative manner, to derive a mathematical construct of the table-structured document. Embodiments of this invention automatically process a multitude of table-structured documents, thereby eliminating the need for human interaction with such documents in many cases and lowering the costs associated with processing such documents.
31 Citations
45 Claims
-
1. A method for automatically mathematically decomposing a table-structured document, the method comprising:
utilizing mathematical relationships, together with textual and positional clues to the mathematical relationships, in a collaborative manner, to derive a mathematical construct of the table-structured document. - View Dependent Claims (2, 3, 4, 5, 6)
-
7. A method for automatically mathematically decomposing a table-structured document, wherein the table-structured document comprises rows of data, the method comprising:
-
identifying each row of data in the table-structured document by aggregating together successively larger sets of values from consecutive rows in the table-structured document, starting at the top of the document and considering possible negative and positive permutations of each value as necessary, to see if the sums thereof are substantially equal to a value in the next consecutive row in the table-structured document, wherein if the sum thereof is substantially equal to the value in the next consecutive row in the table-structured document, the value in that next consecutive row in the table-structured document is identified as a subtotal. - View Dependent Claims (8, 9)
-
-
10. A method for automatically mathematically decomposing a table-structured document comprising multiple tables having predefined mathematical relationships, the method comprising at least one of the following steps:
-
utilizing textual clues and the predefined mathematical relationships between the multiple tables to partition the document into multiple sub-tables;
identifying a value in each sub-table as a grand total for the sub-table utilizing at least one of positional and textual information;
assigning all line items within the sub-table, except the grand total for the sub-table, as children of the sub-table;
pre-identifying subtotals within each table by utilizing available textual clues;
pre-identifying a value sign for each value in each table by utilizing available textual clues;
identifying and validating mathematical relationships between the children in each sub-table by summing together all line items values within the sub-table, except the grand total value for the sub-table, to create a validation sum, and then subtracting a sum of successively larger sets of line item values from the validation sum until the result thereof equals the grand total value for the sub-table, wherein when the result thereof equals the grand total value for the sub-table, the values in the set of line item values are identified as subtotals of the sub-table. - View Dependent Claims (11, 12, 13, 14, 15, 16)
-
-
17. A method for automatically mathematically decomposing a financial statement comprising rows of data, the method comprising:
-
identifying each row of data in the financial statement as an individual line item, a subtotal or a total, by summing together sequentially larger sets of values from consecutive rows, considering possible combinations of positive and negative permutations of each value as necessary, to see if the sum thereof equals the value in the next consecutive row in the financial statement;
wherein if the sum of the set of values equals the value in the next consecutive row in the financial statement, the set of values are identified as individual line items belonging to a category, and wherein the value in the next consecutive row in the financial statement is identified as the subtotal of the category. - View Dependent Claims (18, 19, 20, 21, 22, 23)
-
-
24. A system for automatically mathematically decomposing a table-structured document, the system comprising:
a means for utilizing mathematical relationships, together with textual and positional clues to the mathematical relationships, in a collaborative manner, to derive a mathematical construct of the table-structured document. - View Dependent Claims (25, 26, 27, 28, 29)
-
30. A system for automatically mathematically decomposing a table-structured document, wherein the table-structured document comprises rows of data, the system comprising:
-
a means for identifying each row of data in the table-structured document by aggregating together successively larger sets of values from consecutive rows in the table-structured document, starting at the top of the document and considering the possible negative and positive permutations of each value as necessary, to see if the sums thereof are substantially equal to a value in the next consecutive row in the table-structured document, wherein if the sum thereof is substantially equal to the value in the next consecutive row in the table-structured document, the value in that next consecutive row in the table-structured document is identified as a subtotal. - View Dependent Claims (31, 32)
-
-
33. A system for automatically mathematically decomposing a table-structured document comprising multiple tables having predefined mathematical relationships, the system comprising at least one of the following:
-
a means for utilizing textual clues and the predefined mathematical relationships between the multiple tables to partition the document into multiple sub-tables;
a means for identifying a value in each sub-table as a grand total for the sub-table utilizing at least one of positional and textual information;
a means for assigning all line items within the sub-table, except the grand total for the sub-table, as children of the sub-table;
a means for pre-identifying subtotals within each table by utilizing available textual clues;
a means for pre-identifying a value sign for each value in each table by utilizing available textual clues;
a means for identifying and validating mathematical relationships between the children in each sub-table by summing together all line items values within the sub-table, except the grand total value for the sub-table, to create a validation sum, and then subtracting a sum of successively larger sets of line item values from the validation sum until the result thereof equals the grand total value for the sub-table, wherein when the result thereof equals the grand total value for the sub-table, the values in the set of line item values are identified as subtotals of the sub-table. - View Dependent Claims (34, 35, 36, 37, 38)
-
-
39. A system for automatically mathematically decomposing a financial statement comprising rows of data, the system comprising:
-
a means for identifying each row of data in the financial statement as an individual line item, a subtotal or a total, by summing together sequentially larger sets of values from consecutive rows, considering the possible combinations of positive and negative permutations of each value as necessary, to see if the sum thereof equals the value in the next consecutive row in the financial statement;
wherein if the sum of the set of values equals the value in the next consecutive row in the financial statement, the set of values are identified as individual line items belonging to a category, and wherein the value in the next consecutive row in the financial statement is identified as the subtotal of the category. - View Dependent Claims (40, 41, 42, 43, 44, 45)
-
Specification