Error correction in tables using discovered functional dependencies
First Claim
1. A method, in a data processing system comprising a processor and a memory, for performing tabular data correction in a document, the method comprising:
- receiving, by the data processing system, a natural language document comprising tabular data;
analyzing, by the data processing system, the tabular data within the natural language document to identify at least one portion of the tabular data comprising an erroneous or missing data value;
determining, by the data processing system, for the at least one portion, a functional dependency of the at least one portion of the tabular data on one or more other portions of the tabular data by reference to the natural language document;
determining, by the data processing system, for the at least one portion, a correct data value for the erroneous or missing data value of the at least one portion of the tabular data based on the determined functional dependency of the at least one portion;
modifying, by the data processing system, the tabular data to replace the erroneous or missing data value with the correct data value and thereby generate a modified tabular data structure; and
performing, by the data processing system, a processing operation on the modified tabular data structure to generate an resulting output.
1 Assignment
0 Petitions
Accused Products
Abstract
Mechanisms are provided for performing tabular data correction in a document. Tabular data is received and analyzed to identify at least one portion of the tabular data having an erroneous/missing data value. A functional dependency of the at least one portion of the tabular data on one or more other portions of the tabular data is determined. A correct data value for the erroneous or missing data value of the at least one portion of the tabular data is determined based on the functional dependency of the at least one portion. In addition, the tabular data is modified to replace the erroneous or missing data value with the correct data value and thereby generate a modified table data. A processing operation is then performed on the modified table data to generate a resulting output.
65 Citations
10 Claims
-
1. A method, in a data processing system comprising a processor and a memory, for performing tabular data correction in a document, the method comprising:
-
receiving, by the data processing system, a natural language document comprising tabular data; analyzing, by the data processing system, the tabular data within the natural language document to identify at least one portion of the tabular data comprising an erroneous or missing data value; determining, by the data processing system, for the at least one portion, a functional dependency of the at least one portion of the tabular data on one or more other portions of the tabular data by reference to the natural language document; determining, by the data processing system, for the at least one portion, a correct data value for the erroneous or missing data value of the at least one portion of the tabular data based on the determined functional dependency of the at least one portion; modifying, by the data processing system, the tabular data to replace the erroneous or missing data value with the correct data value and thereby generate a modified tabular data structure; and performing, by the data processing system, a processing operation on the modified tabular data structure to generate an resulting output. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
Specification