Error correction in tables using discovered functional dependencies
First Claim
1. A computer program product comprising a computer readable storage medium having a computer readable program stored therein, wherein the computer readable program, when executed on a computing device, causes the computing device to:
- receive a natural language document comprising tabular data;
analyze the tabular data within the natural language document to identify at least one portion of the tabular data comprising an erroneous or missing data value;
determine, for the at least one portion, a functional dependency of the at least one portion of the tabular data on one or more other portions of the tabular data by reference to the natural language document;
determine for the at least one portion, a correct data value for the erroneous or missing data value of the at least one portion of the tabular data based on the determined functional dependency of the at least one portion;
modify the tabular data to replace the erroneous or missing data value with the correct data value and thereby generate a modified tabular data structure; and
perform a processing operation on the modified tabular data structure to generate a resulting output.
1 Assignment
0 Petitions
Accused Products
Abstract
Mechanisms are provided for performing tabular data correction in a document. Tabular data is received and analyzed to identify at least one portion of the tabular data having an erroneous/missing data value. A functional dependency of the at least one portion of the tabular data on one or more other portions of the tabular data is determined. A correct data value for the erroneous or missing data value of the at least one portion of the tabular data is determined based on the functional dependency of the at least one portion. In addition, the tabular data is modified to replace the erroneous or missing data value with the correct data value and thereby generate a modified table data. A processing operation is then performed on the modified table data to generate a resulting output.
53 Citations
20 Claims
-
1. A computer program product comprising a computer readable storage medium having a computer readable program stored therein, wherein the computer readable program, when executed on a computing device, causes the computing device to:
-
receive a natural language document comprising tabular data; analyze the tabular data within the natural language document to identify at least one portion of the tabular data comprising an erroneous or missing data value; determine, for the at least one portion, a functional dependency of the at least one portion of the tabular data on one or more other portions of the tabular data by reference to the natural language document; determine for the at least one portion, a correct data value for the erroneous or missing data value of the at least one portion of the tabular data based on the determined functional dependency of the at least one portion; modify the tabular data to replace the erroneous or missing data value with the correct data value and thereby generate a modified tabular data structure; and perform a processing operation on the modified tabular data structure to generate a resulting output. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. An apparatus comprising:
-
a processor; and a memory coupled to the processor, wherein the memory comprises instructions which, when executed by the processor, cause the processor to; receive a natural language document comprising tabular data; analyze the tabular data within the natural language document to identify at least one portion of the tabular data comprising an erroneous or missing data value; determine, for the at least one portion, a functional dependency of the at least one portion of the tabular data on one or more other portions of the tabular data by reference to the natural language document; determine for the at least one portion, a correct data value for the erroneous or missing data value of the at least one portion of the tabular data based on the determined functional dependency of the at least one portion; modify the tabular data to replace the erroneous or missing data value with the correct data value and thereby generate a modified tabular data structure; and perform a processing operation on the modified tabular data structure to generate a resulting output. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20)
-
Specification