Table item information extraction with continuous machine learning through local and global models
First Claim
1. A method, comprising:
displaying, on a user device through a user interface of a bipartite application, a database table and an image, the database table having a plurality of columns, the image containing a table, the bipartite application implemented on the user device and a server machine in a backend of an enterprise computing environment, the displaying performed by a client module of the bipartite application executing on the user device, the client module including a local model of a table auto-completion algorithm, the user interface including a user interface element associated with the table auto-completion algorithm;
responsive to a user selecting the user interface element displayed on the user device, performing, by the client module running the local model of the table auto-completion algorithm:
analyzing a portion of the table highlighted by the user on the user interface, the portion of the table highlighted by the user on the user interface defining initial coordinates on the user interface;
determining a data point for each column of the database table using the initial coordinates;
automatically extracting data points thus determined from the table utilizing the local model;
entering the data points automatically extracted from the table utilizing the local model into the plurality of columns of the database table; and
storing information about the data points in the local model as positive examples;
determining, by the client module running the local model of the table auto-completion algorithm utilizing the positive examples in the local model, a plurality of additional data points in the table;
automatically extracting the plurality of additional data points from the table utilizing the local model and entering the plurality of additional data points extracted from the table utilizing the local model into the plurality of columns of the database table;
receiving, by the client module running the local model of the table auto-completion algorithm, a correction to a data point of the plurality of additional data points automatically extracted from the table utilizing the local model;
correcting the local model utilizing the correction to the data point and including the data point in the local model as a negative example, the correcting performed by the client module running the local model of the table auto-completion algorithm;
automatically continuously extracting table information from the table utilizing the positive and negative examples in the local model until extraction of the table information from the table is completed;
communicating the local model from the client module to a server module of the bipartite application running on the server machine in the backend of the enterprise computing environment, the server module including a global model of a table auto-completion algorithm;
updating the global model of the table auto-completion algorithm utilizing the local model;
automatically extracting table information from a plurality of documents, the automatically extracting performed by the server module executing on the server machine utilizing the global model; and
automatically entering into database fields the table information extracted from the plurality of documents utilizing the global model, the automatically entering performed by the server module executing on the server machine.
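The client-side loop recited above (seed from a highlighted row, extract further rows, accept a correction as a negative example, extract again) can be sketched in a few lines of Python. This is only an illustrative toy, not the claimed implementation: the claim does not say how examples are represented, so the `Token` type, the `signature` feature (numeric versus non-numeric cells), the `margin` parameter, and every other name below are assumptions made for this example.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Set, Tuple


@dataclass(frozen=True)
class Token:
    text: str
    x: float  # horizontal centre of an OCR token on the page image (assumed)


def signature(row: Tuple[str, ...]) -> Tuple[str, ...]:
    """Coarse per-cell shape: '9' for numeric-looking cells, 'A' otherwise."""
    return tuple("9" if c.replace(".", "", 1).isdigit() else "A" for c in row)


@dataclass
class LocalModel:
    column_spans: List[Tuple[float, float]] = field(default_factory=list)
    positives: Set[Tuple[str, ...]] = field(default_factory=set)
    negatives: Set[Tuple[str, ...]] = field(default_factory=set)

    def seed(self, highlighted: List[Token], margin: float = 15.0) -> Tuple[str, ...]:
        """The highlighted row gives one x-span per column and a positive example."""
        ordered = sorted(highlighted, key=lambda t: t.x)
        self.column_spans = [(t.x - margin, t.x + margin) for t in ordered]
        row = tuple(t.text for t in ordered)
        self.positives.add(signature(row))
        return row

    def extract(self, line: List[Token]) -> Optional[Tuple[str, ...]]:
        """Return one cell per column, or None if the line is not a table row."""
        cells = []
        for lo, hi in self.column_spans:
            hits = [t.text for t in line if lo <= t.x <= hi]
            if len(hits) != 1:
                return None
            cells.append(hits[0])
        row = tuple(cells)
        sig = signature(row)
        if sig in self.negatives and sig not in self.positives:
            return None  # looks like a row the user already rejected
        return row

    def correct(self, wrong_row: Tuple[str, ...]) -> None:
        """Record a user-rejected row as a negative example."""
        self.negatives.add(signature(wrong_row))


def auto_complete(model: LocalModel, lines: List[List[Token]]) -> List[Tuple[str, ...]]:
    """Extract every line the current local model accepts as a table row."""
    rows = []
    for line in lines:
        row = model.extract(line)
        if row is not None:
            rows.append(row)
    return rows


model = LocalModel()
model.seed([Token("Widget", 50), Token("2", 200), Token("10.00", 300)])
lines = [
    [Token("Gadget", 52), Token("1", 198), Token("5.00", 301)],
    [Token("Total", 50), Token("items", 200), Token("15.00", 300)],
]
first_pass = auto_complete(model, lines)   # also picks up the "Total" line
model.correct(first_pass[1])               # user rejects it
second_pass = auto_complete(model, lines)  # "Total" line is now skipped
```

In this sketch a single correction is enough to change the result of the next pass, because the negative example is stored as a row shape rather than as the literal row, so other lines with the same shape are rejected as well.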
Abstract
A bipartite application implements a table auto-completion (TAC) algorithm on both the client side and the server side. A client module runs a local model of the TAC algorithm on a user device, and a server module runs a global model of the TAC algorithm on a server machine. The local model is continuously adapted through on-the-fly training, with as little as a single negative example, to perform TAC on the client side, one document at a time. Knowledge thus learned by the local model is used to improve the global model on the server side. The global model can then be used to automatically extract table information from a large number of documents with significantly improved accuracy, requiring minimal human intervention even on complex tables.
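One way to picture the step in which knowledge learned by a local model improves the global model is a server-side store that accumulates the examples reported by each client. The sketch below is hypothetical: the abstract does not describe the update rule, so the `GlobalModel` class, the per-shape counters, and the "more positive than negative votes" acceptance rule are all assumptions chosen for brevity.

```python
from collections import Counter


class GlobalModel:
    """Toy server-side model: vote counts per row shape (assumed representation)."""

    def __init__(self):
        self.pos = Counter()  # how many local models saw this shape as a valid row
        self.neg = Counter()  # how many local models saw it rejected

    def update(self, local_positives, local_negatives):
        """Fold one local model's positive and negative examples into the totals."""
        self.pos.update(local_positives)
        self.neg.update(local_negatives)

    def accepts(self, shape):
        """Accept a row shape only if positive votes outnumber negative votes."""
        return self.pos[shape] > self.neg[shape]


server = GlobalModel()
# First document: line items look like (text, number, number); a totals line was rejected.
server.update({("A", "9", "9")}, {("A", "A", "9")})
# Second document: another client reports the same line-item shape.
server.update({("A", "9", "9")}, set())
```

With these two updates, `server.accepts(("A", "9", "9"))` is true and `server.accepts(("A", "A", "9"))` is false, so batch extraction on the server would skip lines shaped like the rejected totals row without further user input.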
20 Claims
1. A method, as recited in full under First Claim above.
Dependent claims: 2, 3, 4, 5, 6, 7
8. A system, comprising:
a processor;
a non-transitory computer-readable medium; and
stored instructions translatable by the processor to perform:
displaying, on a user device, a database table and an image, the database table having a plurality of columns, the image containing a table, the user interface including a user interface element for table auto-completion;
responsive to a user selecting the user interface element displayed on the user device, performing:
analyzing a portion of the table highlighted by the user on the user interface, the portion of the table highlighted by the user on the user interface defining initial coordinates on the user interface;
determining a data point for each column of the database table using the initial coordinates;
automatically extracting data points thus determined from the table utilizing the local model;
entering the data points automatically extracted from the table utilizing the local model into the plurality of columns of the database table; and
storing information about the data points in a local model of a table auto-completion algorithm as positive examples;
determining a plurality of additional data points in the table utilizing the positive examples in the local model;
automatically extracting the plurality of additional data points from the table utilizing the local model and entering the plurality of additional data points extracted from the table utilizing the local model into the plurality of columns of the database table;
receiving a correction to a data point of the plurality of additional data points automatically extracted from the table utilizing the local model;
correcting the local model utilizing the correction to the data point and including the data point in the local model as a negative example;
automatically continuously extracting table information from the table utilizing the positive and negative examples in the local model until extraction of the table information from the table is completed;
updating a global model of the table auto-completion algorithm utilizing the local model;
automatically extracting table information from a plurality of documents utilizing the global model; and
automatically entering into database fields the table information extracted from the plurality of documents utilizing the global model.
Dependent claims: 9, 10, 11, 12, 13
14. A computer program product comprising a non-transitory computer-readable medium storing instructions translatable by a processor to perform:
displaying, on a user device, a database table and an image, the database table having a plurality of columns, the image containing a table, the user interface including a user interface element for table auto-completion;
responsive to a user selecting the user interface element displayed on the user device, performing:
analyzing a portion of the table highlighted by the user on the user interface, the portion of the table highlighted by the user on the user interface defining initial coordinates on the user interface;
determining a data point for each column of the database table using the initial coordinates;
automatically extracting data points thus determined from the table utilizing the local model;
entering the data points automatically extracted from the table utilizing the local model into the plurality of columns of the database table; and
storing information about the data points in a local model of a table auto-completion algorithm as positive examples;
determining a plurality of additional data points in the table utilizing the positive examples in the local model;
automatically extracting the plurality of additional data points from the table utilizing the local model and entering the plurality of additional data points extracted from the table utilizing the local model into the plurality of columns of the database table;
receiving a correction to a data point of the plurality of additional data points automatically extracted from the table utilizing the local model;
correcting the local model utilizing the correction to the data point and including the data point in the local model as a negative example;
automatically continuously extracting table information from the table utilizing the positive and negative examples in the local model until extraction of the table information from the table is completed;
updating a global model of the table auto-completion algorithm utilizing the local model;
automatically extracting table information from a plurality of documents utilizing the global model; and
automatically entering into database fields the table information extracted from the plurality of documents utilizing the global model.
Dependent claims: 15, 16, 17, 18, 19, 20
Specification