SYSTEM AND METHOD FOR AUTOMATIC LEARNING OF FUNCTIONS
First Claim
Patent Images
1. A computing system implemented method for learning and incorporating forms in an electronic document preparation system, the method comprising:
- receiving form data having a first data field for which a function needs to be determined;
receiving training set data of a plurality of data values of data fields relating to the first data field;
generating, for the first selected data field, candidate function data including two or more distinct candidate functions, each having one or more operators;
generating, for each generated candidate function of the candidate function data, test data by applying the candidate function to at least a portion of the training set data;
generating matching data indicating how closely the test data matches at least a portion of the training set data;
for at least two candidate functions of the candidate function data having the most desirable fitness function results among the original candidate functions;
splitting, of the at least two candidate functions of the candidate function data, data representing a first candidate function into at least first and second component pieces;
assembling at least the first component piece from candidate function data representing the first candidate function with at least a portion of the candidate function data representing the second candidate function, forming new candidate function data representing a new candidate function;
iterating between at least the splitting and assembling operations until test data of at least one of the new candidate functions matches the training set data within a predefined margin of error or until the matching data reflects that the candidate functions are not being improved in each iteration, as shown by a relatively constant margin of error; and
incorporating candidate function data representing at least one candidate function into an electronic document preparation system.
2 Assignments
0 Petitions
Accused Products
Abstract
A method and system learns functions to be associated with data fields of forms to be incorporated into an electronic document preparation system. The functions are essentially sets of operations required to calculate the data field. The method and system receive form data related to a data field that expects data values resulting from performing specific operations. The method and system utilize machine learning and training set data to generate, test, and evaluate candidate functions to determine acceptable functions.
-
Citations
30 Claims
-
1. A computing system implemented method for learning and incorporating forms in an electronic document preparation system, the method comprising:
-
receiving form data having a first data field for which a function needs to be determined; receiving training set data of a plurality of data values of data fields relating to the first data field; generating, for the first selected data field, candidate function data including two or more distinct candidate functions, each having one or more operators; generating, for each generated candidate function of the candidate function data, test data by applying the candidate function to at least a portion of the training set data; generating matching data indicating how closely the test data matches at least a portion of the training set data; for at least two candidate functions of the candidate function data having the most desirable fitness function results among the original candidate functions; splitting, of the at least two candidate functions of the candidate function data, data representing a first candidate function into at least first and second component pieces; assembling at least the first component piece from candidate function data representing the first candidate function with at least a portion of the candidate function data representing the second candidate function, forming new candidate function data representing a new candidate function; iterating between at least the splitting and assembling operations until test data of at least one of the new candidate functions matches the training set data within a predefined margin of error or until the matching data reflects that the candidate functions are not being improved in each iteration, as shown by a relatively constant margin of error; and incorporating candidate function data representing at least one candidate function into an electronic document preparation system. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16)
-
-
17. A non-transitory computer-readable medium having a plurality of computer-executable instructions which, when executed by a processor, perform a method for learning and incorporating new and/or updated forms in an electronic document preparation system, the instructions comprising:
-
receiving form data having a first data field for which a function needs to be determined; receiving training set data of a plurality of data values of data fields relating to the first data field; generating, for the first selected data field, candidate function data including two or more distinct candidate functions, each having one or more operators; generating, for each generated candidate function, test data by applying the candidate function to at least a portion of the training set data; generating matching data indicating how closely the test data matches at least a portion of the training set data; for at least two candidate functions of the candidate function data having the most desirable fitness function results among the original candidate functions; splitting, of the at least two candidate functions, a first candidate function into at least first and second component pieces; assembling at least the first component piece from the first candidate function with at least a portion of the second candidate function, forming a new candidate function; iterating between at least the splitting and assembling operations until test data from at least one of the new candidate functions matches the training set data within a predefined margin of error or until the matching data reflects that the candidate functions are not being improved in each iteration, as shown by a relatively constant margin of error; and incorporating at least one candidate function into an electronic document management system.
-
-
18. A system for learning and incorporating new and/or updated forms in an electronic document preparation system, the system comprising:
-
at least one processor; and at least one memory coupled to the at least one processor, the at least one memory having stored therein instructions which, when executed by any set of the one or more processors, perform a process including; receiving, with an interface module of a computing system, having a first data field for which a function needs to be determined; receiving, with a data acquisition module of a computing system, training set data of a plurality of data values of data fields relating to the first data field; generating, with the machine learning module, for the first selected data field, candidate function data including two or more distinct candidate functions, each having one or more operators; generating, with the machine learning module, for each generated candidate function, test data by applying the candidate function to at least a portion of the training set data; generating, with the machine learning module, matching data indicating how closely the test data matches at least a portion of the training set data; for at least two candidate functions of the candidate function data having the most desirable fitness function results among the original candidate functions; splitting, using the machine learning module, of the at least two candidate functions, a first candidate function into at least first and second component pieces; assembling, using the machine learning module, at least the first component piece from the first candidate function with at least a portion of the second candidate function, forming a new candidate function; iterating, using the machine learning module, between at least the splitting and assembling operations until test data from at least one of the new candidate functions matches the training set data within a predefined margin of error or until the matching data reflects that the candidate functions are not being improved in each iteration, as shown by a relatively constant margin of error; and incorporating, using the machine learning module, at least one candidate function into an electronic document management system. - View Dependent Claims (19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30)
-
Specification