Standard exact clause detection
First Claim
1. A non-transitory computer readable medium storing program code for determination of standard exact clauses and non-standard clauses from a plurality of documents, the program code comprising instructions that when executed by a processor cause the processor to:
obtain a primary policy comprising one or more features, a clause example, and a first threshold for use in a semantic language evaluator to generate a plurality of feature replaced clauses by automatically replacing one or more of a plurality of original clauses in a plurality of documents with the one or more features and to compare each of the plurality of feature replaced clauses according to the primary policy to provide a first feature data set comprising standard clauses;
replace, automatically, an available variation of one of the standard clauses in the plurality of documents with a variable;
compare a clause and at least one of the clause example and the variable, the clause obtained from the plurality of documents with the available variation replaced with the variable;
obtain, in response to the comparison, a standard exact clause comprising the clause matching at least one of the clause example and the variable;
obtain a second feature data set encompassing the first feature data set, the second feature data set corresponding to a secondary policy, the secondary policy comprising the one or more features, the clause example, and a second threshold for use in the semantic language evaluator;
obtain a difference data set comprised of a difference between the first feature data set and the second feature data set, the difference data set comprising a non-standard clause, the non-standard clause being semantically related to but not matching the clause example; and
update, automatically in response to obtaining the difference data set, a database to identify the standard exact clause and the non-standard clause from the plurality of documents.
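The variable-replacement and exact-matching steps recited above can be illustrated with a minimal Python sketch. It is not the patented implementation: the regular-expression patterns, the bracketed variable names (`[DAYS]`, `[PARTY]`), and the helper names are all hypothetical and chosen only for illustration. Each "available variation" in a clause is replaced with a variable, and the result is then compared with a clause example that uses the same variables.

```python
import re

# Hypothetical patterns describing "available variations" (assumptions for illustration).
VARIATION_PATTERNS = {
    "[DAYS]": re.compile(r"\b\d+\s+days\b"),
    "[PARTY]": re.compile(r"\b(?:Acme Corp|Globex LLC)\b"),
}


def replace_variations(clause: str) -> str:
    """Replace each available variation found in the clause with its variable."""
    for variable, pattern in VARIATION_PATTERNS.items():
        clause = pattern.sub(variable, clause)
    return clause


def normalize(text: str) -> str:
    """Lower-case the text and collapse whitespace so only wording differences remain."""
    return " ".join(text.lower().split())


def is_standard_exact(clause: str, clause_example: str) -> bool:
    """True when the variable-replaced clause matches the clause example."""
    return normalize(replace_variations(clause)) == normalize(clause_example)


clause_example = "[PARTY] shall pay within [DAYS] of invoice."
matching = "Acme Corp shall pay within 30 days of invoice."
differing = "Acme Corp shall endeavor to pay within 30 days of invoice."

print(is_standard_exact(matching, clause_example))   # True
print(is_standard_exact(differing, clause_example))  # False
```

In this sketch the second clause differs from the example in wording rather than in an allowed variation, so it is not a standard exact clause, although a semantic evaluator might still treat it as related.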
Abstract
Embodiments relate to a system and a method for identifying, from contractual documents, (i) standard exact clauses matching clause examples and (ii) non-standard clauses semantically related to but not matching the clause examples. A standard feature data set comprising standard exact clauses matching clause examples is obtained. In addition, a mirror feature data set comprising semantically related clauses of the clause examples is obtained using semantic language analysis, where the mirror feature data set encompasses the standard feature data set. Non-standard clauses are obtained by extracting a difference between the mirror feature data set and the standard exact feature data set.
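The set relationship described in the abstract can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the disclosed system: token-overlap (Jaccard) similarity stands in for the semantic language analysis, and the function names and the threshold value are hypothetical. The standard set holds exact matches, the mirror set holds every clause at or above the similarity threshold (so it encompasses the standard set), and the non-standard clauses are their difference.

```python
def normalize(text: str) -> str:
    """Lower-case the text and collapse whitespace."""
    return " ".join(text.lower().split())


def similarity(a: str, b: str) -> float:
    """Jaccard token overlap; a simple stand-in for a semantic language evaluator."""
    tokens_a, tokens_b = set(normalize(a).split()), set(normalize(b).split())
    union = tokens_a | tokens_b
    return len(tokens_a & tokens_b) / len(union) if union else 0.0


def standard_feature_data_set(clauses, clause_example):
    """Clauses that exactly match the clause example after normalization."""
    return {c for c in clauses if normalize(c) == normalize(clause_example)}


def mirror_feature_data_set(clauses, clause_example, threshold):
    """Clauses semantically related to the example; exact matches score 1.0 and are included."""
    return {c for c in clauses if similarity(c, clause_example) >= threshold}


def non_standard_clauses(clauses, clause_example, threshold):
    """Difference between the mirror set and the standard exact set."""
    mirror = mirror_feature_data_set(clauses, clause_example, threshold)
    return mirror - standard_feature_data_set(clauses, clause_example)


example = "the supplier shall deliver the goods on time"
clauses = [
    "The supplier shall deliver the goods on time",
    "The supplier shall deliver the goods promptly",
    "This agreement is governed by the laws of Ohio",
]
print(non_standard_clauses(clauses, example, threshold=0.5))
```

With the hypothetical threshold of 0.5, only the "promptly" clause is related to the example without matching it, so it is the single non-standard clause in the result.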
21 Claims
1. A non-transitory computer readable medium storing program code for determination of standard exact clauses and non-standard clauses from a plurality of documents, the program code comprising instructions that when executed by a processor cause the processor to:
obtain a primary policy comprising one or more features, a clause example, and a first threshold for use in a semantic language evaluator to generate a plurality of feature replaced clauses by automatically replacing one or more of a plurality of original clauses in a plurality of documents with the one or more features and to compare each of the plurality of feature replaced clauses according to the primary policy to provide a first feature data set comprising standard clauses;
replace, automatically, an available variation of one of the standard clauses in the plurality of documents with a variable;
compare a clause and at least one of the clause example and the variable, the clause obtained from the plurality of documents with the available variation replaced with the variable;
obtain, in response to the comparison, a standard exact clause comprising the clause matching at least one of the clause example and the variable;
obtain a second feature data set encompassing the first feature data set, the second feature data set corresponding to a secondary policy, the secondary policy comprising the one or more features, the clause example, and a second threshold for use in the semantic language evaluator;
obtain a difference data set comprised of a difference between the first feature data set and the second feature data set, the difference data set comprising a non-standard clause, the non-standard clause being semantically related to but not matching the clause example; and
update, automatically in response to obtaining the difference data set, a database to identify the standard exact clause and the non-standard clause from the plurality of documents.
Dependent claims: 2, 3, 4, 5
6. A non-transitory computer readable medium storing program code for determination of standard exact clauses and non-standard clauses from a plurality of documents, the program code comprising instructions that when executed by a processor cause the processor to:
obtain a primary policy comprising one or more features, a clause example, and a first threshold for use in a semantic language evaluator to generate a plurality of feature replaced clauses by automatically replacing one or more of a plurality of original clauses in a document with the one or more features and to compare each of the plurality of feature replaced clauses according to the primary policy to provide a first feature data set comprising standard clauses;
replace, automatically, an available variation of one of the standard clauses in the document with a variable;
compare a clause and at least one of the clause example and the variable, the clause obtained from the document with the available variation replaced with the variable;
obtain, in response to the comparison, a standard exact clause comprising the clause matching at least one of the clause example and the variable;
replace, automatically, available variations of one or more of the standard clauses in the document with additional variables;
compare additional clauses and additional clause examples to obtain additional standard exact clauses, the additional clauses obtained from the document with the available variations replaced with the additional variables, the additional standard exact clauses comprising the additional clauses matching respective ones of the additional clause examples and the additional variables;
obtain a secondary policy comprising the one or more features, the clause example, and a second threshold for use in the semantic language evaluator to compare each of the plurality of feature replaced clauses according to the secondary policy to provide a second feature data set encompassing the first feature data set;
obtain a third feature data set comprising the standard exact clause and the additional standard exact clauses;
obtain a difference data set comprised of a difference between the second feature data set and the third feature data set, the difference data set comprising non-standard clauses; and
update, automatically, a database to identify the standard exact clause and the non-standard clauses from the document.
7. A computer implemented method for determination of standard exact clauses and non-standard clauses from a plurality of documents, the method comprising:
obtaining a primary policy comprising one or more features, a clause example, and a first threshold for generating a plurality of feature replaced clauses by automatically replacing one or more of the original clauses in the plurality of documents with a feature of the one or more features;
comparing each of the plurality of feature replaced clauses and a clause example using a semantic language evaluator to obtain a first feature data set comprising standard clauses;
automatically replacing an available variation of one of the standard clauses in the plurality of documents with a variable;
comparing a clause and at least one of the clause example and the variable, the clause obtained from the plurality of documents with the available variation replaced with the variable;
obtaining, in response to the comparison, the standard exact clause comprising the clause matching at least one of the clause example and the variable;
obtaining a second feature data set encompassing the first feature data set, the second feature data set corresponding to a secondary policy, the secondary policy comprising the one or more features, the clause example, and a second threshold for use in the semantic language evaluator;
obtaining a difference data set comprised of a difference between the first feature data set and the second feature data set, the difference data set comprising a non-standard clause, the non-standard clause being semantically related to but not matching the clause example; and
updating, automatically in response to obtaining the difference data set, a database to identify the standard exact clause and the non-standard clause from the plurality of documents.
Dependent claims: 8, 9, 10, 11, 12
13. A non-transitory computer readable medium storing program code for determination of standard exact clauses and non-standard clauses from a plurality of documents, the program code comprising instructions that when executed by a processor cause the processor to:
obtain a primary policy comprising one or more features, a clause example, and a first threshold to generate a plurality of feature replaced clauses by automatically replacing one or more original clauses in a plurality of documents with a feature of the one or more features;
compare each of the plurality of feature replaced clauses and a clause example using a semantic language evaluator to obtain a first feature data set comprising standard clauses;
automatically replace an available variation of one of the standard clauses in the plurality of documents with a variable;
compare a clause and at least one of the clause example and the variable, the clause obtained from the plurality of documents with the available variation replaced with the variable;
obtain, in response to the comparison, a standard exact clause comprising the clause matching at least one of the clause example and the variable;
obtain a second feature data set encompassing the first feature data set, the second feature data set corresponding to a secondary policy, the secondary policy comprising the one or more features, the clause example, and a second threshold for use in the semantic language evaluator;
obtain a difference data set comprised of a difference between the first feature data set and the second feature data set, the difference data set comprising a non-standard clause, the non-standard clause being semantically related to but not matching the clause example; and
update, automatically in response to obtaining the difference data set, a database to identify the standard exact clause and the non-standard clause from the plurality of documents.
Dependent claims: 14, 15, 16, 17, 18
19. A system for determination of standard exact clauses and non-standard clauses from the plurality of documents, the system comprising:
a document parsing module configured to obtain a primary policy comprising one or more features, a clause example, and a first threshold to generate a plurality of feature replaced clauses by replacing one or more original clauses in the plurality of documents with the one or more features and to replace an available variation of one of standard clauses in the plurality of documents with a variable;
a standard clause detection module configured to compare each of the plurality of feature replaced clauses and a clause example using a semantic language evaluator to obtain a first feature data set comprising the standard clauses;
a standard exact clause matching module configured to (i) compare a clause and at least one of the clause example and the variable, the clause obtained from the plurality of documents with the available variation replaced with the variable, and (ii) obtain, in response to the comparison, the standard exact clause comprising the clause matching at least one of the clause example and the variable;
a non-standard exact clause matching module configured to (i) obtain a second feature data set encompassing the first feature data set, the second feature data set corresponding to a secondary policy, the secondary policy comprising the one or more features, the clause example, and a second threshold for use in the semantic language evaluator, and (ii) obtain a difference data set comprised of a difference between the first feature data set and the second feature data set, the difference data set comprising a non-standard clause, the non-standard clause being semantically related to but not matching the clause example; and
a database update module configured to update, automatically in response to obtaining the difference data set, a database to identify the standard exact clause and the non-standard clause from the plurality of documents.
Dependent claims: 20, 21
Specification