Standard exact clause detection
First Claim
1. A non-transitory computer readable medium storing program code for determining a presence of a type of clause within a plurality of documents, the program code comprising instructions that when executed by a processor cause the processor to:
- receive a clause example corresponding to the type of clause;
generate a primary policy based upon the received clause example for use in a semantic language evaluator configured to assess a level of semantic similarity between received clauses, the primary policy comprising one or more policy rules and associated with a first threshold value indicating a level of semantic similarity of a clause to the clause example;
analyze, using the semantic language evaluator, the plurality of documents according to the primary policy to automatically provide a first set of clauses corresponding of the plurality of documents, each clause of the first set corresponding to a standard clause matching the clause example;
generate a mirror document based upon the plurality of documents by automatically replacing one or more portions of the plurality of documents having allowable variations with corresponding variables;
parse the mirror document to generate a second set of clauses corresponding to a standard exact feature data set;
generate a secondary policy based upon the primary policy and the clause example for use in the semantic language evaluator, the secondary policy associated with a second threshold value indicating a level of semantic similarity of a clause to the clause example that is lower than the first threshold value;
analyze, using the semantic language evaluator, the plurality of documents according to the secondary policy to automatically provide a third set of clauses comprising non-standard clauses semantically related to but not matching the clause example, wherein the third set of clauses corresponds to a mirror feature data set;
obtain a difference between the mirror feature data set and the standard exact feature data set, the difference corresponding to non-standard clauses of the plurality of documents;
update, automatically, a database to identify the standard and non-standard clauses of the plurality of documents associated with the type of clause based upon the obtained difference, for subsequent usage in analyzing the plurality of documents.
2 Assignments
0 Petitions
Accused Products
Abstract
Embodiments relate to a system and a method for identifying, from contractual documents, (i) standard exact clauses matching clause examples and (ii) non-standard clauses semantically related to but not matching the clause examples. A standard feature data set comprising standard exact clauses matching clause examples is obtained. In addition, a mirror feature data set comprising semantically related clauses of the clause examples is obtained using semantic language analysis, where the mirror feature data set encompasses the standard feature data set. Non-standard clauses are obtained by extracting a difference between the mirror feature data set and the standard exact feature data set.
-
Citations
18 Claims
-
1. A non-transitory computer readable medium storing program code for determining a presence of a type of clause within a plurality of documents, the program code comprising instructions that when executed by a processor cause the processor to:
-
receive a clause example corresponding to the type of clause; generate a primary policy based upon the received clause example for use in a semantic language evaluator configured to assess a level of semantic similarity between received clauses, the primary policy comprising one or more policy rules and associated with a first threshold value indicating a level of semantic similarity of a clause to the clause example; analyze, using the semantic language evaluator, the plurality of documents according to the primary policy to automatically provide a first set of clauses corresponding of the plurality of documents, each clause of the first set corresponding to a standard clause matching the clause example; generate a mirror document based upon the plurality of documents by automatically replacing one or more portions of the plurality of documents having allowable variations with corresponding variables; parse the mirror document to generate a second set of clauses corresponding to a standard exact feature data set; generate a secondary policy based upon the primary policy and the clause example for use in the semantic language evaluator, the secondary policy associated with a second threshold value indicating a level of semantic similarity of a clause to the clause example that is lower than the first threshold value; analyze, using the semantic language evaluator, the plurality of documents according to the secondary policy to automatically provide a third set of clauses comprising non-standard clauses semantically related to but not matching the clause example, wherein the third set of clauses corresponds to a mirror feature data set; obtain a difference between the mirror feature data set and the standard exact feature data set, the difference corresponding to non-standard clauses of the plurality of documents; update, automatically, a database to identify the standard and non-standard clauses of the plurality of documents associated with the type of clause based upon the obtained difference, for subsequent usage in analyzing the plurality of documents. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A computer implemented method for determining a presence of a type of clause within a plurality of documents, the method comprising:
-
receiving a clause example corresponding to the type of clause; generating a primary policy based upon the received clause example for use in a semantic language evaluator configured to assess a level of semantic similarity between received clauses, the primary policy comprising one or more policy rules and associated with a first threshold value-indicating a level of semantic similarity of a clause to the clause example; analyzing, using the semantic language evaluator, the plurality of documents according to the primary policy to automatically provide a first set of clauses corresponding of the plurality of documents, each clause of the first set corresponding to a standard clause matching the clause example; generating a mirror document based upon the plurality of documents by automatically replacing one or more portions of the plurality of documents having allowable variations with corresponding variables; parsing the minor document to generate a second set of clauses corresponding to a standard exact feature data set; generating a secondary policy based upon the primary policy and the clause example for use in the semantic language evaluator, the secondary policy associated with a second threshold value indicating a level of semantic similarity of a clause to the clause example that is lower than the first threshold value; analyzing, using the semantic language evaluator, the plurality of documents according to the secondary policy to automatically provide a third set of clauses comprising non-standard clauses semantically related to but not matching the clause example, wherein the third set of clauses corresponds to a mirror feature data set; obtaining a difference between the mirror feature data set and the standard exact feature data set, the difference corresponding to non-standard clauses of the plurality of documents; and automatically updating a database to identify the standard and non-standard clauses of the plurality of documents associated with the type of clause based upon the obtained difference, for subsequent usage in analyzing the plurality of documents. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. A system for determining a presence of a type of clause within a plurality of documents, comprising:
-
a document parsing module configured to receive a clause example corresponding to the type of clause; a policy definition module configured to; generate a primary policy based upon the received clause example for use in a semantic language evaluator configured to assess a level of semantic similarity between received clauses, the primary policy comprising one or more policy rules and associated with a first threshold value indicating a level of semantic similarity of a clause to the clause example; and generate a secondary policy based upon the primary policy and the clause example for use in the semantic language evaluator, the secondary policy associated with a second threshold value indicating a level of semantic similarity of a clause to the clause example that is lower than the first threshold value; an analysis engine configured to; analyze, using the semantic language evaluator, the plurality of documents according to the primary policy to automatically provide a first set of clauses corresponding of the plurality of documents, each clause of the first set corresponding to a standard clause matching the clause example; generate a mirror document based upon the plurality of documents by automatically replacing one or more portions of the plurality of documents having allowable variations with corresponding variables; parse the mirror document to generate a second set of clauses corresponding to a standard exact feature data set; analyze, using the semantic language evaluator, the plurality of documents according to the secondary policy to automatically provide a third set of clauses comprising non-standard clauses semantically related to but not matching the clause example, wherein the third set of clauses corresponds to a minor feature data set; obtain a difference between the mirror feature data set and the standard exact feature data set, the difference corresponding to non-standard clauses of the plurality of documents; and update, automatically, a database to identify the standard and non-standard clauses of the plurality of documents associated with the type of clause based upon the obtained difference, for subsequent usage in analyzing the plurality of documents. - View Dependent Claims (14, 15, 16, 17, 18)
-
Specification