System and method for providing classification suggestions using document injection
First Claim
1. A system for providing classification suggestions using document injection, comprising:
clusters each comprising uncoded documents;
a set of reference documents, each reference document associated with a classification code;
a set of the uncoded documents selected from one or more of the clusters;
a comparison module to compare the set of uncoded documents with the set of reference documents;
an identification module to identify those reference documents that are similar to the set of uncoded documents;
an injection module to inject the similar reference documents into one or more of the clusters from which the set of uncoded documents are selected;
a display to display the clusters and to provide a visual suggestion for classification of at least one of the uncoded documents within one of the clusters based on the similar reference documents in that cluster; and
a classification assignment module to count a number of reference documents within the cluster for each different type of classification code, to determine a distance between the uncoded document and each of the reference documents in the cluster, to weigh the count of the reference documents for each type of classification code based on the distances of the reference documents associated with that classification code, and to assign the classification code having the highest weighted count to the uncoded document.
Abstract
A system and method for providing classification suggestions using document injection is provided. Clusters of uncoded documents are accessed. A set of reference documents is obtained. Each reference document is associated with a classification code. A set of the uncoded documents selected from one or more of the clusters is identified and compared with the set of reference documents. Those reference documents that are similar to the set of uncoded documents are identified and injected into one or more of the clusters from which the set of uncoded documents is selected. The clusters and a visual suggestion for classification of at least one of the uncoded documents within one of the clusters are displayed.
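The compare, identify, and inject steps described in the abstract can be sketched as follows. This is only an illustrative sketch under stated assumptions: the text above does not say how documents are represented or how similarity is measured, so the bag-of-words vectors, the cosine similarity measure, the `threshold` parameter, and all function and variable names here are hypothetical choices, not the patented method.

```python
from collections import Counter
from math import sqrt


def term_vector(text):
    """Bag-of-words term counts (an assumed, simplistic document representation)."""
    return Counter(text.lower().split())


def cosine_similarity(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(count * b[term] for term, count in a.items())
    norm_a = sqrt(sum(c * c for c in a.values()))
    norm_b = sqrt(sum(c * c for c in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def inject_similar_references(clusters, selected_ids, references, threshold=0.5):
    """Return the clusters with similar reference documents injected.

    clusters:     {cluster_id: {doc_id: text}} holding uncoded documents
    selected_ids: ids of the uncoded documents selected for comparison
    references:   {ref_id: (text, classification_code)}
    threshold:    hypothetical similarity cutoff (an assumption)
    """
    injected = {
        cid: {"uncoded": dict(docs), "references": {}}
        for cid, docs in clusters.items()
    }
    for cid, docs in clusters.items():
        # Only the selected uncoded documents in this cluster are compared.
        selected = [term_vector(t) for d, t in docs.items() if d in selected_ids]
        for ref_id, (ref_text, code) in references.items():
            ref_vec = term_vector(ref_text)
            # Inject the reference into the cluster its similar documents came from.
            if any(cosine_similarity(ref_vec, v) >= threshold for v in selected):
                injected[cid]["references"][ref_id] = code
    return injected
```

In this sketch a reference document is injected only into clusters that contributed a selected uncoded document it resembles, so clusters with no selected documents receive no references.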
20 Claims
1. A system for providing classification suggestions using document injection, comprising:
clusters each comprising uncoded documents;
a set of reference documents, each reference document associated with a classification code;
a set of the uncoded documents selected from one or more of the clusters;
a comparison module to compare the set of uncoded documents with the set of reference documents;
an identification module to identify those reference documents that are similar to the set of uncoded documents;
an injection module to inject the similar reference documents into one or more of the clusters from which the set of uncoded documents are selected;
a display to display the clusters and to provide a visual suggestion for classification of at least one of the uncoded documents within one of the clusters based on the similar reference documents in that cluster; and
a classification assignment module to count a number of reference documents within the cluster for each different type of classification code, to determine a distance between the uncoded document and each of the reference documents in the cluster, to weigh the count of the reference documents for each type of classification code based on the distances of the reference documents associated with that classification code, and to assign the classification code having the highest weighted count to the uncoded document.
11. A method for providing classification suggestions using document injection, comprising:
accessing clusters each comprising uncoded documents;
obtaining a set of reference documents, each reference document associated with a classification code;
identifying a set of the uncoded documents selected from one or more of the clusters;
comparing the set of uncoded documents with the set of reference documents;
identifying those reference documents that are similar to the set of uncoded documents;
injecting the similar reference documents into one or more of the clusters from which the set of uncoded documents are selected;
displaying the clusters and providing a visual suggestion for classification of at least one of the uncoded documents within one of the clusters based on the similar reference documents in that cluster; and
for each different type of classification code, counting a number of reference documents within the cluster for that classification code;
determining a distance between the uncoded document and each of the reference documents in the cluster;
weighing the count of the reference documents for each type of classification code based on the distances of the reference documents associated with that classification code; and
assigning the classification code having the highest weighted count to the uncoded document.
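The final counting, weighting, and assignment steps can be illustrated with a short sketch. The claim does not state how distances are turned into weights, so the inverse-distance weight 1 / (1 + d) used here is purely an assumption, as are the function name `suggest_classification` and its argument layout. Under that assumption, the weighted count for a code is the sum of 1 / (1 + d) over the reference documents in the cluster that carry that code.

```python
from collections import defaultdict


def suggest_classification(distances, codes):
    """Pick a classification code for one uncoded document in a cluster.

    distances: {ref_id: distance from the uncoded document to that reference}
    codes:     {ref_id: classification code of that reference}
    Returns (best_code, raw_counts, weighted_counts); best_code is None
    when the cluster holds no reference documents.
    """
    raw_counts = defaultdict(int)
    weighted_counts = defaultdict(float)
    for ref_id, distance in distances.items():
        code = codes[ref_id]
        # Count the reference documents for each classification code.
        raw_counts[code] += 1
        # Assumed weighting: closer references contribute more to the count.
        weighted_counts[code] += 1.0 / (1.0 + distance)
    if not weighted_counts:
        return None, {}, {}
    # Assign the code with the highest weighted count.
    best_code = max(weighted_counts, key=weighted_counts.get)
    return best_code, dict(raw_counts), dict(weighted_counts)


# One very close "privileged" reference versus two distant "non-responsive" ones.
best, raw, weighted = suggest_classification(
    {"r1": 0.0, "r2": 2.0, "r3": 3.0},
    {"r1": "privileged", "r2": "non-responsive", "r3": "non-responsive"},
)
```

In this example the raw count favors "non-responsive" (two references against one), but the single nearby "privileged" reference carries more weight, so the weighted count selects "privileged". A different weighting function could change that outcome.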
Specification