Computer-implemented system and method for inclusion-based electronically stored information item cluster visual representation
First Claim
1. A computer-implemented system for inclusion-based electronically stored information item cluster visual representation, comprising:
- a non-transitory computer readable storage medium comprising program code; and
a computer processor configured coupled to the storage medium, wherein the processor is configured to execute the program code to perform steps to;
maintain a set of reference electronically stored information items;
select from the set a subset of the electronically stored information items, each of the reference electronically stored information items in the subset associated with a classification code, each of the classification codes associated with a visual representation different from the visual representations of the remaining classification codes;
combine the subset with a set of uncoded electronically stored information items, each of the uncoded electronically stored information items associated with a visual representation different from the visual representations of the classification codes;
group the combined electronically stored information items into clusters, further comprising;
convert each of the combined electronically stored information items into one or more tokens;
generate a score vector for each of the electronically stored information items based on the tokens associated with that electronically stored information item, further comprising;
score each of the tokens;
generate paired values for each of the combined electronically stored information items comprising paring the token with the score associated with that token; and
for each of the combined electronically stored information items, order the paired values along a vector for that combined electronically stored information item to create the score vector for that electronically stored information item, wherein the tokens are ordered along the vector based on a frequency of the tokens within that combined electronically stored information item; and
compare the score vector for each of the combined electronically stored information items, wherein the clustering is performed based on the comparison; and
visually represent each of the clusters comprising displaying the visual representation associated with the code of each of the reference electronically stored information items in that cluster and the visual representation associated with each of the uncoded electronically stored information item in that cluster.
3 Assignments
0 Petitions
Accused Products
Abstract
A computer-implemented system and method for inclusion-based electronically stored information item cluster visual representation is provided. A set of reference electronically stored information items is maintained. A subset of the electronically stored information items is selected from the set, each associated with a classification code, each of the classification codes associated with a visual representation different from the visual representations of the remaining classification codes. The subset is combined with a set of uncoded electronically stored information items, each associated with a visual representation different from the visual representations of the classification codes. The combined electronically stored information items are grouped into clusters. Each of the clusters is visually represented, including displaying the visual representation associated with the code of each of the reference electronically stored information items in that cluster and the visual representation associated with each of the uncoded electronically stored information item in that cluster.
371 Citations
14 Claims
-
1. A computer-implemented system for inclusion-based electronically stored information item cluster visual representation, comprising:
-
a non-transitory computer readable storage medium comprising program code; and a computer processor configured coupled to the storage medium, wherein the processor is configured to execute the program code to perform steps to; maintain a set of reference electronically stored information items; select from the set a subset of the electronically stored information items, each of the reference electronically stored information items in the subset associated with a classification code, each of the classification codes associated with a visual representation different from the visual representations of the remaining classification codes; combine the subset with a set of uncoded electronically stored information items, each of the uncoded electronically stored information items associated with a visual representation different from the visual representations of the classification codes; group the combined electronically stored information items into clusters, further comprising; convert each of the combined electronically stored information items into one or more tokens; generate a score vector for each of the electronically stored information items based on the tokens associated with that electronically stored information item, further comprising; score each of the tokens; generate paired values for each of the combined electronically stored information items comprising paring the token with the score associated with that token; and for each of the combined electronically stored information items, order the paired values along a vector for that combined electronically stored information item to create the score vector for that electronically stored information item, wherein the tokens are ordered along the vector based on a frequency of the tokens within that combined electronically stored information item; and compare the score vector for each of the combined electronically stored information items, wherein the clustering is performed based on the comparison; and visually represent each of the clusters comprising displaying the visual representation associated with the code of each of the reference electronically stored information items in that cluster and the visual representation associated with each of the uncoded electronically stored information item in that cluster. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A computer-implemented method for inclusion-based electronically stored information item cluster visual representation, comprising the steps of:
-
maintaining a set of reference electronically stored information items; selecting from the set a subset of the electronically stored information items, each of the reference electronically stored information items in the subset associated with a classification code, each of the classification codes associated with a visual representation different from the visual representations of the remaining classification codes; combining the subset with a set of uncoded electronically stored information items, each of the uncoded electronically stored information items associated with a visual representation different from the visual representations of the classification codes; grouping the combined electronically stored information items into clusters, further comprising; converting each of the combined electronically stored information items into one or more tokens; generating a score vector for each of the electronically stored information items based on the tokens associated with that electronically stored information item, further comprising; scoring each of the tokens; generating paired values for each of the combined electronically stored information items comprising paring the token with the score associated with that token; and for each of the combined electronically stored information items, ordering the paired values along a vector for that combined electronically stored information item to create the score vector for that electronically stored information item, wherein the tokens are ordered along the vector based on a frequency of the tokens within that combined electronically stored information item; and comparing the score vector for each of the combined electronically stored information items, wherein the clustering is performed based on the comparison; and visually representing each of the clusters comprising displaying the visual representation associated with the code of each of the reference electronically stored information items in that cluster and the visual representation associated with each of the uncoded electronically stored information item in that cluster, wherein the steps are performed on a suitably-programmed computer. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
Specification