Techniques for organizing data to support efficient review and analysis
Abstract
Techniques for organizing a corpus of electronic documents. The electronic documents are organized in a manner that facilitates review of the documents. The documents are organized into a concept-based hierarchical collection of folders based upon contents of the documents.
32 Claims
1. A method comprising:
generating, by a computer system, a summary representation for each electronic document in a plurality of electronic documents, the summary representation representing a summary of content of the electronic document;
filtering, by the computer system, the plurality of electronic documents based upon their summary representations to generate a filtered subset;
organizing, by the computer system, the electronic documents in the filtered subset into a hierarchical collection of folders, the organizing comprising:
  determining similarity metrics between the electronic documents in the filtered subset based upon their summary representations;
  determining concepts associated with the electronic documents in the filtered subset based upon their summary representations; and
  grouping the electronic documents in the filtered subset into the hierarchical collection of folders based upon the similarity metrics and the concepts; and
assigning, by the computer system based upon a set of rules pertaining to document similarity, a number to each electronic document in the filtered subset by traversing the hierarchical collection of folders, such that when the electronic documents in the filtered subset are sorted based upon the assigned numbers to generate a sorted list of electronic documents, related electronic documents occur consecutively in the sorted list of electronic documents.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12
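The assigning step of claim 1 can be illustrated with a minimal sketch. It assumes the "set of rules pertaining to document similarity" reduces to a depth-first walk of the folder tree, which the claim does not state. The `Folder` class, `assign_numbers` function, and sample document ids are all hypothetical names introduced here for illustration.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Folder:
    """Hypothetical folder node: a concept label, document ids, and subfolders."""
    concept: str
    documents: List[str] = field(default_factory=list)
    subfolders: List["Folder"] = field(default_factory=list)


def assign_numbers(root: Folder) -> Dict[str, int]:
    """Number documents 1, 2, 3, ... in depth-first (pre-order) folder order.

    Assumes each document id appears in exactly one folder.
    """
    numbers: Dict[str, int] = {}

    def visit(folder: Folder) -> None:
        # A folder's own documents are numbered before its subfolders,
        # so documents sharing a folder receive adjacent numbers.
        for doc_id in folder.documents:
            numbers[doc_id] = len(numbers) + 1
        for child in folder.subfolders:
            visit(child)

    visit(root)
    return numbers


# Illustrative hierarchy; the concepts and ids are made up.
root = Folder("all", subfolders=[
    Folder("contracts", ["c1", "c2"], [Folder("leases", ["l1"])]),
    Folder("invoices", ["i1", "i2"]),
])
numbers = assign_numbers(root)
sorted_docs = sorted(numbers, key=numbers.get)
```

Because numbering follows the tree walk, sorting by the assigned numbers keeps every folder's documents contiguous, and a subfolder's documents directly follow those of its parent.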
13. A system comprising:
a memory configured to store a plurality of electronic documents; and
a processor;
wherein the processor is configured to:
  generate a summary representation of each electronic document in the plurality of electronic documents, the summary representation representing a summary of content of the electronic document;
  filter the plurality of electronic documents based upon their summary representations to generate a filtered subset;
  organize the electronic documents in the filtered subset into a hierarchical collection of folders, the organizing comprising:
    determining similarity metrics between the electronic documents in the filtered subset based upon their summary representations;
    determining concepts associated with the electronic documents in the filtered subset based upon their summary representations; and
    grouping the electronic documents in the filtered subset into the hierarchical collection of folders based upon the similarity metrics and the concepts; and
  assign, based upon a set of rules pertaining to document similarity, a number to each electronic document in the filtered subset by traversing the hierarchical collection of folders, such that when the electronic documents in the filtered subset are sorted based upon the assigned numbers to generate a sorted list of electronic documents, related electronic documents occur consecutively in the sorted list of electronic documents.
Dependent claims: 14, 15, 16, 17, 18, 19, 20, 21, 22, 23
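The generate, filter, and similarity steps recited in claim 13 might look like the following sketch. The claim does not define the summary representation, the filter criterion, or the similarity metric. The bag-of-words counts, the minimum-term filter, and the cosine similarity used here are assumptions chosen for illustration, and all function names are hypothetical.

```python
import math
import re
from collections import Counter
from typing import Dict


def summarize(text: str) -> Counter:
    """Assumed summary representation: counts of lowercase alphabetic words."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def filter_documents(summaries: Dict[str, Counter], min_terms: int = 3) -> Dict[str, Counter]:
    """Assumed filter rule: keep documents with at least min_terms distinct terms."""
    return {doc_id: s for doc_id, s in summaries.items() if len(s) >= min_terms}


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine similarity between two term-count vectors (0.0 if either is empty)."""
    dot = sum(count * b[term] for term, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# Illustrative documents; the texts are made up.
docs = {
    "a": "Lease agreement for office space",
    "b": "Office lease agreement renewal",
    "c": "ok",
}
summaries = {doc_id: summarize(text) for doc_id, text in docs.items()}
filtered = filter_documents(summaries)   # "c" has one term and is dropped
similarity = cosine_similarity(filtered["a"], filtered["b"])
```

Both the filter and the metric operate only on the summaries, never on the full text, which mirrors the claim's repeated phrase "based upon their summary representations".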
24. A computer-readable medium storing a plurality of instructions for controlling a data processor, the plurality of instructions comprising:
instructions that cause the data processor to generate a summary representation for each electronic document in a plurality of electronic documents, the summary representation representing a summary of content of the electronic document;
instructions that cause the data processor to filter the plurality of electronic documents based upon their summary representations to generate a filtered subset;
instructions that cause the data processor to organize the electronic documents in the filtered subset into a hierarchical collection of folders, the instructions that cause the data processor to organize comprising:
  instructions that cause the data processor to determine similarity metrics between the electronic documents in the filtered subset based upon their summary representations;
  instructions that cause the data processor to determine concepts associated with the electronic documents in the filtered subset based upon their summary representations; and
  instructions that cause the data processor to group the electronic documents in the filtered subset into the hierarchical collection of folders based upon the similarity metrics and the concepts; and
instructions that cause the data processor to assign, based upon a set of rules pertaining to document similarity, a number to each electronic document in the filtered subset by traversing the hierarchical collection of folders, such that when the electronic documents in the filtered subset are sorted based upon the assigned numbers to generate a sorted list of electronic documents, related electronic documents occur consecutively in the sorted list of electronic documents.
Dependent claims: 25, 26, 27, 28, 29, 30, 31, 32
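The concept and grouping instructions of claim 24 can be sketched as a two-level hierarchy: one folder per concept, split into subfolders of mutually similar documents. The claim does not say how concepts are derived or how similarity and concepts are combined. This sketch assumes the concept is the most frequent term in a summary, that similarity is Jaccard overlap of terms, and that grouping is greedy. Every name and the 0.5 threshold are hypothetical.

```python
from collections import Counter, defaultdict
from typing import Dict, List


def top_concept(summary: Counter) -> str:
    """Assumed concept rule: most frequent term, ties broken alphabetically.

    Expects a non-empty summary (for example, one that survived filtering).
    """
    return min(summary, key=lambda term: (-summary[term], term))


def jaccard(a: Counter, b: Counter) -> float:
    """Fraction of distinct terms that the two summaries share."""
    union = set(a) | set(b)
    return len(set(a) & set(b)) / len(union) if union else 0.0


def group_into_folders(summaries: Dict[str, Counter],
                       threshold: float = 0.5) -> Dict[str, List[List[str]]]:
    """Return {concept: [subfolder, ...]}; each subfolder lists similar document ids."""
    by_concept: Dict[str, List[str]] = defaultdict(list)
    for doc_id in sorted(summaries):
        by_concept[top_concept(summaries[doc_id])].append(doc_id)

    hierarchy: Dict[str, List[List[str]]] = {}
    for concept, doc_ids in by_concept.items():
        subfolders: List[List[str]] = []
        for doc_id in doc_ids:
            # Greedy rule: join the first subfolder whose first member is similar enough.
            for sub in subfolders:
                if jaccard(summaries[doc_id], summaries[sub[0]]) >= threshold:
                    sub.append(doc_id)
                    break
            else:
                subfolders.append([doc_id])
        hierarchy[concept] = subfolders
    return hierarchy


# Illustrative summaries; the terms and counts are made up.
summaries = {
    "d1": Counter({"lease": 3, "office": 1, "rent": 1}),
    "d2": Counter({"lease": 2, "office": 1, "rent": 1}),
    "d3": Counter({"lease": 2, "farm": 1, "land": 1}),
    "d4": Counter({"invoice": 2, "total": 1}),
}
hierarchy = group_into_folders(summaries)
```

Here `d1` and `d2` share all of their terms and land in the same subfolder, while `d3` shares only the concept term and gets its own subfolder under the same concept folder.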
Specification