System and method for indexing electronic discovery data
First Claim
Patent Images
1. A method for indexing one or more documents, di, comprising the steps of:
- (a) determining a file type, fi, of each of the one or more documents, di;
(b) performing an extraction, ei, of data, dai, from the one or more documents, di;
(c) testing the data, dai, recovered from the extraction, ei, of document, di, for one or more embedded objects, dk, and if one or more embedded objects, dk, are detected, appending data, dai, from the one or more embedded objects, dk, to a buffer wherein the data is present in the one or more documents, di, and(d) repeating steps (a) to (c) recursively for the one or more documents, di, until no additional embedded objects, dk, are detected in the one or more documents, di.
1 Assignment
0 Petitions
Accused Products
Abstract
Systems and methods for efficiently processing electronically stored information (ESI) are described. The systems and methods describe processing ESI in preparation for, or association with, litigation. The invention preserves the contextual relationships among documents when processing and indexing data, allowing for increased precision and recall during data analytics.
51 Citations
30 Claims
-
1. A method for indexing one or more documents, di, comprising the steps of:
-
(a) determining a file type, fi, of each of the one or more documents, di; (b) performing an extraction, ei, of data, dai, from the one or more documents, di; (c) testing the data, dai, recovered from the extraction, ei, of document, di, for one or more embedded objects, dk, and if one or more embedded objects, dk, are detected, appending data, dai, from the one or more embedded objects, dk, to a buffer wherein the data is present in the one or more documents, di, and (d) repeating steps (a) to (c) recursively for the one or more documents, di, until no additional embedded objects, dk, are detected in the one or more documents, di. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A computer system for reviewing data, the computer system comprising:
-
(a) a source of a plurality of electronic documents; (b) a file ripper for extracting data from at least one document, di, from the plurality of electronic documents; (i) wherein the file ripper tests each document, di, for linked or embedded objects, dk; (ii) wherein the file ripper recursively repeats step (i) if additional linked or embedded objects, dk, are detected; and (c) an index, i, comprising data from the documents and objects, di and dk, wherein the index preserves hierarchical relationships among di and dk;
di and dk each have at least one individual identifier; and
a visual representation of dk within di is preserved through the use of an object map, m. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30)
-
Specification