×

Search term hit counts in an electronic discovery system

  • US 9,171,310 B2
  • Filed: 03/24/2010
  • Issued: 10/27/2015
  • Est. Priority Date: 03/27/2009
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented method for determining search term hit counts in an electronic discovery system, the method comprising:

  • identifying an electronic data set comprising data items for collection by an electronic discovery system;

    determining an estimated size of memory required to collect the data items of the electronic data set;

    determining, via a computer device processor, whether or not the estimated size of memory required to collect data items of the electronic data set is below a predetermined threshold;

    collecting, via a computing device processor, the data items in response to determining that the estimated size of memory required to collect the data items of the electronic data set is below the predetermined threshold, thus resulting in a collected data set;

    receiving, at a computing device, inputs that provide for a search term set that includes a plurality of search terms, wherein the search term set is associated with a case in the electronic discovery system and a search term is defined as a word or phrase associated with the case for identifying data items in the collected data set;

    prior to finalizing the search term set that will be applied to all of the collected data associated with the case, determining, via a computing device processor, a plurality of search term hit counts by applying the search term set to a portion of the collected data set,wherein the search term hit counts are defined as a number of data items in the portion of the collected data set in which (1) a specific search term included in the search term set occurs or (2) any one of the search terms in the search term set occur, and wherein the search term hit counts include;

    a per-data type search term hit count for one or more data types in the collected data set, wherein the one or more data types include electronic mail data and electronic file data, anda per-custodian search term hit count for each custodian associated with the case, wherein determining the per-data type search term hit count for one or more data types in the collected data set further comprises determining for each of the one or more data types in the collected data set a number of occurrences of the search term in each of the one or more data types, and wherein the per-custodian search term hit count is defined as a number of data items in the portion of the collected data set in which (1) the specific search term included in the search term set occurs or (2) any one of the search terms in the search term set occur and the data items in the search term hit count are also associated with a corresponding custodian;

    predicting, via a computing device processor, for an entirety of the collected data set based on results of applying the search term set to the portion of the collected data set, a volume of the collected data set required to be reviewed;

    determining, via a computing device processor, for each of the plurality of search terms, a file size, each file size corresponding to an amount of storage space occupied by each of the data items that comprise a corresponding search term; and

    storing, in computing device memory, the plurality of search term hit counts and the associated file size of the data items, wherein storing includes storing the per-custodian search term hit counts in a corresponding custodian profile within a custodian database and storing all of the search term hit counts in an associated search term file within the electronic discovery system.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×