EFFICIENTLY ESTIMATING COMPRESSION RATIO IN A DEDUPLICATING FILE SYSTEM
First Claim
1. A system for estimating a quantity of unique identifiers, comprising:
- a processor configured to:
for each of k times:
associate a bin of an ordered set of bins with each received identifier; and
determine a minimum bin number associated with each received identifier; and
determine an estimate of the quantity of unique identifiers based at least in part on an average minimum associated bin number; and
a memory coupled to the processor and configured to provide the processor with instructions.
Abstract
A system for estimating a quantity of unique identifiers comprises a processor and a memory. The processor is configured to, for each of k times, associate a bin of a set of bins with each received identifier. The processor is further configured to determine an estimate of the quantity of unique identifiers based at least in part on an average minimum associated bin value. The memory is coupled to the processor and configured to provide the processor with instructions.
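The procedure summarized above can be sketched in Python. This is a minimal illustration under stated assumptions, not the patented implementation: the text does not say how a bin is associated with an identifier or how the average minimum bin number is converted to a count. The sketch assumes equal-width bins chosen by a salted hash (one salt per trial), and it uses the fact that the minimum of n uniform values on [0, 1) has expected value 1/(n + 1). The names `bin_of` and `estimate_unique` and the defaults `k=64` and `num_bins=2**20` are hypothetical.

```python
import hashlib


def bin_of(identifier: bytes, trial: int, num_bins: int) -> int:
    """Map an identifier to a bin in [0, num_bins) for a given trial.

    Assumption: a salted BLAKE2b hash stands in for the bin association;
    the source does not specify the mapping.
    """
    digest = hashlib.blake2b(
        identifier, digest_size=8, salt=trial.to_bytes(8, "little")
    ).digest()
    value = int.from_bytes(digest, "big")  # roughly uniform in [0, 2**64)
    return (value * num_bins) >> 64        # scale down to a bin number


def estimate_unique(identifiers, k: int = 64, num_bins: int = 1 << 20) -> float:
    """Estimate the number of distinct identifiers from k minimum bin numbers."""
    minima = [num_bins] * k  # sentinel: larger than any real bin number
    seen_any = False
    for ident in identifiers:
        seen_any = True
        for trial in range(k):
            b = bin_of(ident, trial, num_bins)
            if b < minima[trial]:
                minima[trial] = b
    if not seen_any:
        return 0.0
    avg_min = sum(minima) / k
    # Treat bin b as the midpoint (b + 0.5) / num_bins of its interval, then
    # invert E[min] = 1 / (n + 1). This inversion is an assumed estimator.
    return max(0.0, num_bins / (avg_min + 0.5) - 1)


if __name__ == "__main__":
    ids = [f"segment-{i}".encode() for i in range(1000)]
    # Duplicates hash to the same bins, so they do not move any minimum.
    print(estimate_unique(ids + ids))
```

Only k integers are kept regardless of how many identifiers are received, and a repeated identifier cannot change a minimum, which is why the result tracks the number of unique identifiers rather than the total. Under these assumptions the relative error shrinks roughly as 1/sqrt(k).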
19 Claims
1. A system for estimating a quantity of unique identifiers, comprising:
- a processor configured to:
for each of k times:
associate a bin of an ordered set of bins with each received identifier; and
determine a minimum bin number associated with each received identifier; and
determine an estimate of the quantity of unique identifiers based at least in part on an average minimum associated bin number; and
- a memory coupled to the processor and configured to provide the processor with instructions.
Dependent claims: 2-17.
18. A method for estimating a quantity of unique identifiers comprising:
for each of k times:
associating a bin of an ordered set of bins with each received identifier; and
determining, using a processor, a minimum bin number associated with each received identifier; and
determining an estimate of the quantity of unique identifiers based at least in part on an average minimum associated bin number.
19. A computer program product, the computer program product being embedded in a non-transitory computer readable storage medium and comprising computer instructions for:
for each of k times:
associating a bin of an ordered set of bins with each received identifier; and
determining, using a processor, a minimum bin number associated with each received identifier; and
determining an estimate of the quantity of unique identifiers based at least in part on an average minimum associated bin number.
Specification