Efficient computation of sketches
First Claim
Patent Images
1. A system for resemblance compression comprising:
- a processor configured to;
divide a data stream into a plurality of segments;
for a segment of the plurality of segments;
compute a summary feature set for the segment;
determine whether the segment resembles a stored segment using the summary feature set; and
in the event that the segment resembles the stored segment, store the segment as a composite of the stored segment and one or more deltas, wherein the stored segment corresponds to a base segment and the one or more deltas at least correspond to a difference between two different segments.
12 Assignments
0 Petitions
Accused Products
Abstract
Determining a summary feature set is disclosed. A plurality of subsegments of a first segment are selected. For each subsegment, a plurality of values by applying a set of functions to each subsegment are computed. From all the values computed for all the subsegments, a first subset of values is selected.
2 Citations
20 Claims
-
1. A system for resemblance compression comprising:
-
a processor configured to; divide a data stream into a plurality of segments; for a segment of the plurality of segments; compute a summary feature set for the segment; determine whether the segment resembles a stored segment using the summary feature set; and in the event that the segment resembles the stored segment, store the segment as a composite of the stored segment and one or more deltas, wherein the stored segment corresponds to a base segment and the one or more deltas at least correspond to a difference between two different segments. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A computer program product for resemblance compression, the computer program product being embodied in a non-transitory computer readable medium and comprising computer instructions for:
-
dividing a data stream into a plurality of segments; for a segment of the plurality of segments; computing a summary feature set for the segment; determining whether the segment resembles a stored segment using the summary feature set; and in the event that the segment resembles the stored segment, storing the segment as a composite of the stored segment and one or more deltas, wherein the stored segment corresponds to a base segment and the one or more deltas at least correspond to a difference between two different segments.
-
-
20. A method for resemblance compression comprising:
-
dividing, by a processor, a data stream into a plurality of segments; for a segment of the plurality of segments; computing, by the processor, a summary feature set for the segment; determining whether the segment resembles a stored segment using the summary feature set; and in the event that the segment resembles the stored segment, storing in a memory coupled to the processor the segment as a composite of the stored segment and one or more deltas, wherein the stored segment corresponds to a base segment and the one or more deltas at least correspond to a difference between two different segments.
-
Specification