Efficient data storage system
First Claim
Patent Images
1. A method for storing data comprising:
- receiving a data stream comprising a plurality of data segments;
assigning an identifier to one of the plurality of data segments; and
determining whether one of the plurality of data segments has been stored previously using a summary, wherein the summary is a space efficient, probabilistic summary of segment information.
11 Assignments
0 Petitions
Accused Products
Abstract
A system and method are disclosed for providing efficient data storage. A data stream comprising a plurality of data segments is received. The system determines whether one of the plurality of data segments has been stored previously using a summary in a low latency memory; in the event that the data segment is determined not to have been stored previously, assigning an identifier to the data segment.
40 Citations
16 Claims
-
1. A method for storing data comprising:
-
receiving a data stream comprising a plurality of data segments; assigning an identifier to one of the plurality of data segments; and determining whether one of the plurality of data segments has been stored previously using a summary, wherein the summary is a space efficient, probabilistic summary of segment information. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A data storage device comprising:
-
an input interface adapted to receive a data stream comprising a plurality of data segments; and a segment redundancy check engine configured to receive a data stream comprising a plurality of data segments, assign an identifier to one of the plurality of data segments, and determine whether one of the plurality of data segments has been stored previously using a summary, wherein the summary is a space efficient, probabilistic summary of segment information.
-
-
16. A computer program product for storing data, the computer program product being embodied in a computer readable medium and comprising computer instructions for:
-
receiving a data stream comprising a plurality of data segments; assigning an identifier to one of the plurality of data segments; and determining whether one of the plurality of data segments has been stored previously using a summary, wherein the summary is a space efficient, probabilistic summary of segment information.
-
Specification