Data storage using identifiers
First Claim
1. A method for processing a data stream comprising:
receiving a data segment;
determining whether the data segment has been previously stored; and
in the event that the data segment is determined not to have been previously stored, generating a unique identifier for uniquely specifying the data segment in a representation of the data stream without probability of collision with a different data segment in the data stream;
wherein generating the unique identifier includes:
deriving a content derived summary based on content of the data segment;
determining whether a portion of the content derived summary has been previously stored; and
in the event that the portion of the content derived summary has not been previously stored, using the portion of the content derived summary as a unique identifier;
and wherein the unique identifier is different from the content derived summary.
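The identifier-generation steps recited in claim 1 can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the use of SHA-256 as the content derived summary, the 4-byte starting portion, the `SegmentStore` class and its method names, and the fallback of lengthening the portion when a shorter one is already in use are all hypothetical choices that do not appear in the claim.

```python
import hashlib


class SegmentStore:
    """Illustrative store that names each new segment by a portion of its summary."""

    def __init__(self, prefix_len=4):
        self.prefix_len = prefix_len  # assumed starting size of the summary portion
        self.segments = {}            # identifier -> segment bytes
        self.by_summary = {}          # full summary -> identifiers sharing that summary

    def _find_existing(self, segment, summary):
        # Compare actual bytes, so a summary collision is never mistaken for a duplicate.
        for ident in self.by_summary.get(summary, []):
            if self.segments[ident] == segment:
                return ident
        return None

    def _new_identifier(self, summary):
        # Use the first portion of the summary that has not been stored before.
        # Lengthening the portion on a clash is an assumption; the loop stops
        # short of the full summary so the identifier always differs from it.
        for n in range(self.prefix_len, len(summary)):
            portion = summary[:n]
            if portion not in self.segments:
                return portion
        raise RuntimeError("no unused portion shorter than the summary")

    def put(self, segment):
        # Content derived summary (SHA-256 is an assumed choice).
        summary = hashlib.sha256(segment).digest()
        existing = self._find_existing(segment, summary)
        if existing is not None:
            return existing  # segment was previously stored
        ident = self._new_identifier(summary)
        self.segments[ident] = segment
        self.by_summary.setdefault(summary, []).append(ident)
        return ident
```

Because each candidate portion is checked against the identifiers already in use before it is assigned, two different stored segments never receive the same identifier in this sketch, even when their summaries share a prefix.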
Abstract
A system and method are disclosed for processing a data stream. A data segment is received. It is determined whether the data segment has been previously stored. In the event that the data segment is determined not to have been previously stored, a unique identifier for specifying the data segment in a representation of the data stream is generated.
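The flow described in the abstract can be illustrated with a short sketch in which the data stream is represented as a list of identifiers, one per segment, and each distinct segment is stored only once. The function names `process_stream` and `reconstruct` are hypothetical, and the sequential counter used as the identifier is a placeholder assumption; the claims derive the identifier from a portion of a content derived summary instead.

```python
from itertools import count
from typing import Dict, Iterable, List, Tuple


def process_stream(segments: Iterable[bytes]) -> Tuple[List[int], Dict[int, bytes]]:
    """Return (representation of the stream, identifier -> stored segment)."""
    stored: Dict[int, bytes] = {}   # each distinct segment is kept once
    known: Dict[bytes, int] = {}    # segment content -> identifier (exact byte match)
    next_id = count()               # placeholder identifier source (assumption)
    representation: List[int] = []

    for segment in segments:
        ident = known.get(segment)
        if ident is None:
            # Segment has not been previously stored: generate an identifier.
            ident = next(next_id)
            known[segment] = ident
            stored[ident] = segment
        # Either way, the stream representation records only the identifier.
        representation.append(ident)
    return representation, stored


def reconstruct(representation: List[int], stored: Dict[int, bytes]) -> bytes:
    """Rebuild the original stream by looking up each identifier."""
    return b"".join(stored[ident] for ident in representation)
```

For example, processing the segments `b"ab"`, `b"cd"`, `b"ab"` stores two segments and yields a three-entry representation whose first and last entries are the same identifier.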
21 Claims
1. A method for processing a data stream comprising:
receiving a data segment;
determining whether the data segment has been previously stored; and
in the event that the data segment is determined not to have been previously stored, generating a unique identifier for uniquely specifying the data segment in a representation of the data stream without probability of collision with a different data segment in the data stream;
wherein generating the unique identifier includes:
deriving a content derived summary based on content of the data segment;
determining whether a portion of the content derived summary has been previously stored; and
in the event that the portion of the content derived summary has not been previously stored, using the portion of the content derived summary as a unique identifier;
and wherein the unique identifier is different from the content derived summary.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13
14. A system for processing data stream comprising:
an interface configured to receive a data segment;
a processor coupled to the interface, configured to:
determine whether the data segment has been previously stored; and
in the event that the data segment is determined not to have been previously stored, generate a unique identifier for uniquely specifying the data segment in a representation of the data stream without probability of collision with a different data segment in the data stream;
wherein generating the unique identifier includes:
deriving a content derived summary based on content of the data segment;
determining whether a portion of the content derived summary has been previously stored;
in the event that the portion of the content derived summary has not been previously stored, using the portion of the content derived summary as a unique identifier;
wherein the unique identifier is different from the content derived summary.
Dependent claims: 15, 16, 17
18. A computer program product for processing a data stream, the computer program product being embodied in a computer readable storage medium and comprising computer instructions for:
receiving a data segment;
determining whether the data segment has been previously stored; and
in the event that the data segment is determined not to have been previously stored, generating a unique identifier uniquely specifying the data segment in a representation of the data stream without probability of collision with a different data segment in the data stream;
wherein generating the unique identifier includes:
deriving a content derived summary based on content of the data segment;
determining whether a portion of the content derived summary has been previously stored; and
in the event that the portion of the content derived summary has not been previously stored, using the portion of the content derived summary as a unique identifier;
and wherein the unique identifier is different from the content derived summary.
Dependent claims: 19, 20, 21
Specification