Data Deduplication Apparatus and Method for Storing Data Received in a Data Stream From a Data Store
First Claim
Patent Images
1. Data deduplication apparatus for storing data received in a data stream from a data source, the apparatus comprising;
- an encoded entity handler operable to;
identify, in the data stream, meta data associated with an encoded data entity, the meta data relating to an encoding process that has been used to encode the encoded data entity;
use the meta data to decode the encoded data entity to provide a decoded form thereof; and
substitute said decoded form of the encoded data entity for the encoded form thereof in the data stream; and
a deduplication engine to;
perform deduplication on the data stream including at least one said decoded data entity to provided deduplicated data; and
store the deduplicated data to a deduplicated data store.
2 Assignments
0 Petitions
Accused Products
Abstract
A method of storing data received in a data stream from a data source is disclosed in which prior to performing deduplication on the data stream a processor decompresses selected compressed data entities in the data stream to provide a decompressed form of the data entities in the data stream in place of the compressed form, the data stream including the decompressed data entities is deduplicated and the deduplicated data is stored to a deduplicated data store.
62 Citations
17 Claims
-
1. Data deduplication apparatus for storing data received in a data stream from a data source, the apparatus comprising;
-
an encoded entity handler operable to; identify, in the data stream, meta data associated with an encoded data entity, the meta data relating to an encoding process that has been used to encode the encoded data entity; use the meta data to decode the encoded data entity to provide a decoded form thereof; and substitute said decoded form of the encoded data entity for the encoded form thereof in the data stream; and a deduplication engine to; perform deduplication on the data stream including at least one said decoded data entity to provided deduplicated data; and store the deduplicated data to a deduplicated data store. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A method of storing data received in a data stream from a data source, the method comprising:
-
prior to performing deduplication on a data stream, using a processor to decompress selected compressed data entities in the data stream to provide a decompressed form thereof to replace of the compressed form thereof; deduplicating the data stream including the decompressed data entities; and storing the deduplicated data to a deduplicated data store. - View Dependent Claims (9, 10, 11, 12)
-
-
13. Data deduplication storage apparatus for in-line processing of data received in a data stream from a data source, the apparatus comprising:
-
an encoded entity handler to; receive the data stream and identify meta data in the data stream that is indicative of recognised encoded data formats, the identified meta data being associated with encoded data in the data stream; use the identified meta data to decode the associated encoded data and provide a decoded form of the data in the data stream in place of the encoded form thereof; and remove the identified meta data from the data stream; and a deduplication engine to; receive the data stream downstream of the encoded data entity handler and perform deduplication on the data stream to provide deduplicated data; and secondary storage in which said deduplicated data is stored. - View Dependent Claims (14, 15, 16, 17)
-
Specification