Systems and methods for managing data in cloud storage using deduplication techniques
Abstract
A computer-implemented method for managing data in cloud storage using deduplication techniques is described. At least one unique segment of data is received from at least one backup source. The at least one unique segment of data is written to at least one container file associated with the at least one backup source. An index file is created for the at least one container file. The index file includes information regarding the unique segments of data. A plurality of the unique segments of data is transmitted from the at least one container file as an object to a virtual container on cloud storage. A virtual container is composed of an index file and one or more container files, and the index file and each container file are stored as individual cloud storage objects. Spatial locality of the plurality of unique segments of data is maintained in cloud storage.
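The virtual container layout described in the abstract (one index file plus one or more container files, each stored as its own cloud storage object) can be sketched as follows. This is a minimal illustration, not the patented implementation: the function name `upload_virtual_container`, the key naming scheme, the JSON index encoding, and the in-memory dict standing in for cloud storage are all assumptions made for the example.

```python
import json


def upload_virtual_container(store, name, container_files, index):
    """Store the index file and each container file as separate objects.

    `store` is a plain dict standing in for a cloud object store
    (hypothetical; a real service would expose its own upload call).
    Returns the object keys that make up the virtual container.
    """
    keys = []
    for i, data in enumerate(container_files):
        # Each container file becomes one individual object.
        key = f"{name}/container-{i:04d}"
        store[key] = data
        keys.append(key)

    # The index file is stored as its own object alongside the containers.
    index_key = f"{name}/index"
    store[index_key] = json.dumps(index, sort_keys=True).encode("utf-8")
    return [index_key] + keys


store = {}
keys = upload_virtual_container(
    store,
    "vc-client-a",
    [b"seg1seg2", b"seg3"],
    {"backup_source": "client-a", "segments": 3},
)
```

Grouping the objects under a common key prefix is one simple way to keep a backup source's segments together; the abstract itself does not say how spatial locality is achieved.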
20 Claims
1. A computer-implemented method for managing data in cloud storage using deduplication techniques, comprising:
receiving, by a server from a client, a file divided, by the client, into a plurality of segments of data;
identifying unique segments of data and one or more groups of identical segments of data among the plurality of segments of data;
writing, by a processor, a portion of the plurality of segments of data to at least one container file, the at least one container file being associated with a single client, wherein the portion of the plurality of segments of data comprises each unique segment of data and one segment from each group of identical segments of data;
creating, by the processor, an index file for the at least one container file, wherein the index file comprises an association with client, and information regarding the portion of the plurality of segments of data written to the at least one container file; and
transmitting an object from the server to a virtual container on cloud storage, the object comprising the at least one container file, wherein a segment of the unique segments of data is maintained in the object in the same spatial locality as any other segments of the unique segments of data from the same backup source.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12
13. A computing device configured to manage data in cloud storage using deduplication techniques, comprising:
a processor;
memory in electronic communication with the processor;
a deduplication module configured to:
receive, by a server from a client, a file divided, by the client, into a plurality of segments of data;
write a portion of the plurality of segments of data to at least one container file, the at least one container file being associated with a single client, wherein the portion of the plurality of segments of data comprises each unique segment of data and one segment from each group of identical segments of data;
create an index file for the at least one container file, wherein the index file comprises an association with the client, and information regarding the portion of the plurality of segments of data written to the at least one container file; and
transmit an object from the server to a virtual container on cloud storage, the object comprising the at least one container file, wherein a segment of the unique segments of data is maintained in the object in the same spatial locality as any other segments of the unique segments of data from the same backup source.
Dependent claims: 14, 15, 16, 17, 18, 19
20. A computer-program product for managing data in cloud storage using deduplication techniques, the computer-program product comprising a non-transitory computer-readable medium having instructions thereon, the instructions comprising:
code programmed to receive, by a server from a client, a file divided, by the client, into a plurality of segments of data;
code programmed to write a portion of the plurality of segments of data to at least one container file, the at least one container file being associated with a single client, wherein the portion of the plurality of segments of data comprises each unique segment of data and one segment from each group of identical segments of data;
code programmed to create an index file for the at least one container file, wherein the index file comprises an association with the client, and information regarding the portion of the plurality of segments of data written to the at least one container file; and
code programmed to transmit an object from the server to a virtual container on cloud storage, the object comprising the at least one container file, wherein a segment of the unique segments of data is maintained in the object in the same spatial locality as any other segments of the unique segments of data from the same backup source.
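The identifying, writing, and index-creating steps recited in the independent claims can be illustrated with a short sketch. It is a hedged example under stated assumptions, not the claimed implementation: the claims do not say how identical segments are detected, so SHA-256 fingerprints are assumed here, and the names `deduplicate`, `build_index`, and `read_segment`, as well as the index layout, are hypothetical.

```python
import hashlib


def deduplicate(segments):
    """Write each distinct segment once, in arrival order.

    Returns (container_bytes, entries), where entries maps a segment
    fingerprint to its offset, length, and occurrence count.
    Fingerprinting with SHA-256 is an assumption of this sketch.
    """
    container = bytearray()
    entries = {}
    for seg in segments:
        fp = hashlib.sha256(seg).hexdigest()
        if fp in entries:
            # Member of a group of identical segments: count it, do not rewrite it.
            entries[fp]["count"] += 1
            continue
        # First occurrence: append contiguously so segments from this
        # client stay adjacent in the container file.
        entries[fp] = {"offset": len(container), "length": len(seg), "count": 1}
        container.extend(seg)
    return bytes(container), entries


def build_index(client_id, entries):
    """Index file content: the client association plus per-segment information."""
    return {"client": client_id, "segments": entries}


def read_segment(container, index, fingerprint):
    """Look up one stored segment through the index."""
    e = index["segments"][fingerprint]
    return container[e["offset"]:e["offset"] + e["length"]]


segments = [b"alpha", b"beta", b"alpha", b"gamma", b"beta"]
container, entries = deduplicate(segments)
index = build_index("client-a", entries)
```

In this example five received segments yield a container holding three stored segments: `b"gamma"` is unique, while `b"alpha"` and `b"beta"` each form a group of two identical segments from which one copy is kept.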
Specification