SYSTEM AND METHOD FOR EFFICIENT STORAGE OF SMALL FILES ON FILE-SYSTEM-BASED STORAGE DEVICES
First Claim
1. A method for data storage, comprising:
- receiving a plurality of files to be stored on a storage device that is managed by a File System (FS);
determining an association between at least first and second duplicate occurrences of a given file in the plurality;
assigning the files, including the given file, to one or more archives based on the association; and
sending the archives to the FS for storage on the storage device.
3 Assignments
0 Petitions
Accused Products
Abstract
Methods and systems for storing and managing large numbers of small files. A data processing system includes clients that generate large numbers be stored on a storage device managed by a File System (FS). An Archive Server (AS) receives multiple files from the client, archives the files in larger archives, and sends the archives to the FS for storage. When requested to read a file, the AS retrieves the archive in which the file is stored, extracts the file and sends it to the requesting client. In other words, the AS communicates with the clients in individual file units, and with the storage device in archive units. The AS is typically constructed as an add-on layer on top of a conventional FS, which enables the FS to handle small files efficiently without modification.
5 Citations
19 Claims
-
1. A method for data storage, comprising:
-
receiving a plurality of files to be stored on a storage device that is managed by a File System (FS); determining an association between at least first and second duplicate occurrences of a given file in the plurality; assigning the files, including the given file, to one or more archives based on the association; and sending the archives to the FS for storage on the storage device. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. An archiving system, comprising:
-
an interface, which is configured to communicate with one or more client devices; and one or more processors, which are configured to receive over the interface from the client devices a plurality of files to be stored on a storage device that is managed by a File System (FS), to determine an association between at least first and second duplicate occurrences of a given file in the plurality, to assign the files, including the given file, to one or more archives based on the association, and to send the archives to the FS for storage on the storage device. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A data processing system, comprising:
-
a storage device that is managed by a File System (FS); one or more client devices, which are configured to generate a plurality of files for storage on the storage device; and one or more archiving processors, which are configured to determine an association between at least first and second duplicate occurrences of a given file in the plurality, to assign the files, including the given file, to one or more archives based on the association, and to send the archives to the FS for storage on the storage device.
-
Specification