Burst buffer appliance with small file aggregation
First Claim
1. An apparatus comprising:
a burst buffer appliance adapted for coupling between a computer system and a file system;
the burst buffer appliance comprising:
a high-speed memory having a substantially lower access time than the file system; and
an analytics engine;
the burst buffer appliance being configured to aggregate multiple data files having a size below a designated threshold into a container having at least one associated metadata file for processing by the analytics engine;
wherein the burst buffer appliance directs storage of the container in the file system;
wherein the analytics engine is configured to generate at least a content-based portion of the one or more metadata files of the container based at least in part on content of one or more of the multiple data files so as to permit one or more subsequent processing operations to be performed using the one or more metadata files rather than the corresponding multiple data files.
Abstract
A burst buffer appliance is adapted for coupling between a computer system and a file system. The burst buffer appliance comprises a flash memory or other high-speed memory having a substantially lower access time than the file system, and an analytics engine, and is configured to aggregate multiple data files having a size below a designated threshold into a container having at least one associated metadata file for processing by the analytics engine. The burst buffer appliance directs storage of the container in the file system. The analytics engine is configured to generate at least a content-based portion of the one or more metadata files of the container based at least in part on content of one or more of the multiple data files so as to permit one or more subsequent processing operations to be performed using the metadata file(s) rather than the corresponding multiple data files.
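The aggregation described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the tar container format, the JSON metadata layout, the `SMALL_FILE_THRESHOLD` value, and the `content_summary` and `aggregate` helpers are all assumptions chosen for illustration. Files below the threshold are packed into one container, and a content-based metadata record is generated for each packed file.

```python
import io
import json
import tarfile

SMALL_FILE_THRESHOLD = 64 * 1024  # hypothetical "designated threshold", in bytes


def content_summary(data: bytes) -> dict:
    """Hypothetical content-based metadata: byte size, newline count, word count."""
    text = data.decode("utf-8", errors="replace")
    return {"size": len(data), "lines": text.count("\n"), "words": len(text.split())}


def aggregate(files: dict[str, bytes], threshold: int = SMALL_FILE_THRESHOLD):
    """Pack files smaller than `threshold` into a single tar container.

    Returns (container_bytes, metadata_json, passthrough_names), where
    passthrough_names lists files too large to be aggregated.
    """
    buf = io.BytesIO()
    metadata = {}
    passthrough = []
    with tarfile.open(fileobj=buf, mode="w") as tar:
        for name, data in sorted(files.items()):
            if len(data) >= threshold:
                passthrough.append(name)  # not a small file; leave it alone
                continue
            info = tarfile.TarInfo(name=name)
            info.size = len(data)
            tar.addfile(info, io.BytesIO(data))
            # The "analytics" step: derive metadata from the file content itself.
            metadata[name] = content_summary(data)
    return buf.getvalue(), json.dumps(metadata, sort_keys=True), passthrough
```

A later operation, such as listing which files contain more than some number of words, could then read only the JSON metadata and never unpack the container.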
18 Claims
1. An apparatus comprising:
a burst buffer appliance adapted for coupling between a computer system and a file system;
the burst buffer appliance comprising:
a high-speed memory having a substantially lower access time than the file system; and
an analytics engine;
the burst buffer appliance being configured to aggregate multiple data files having a size below a designated threshold into a container having at least one associated metadata file for processing by the analytics engine;
wherein the burst buffer appliance directs storage of the container in the file system;
wherein the analytics engine is configured to generate at least a content-based portion of the one or more metadata files of the container based at least in part on content of one or more of the multiple data files so as to permit one or more subsequent processing operations to be performed using the one or more metadata files rather than the corresponding multiple data files.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10
11. A method comprising:
configuring a burst buffer appliance adapted for coupling between a computer system and a file system to include a high-speed memory having a substantially lower access time than the file system; and
aggregating in the burst buffer appliance multiple files having a size below a designated threshold into a container having at least one associated metadata file;
processing the container in an analytics engine of the burst buffer appliance; and
storing the container in the file system;
wherein processing the container in the analytics engine comprises generating at least a content-based portion of the one or more metadata files of the container based at least in part on content of one or more of the multiple data files so as to permit one or more subsequent processing operations to be performed using the one or more metadata files rather than the corresponding multiple data files.
Dependent claims: 12, 13, 14, 15
16. An information processing system comprising:
a computer system;
a file system; and
a burst buffer appliance coupled between the computer system and the file system, the burst buffer appliance comprising:
a high-speed memory having a substantially lower access time than the file system; and
an analytics engine;
the burst buffer appliance being configured to aggregate multiple files having a size below a designated threshold into a container having at least one associated metadata file for processing by the analytics engine;
wherein the burst buffer appliance directs storage of the container in the file system;
wherein the analytics engine is configured to generate at least a content-based portion of the one or more metadata files of the container based at least in part on content of one or more of the multiple data files so as to permit one or more subsequent processing operations to be performed using the one or more metadata files rather than the corresponding multiple data files.
Dependent claims: 17, 18
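The flow recited in the method claim (stage small files in fast memory, aggregate them, generate metadata, store the container in the file system) can be sketched as a toy model. Everything here is an assumption made for illustration and not taken from the patent: the `BurstBufferSketch` class, the in-memory dict standing in for high-speed memory, the directory standing in for the file system, the `.dat` and `.meta.json` file names, and the choice of "largest integer in the file" as the content-based metadata.

```python
import json
import tempfile
from pathlib import Path


class BurstBufferSketch:
    """Toy model: a dict stands in for high-speed memory, a directory for the file system."""

    def __init__(self, backing_dir: Path, threshold: int):
        self.backing_dir = Path(backing_dir)
        self.threshold = threshold
        self.staged: dict[str, bytes] = {}  # small files awaiting aggregation

    def write(self, name: str, data: bytes) -> None:
        if len(data) < self.threshold:
            self.staged[name] = data  # absorb the small write in fast memory
        else:
            (self.backing_dir / name).write_bytes(data)  # large files pass through

    def flush(self, container_name: str) -> None:
        """Aggregate staged files, generate metadata, and store both in the file system."""
        blob = bytearray()
        index = {}
        for name, data in sorted(self.staged.items()):
            values = [int(tok) for tok in data.split() if tok.isdigit()]
            index[name] = {
                "offset": len(blob),
                "length": len(data),
                # Hypothetical content-based portion: largest integer in the file.
                "max_value": max(values, default=None),
            }
            blob += data
        (self.backing_dir / f"{container_name}.dat").write_bytes(bytes(blob))
        (self.backing_dir / f"{container_name}.meta.json").write_text(json.dumps(index))
        self.staged.clear()


def files_with_max_above(backing_dir, container_name: str, limit: int) -> list[str]:
    """A later operation answered from the metadata file alone; the data blob is never opened."""
    meta_path = Path(backing_dir) / f"{container_name}.meta.json"
    index = json.loads(meta_path.read_text())
    return sorted(
        name for name, m in index.items()
        if m["max_value"] is not None and m["max_value"] > limit
    )


def demo() -> list[str]:
    """Write two small files and one large file, flush, then query the metadata."""
    with tempfile.TemporaryDirectory() as d:
        bb = BurstBufferSketch(Path(d), threshold=32)
        bb.write("s1.txt", b"3 9 4")
        bb.write("s2.txt", b"1 2")
        bb.write("big.txt", b"7" * 64)  # at or above threshold, so it bypasses aggregation
        bb.flush("c0")
        return files_with_max_above(d, "c0", 5)
```

In this sketch the query in `files_with_max_above` touches only the small metadata file, which mirrors the claim language about performing subsequent processing "using the one or more metadata files rather than the corresponding multiple data files".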
Specification