DISTRIBUTED STORAGE OF DATA
First Claim
1. A file generation system for storage of structured data onto a distributed database, the file generation system comprising:
- a processor;
an intermediate data generation module coupled to the processor to generate a set of intermediate key-value pairs for each of one or more records in at least one subset of the structured data, wherein a key-value pair includes a key and a value corresponding to the key, and wherein the key is a unique identifier of the value;
an intermediate data sorting module coupled to the processor to sort the set of intermediate key-value pairs to generate a plurality of output files, wherein each of the plurality of output files includes at least one key-value pair; and
a file storing module coupled to the processor to store the plurality of output files in the distributed database, wherein the plurality of output files is representative of the structured data.
1 Assignment
0 Petitions
Accused Products
Abstract
A file generation system for storage of structured data onto a distributed database includes an intermediate data generation module to generate a set of intermediate key-value pairs for each of one or more records in at least one subset of the structured data. A key-value pair includes a key and a value corresponding to the key, where the key is a unique identifier of the value. A file generation system further includes an intermediate data sorting module to sort the set of intermediate key-value pairs to generate a plurality of output files. Each of the plurality of output files includes at least one key-value pair. Further, the file generation system includes a file storing module to store the plurality of output files in the distributed database, where the plurality of output files are representative of the structured data.
19 Citations
16 Claims
-
1. A file generation system for storage of structured data onto a distributed database, the file generation system comprising:
-
a processor; an intermediate data generation module coupled to the processor to generate a set of intermediate key-value pairs for each of one or more records in at least one subset of the structured data, wherein a key-value pair includes a key and a value corresponding to the key, and wherein the key is a unique identifier of the value; an intermediate data sorting module coupled to the processor to sort the set of intermediate key-value pairs to generate a plurality of output files, wherein each of the plurality of output files includes at least one key-value pair; and a file storing module coupled to the processor to store the plurality of output files in the distributed database, wherein the plurality of output files is representative of the structured data. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A method for storage of structured data onto a distributed database, the method comprising:
-
obtaining, from a user device, a plurality of subsets of the structured data to be stored in the distributed database, wherein each of the plurality of subsets includes one or more records; generating a set of intermediate key-value pairs for each of the one or more records in each of the plurality of subsets of the structured data, wherein a key-value pair includes a key and a value corresponding to the key, and wherein the key is a unique identifier of the value; sorting the set of intermediate key-value pairs to generate a plurality of output files, wherein each of the plurality of output files includes at least one key-value pair; and storing the plurality of output files in the distributed database, wherein the plurality of output files is representative of the structured data. - View Dependent Claims (12, 13, 14, 15)
-
-
16. A non-transitory computer-readable medium having embodied thereon a computer program for executing a method comprising:
-
obtaining, from a user device, a plurality of subsets of the structured data to be stored in the distributed database, wherein each of the plurality of subsets includes one or more records; generating a set of intermediate key-value pairs for each of the one or more records in each of the plurality of subsets of the structured data, wherein a key-value pair includes a key and a value corresponding to the key, and wherein the key is a unique identifier of the value; sorting the set of intermediate key-value pairs to generate a plurality of output files, wherein each of the plurality of output files includes at least one key-value pair; and storing the plurality of output files in the distributed database, wherein the plurality of output files is representative of the structured data.
-
Specification