×

Distributed storage of data

  • US 9,953,071 B2
  • Filed: 09/09/2014
  • Issued: 04/24/2018
  • Est. Priority Date: 09/10/2013
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented file generation system for storage of structured data onto a distributed database, the file generation system comprising:

  • a processor;

    an intermediate data generation module coupled to the processor to generate a set of intermediate key-value pairs for each of one or more records in at least one subset of the structured data, wherein a key-value pair includes a key and a value corresponding to the key, and wherein the key is a unique identifier of the value;

    an intermediate data sorting module coupled to the processor to sort the set of intermediate key-value pairs to generate a plurality of output files, wherein each of the plurality of output files includes at least one key-value pair, wherein generating the intermediate key-value pairs and sorting the intermediate key-value pairs is performed by one node without transferring the intermediate key-value pairs over a network;

    a data segregation module coupled to the processor to obtain, from a user device, the structured data to be stored in a distributed database over the network, wherein the structured data is organized as logical records and is identified by a unique key, and wherein each logical record is assigned to a mapper; and

    a file storing module coupled to the processor to store the plurality of output files in the distributed database, wherein the plurality of output files is representative of the structured data.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×