Distributed computing backup and recovery system

  • US 10,102,264 B2
  • Filed: 11/25/2014
  • Issued: 10/16/2018
  • Est. Priority Date: 09/30/2011
  • Status: Active Grant
First Claim

1. A method for distributed computing backup and recovery, comprising:

    retrieving a user selectable preference;

    identifying a first subset of data from within a data set according to at least one user selectable preference, the first subset of data comprising less than all of the data in the data set, and wherein the first subset is selectable by the user selectable preference;

    intercepting the first subset of data at an application programming interface (API);

    encrypting, by the API, at least a portion of the first subset of data into encrypted data objects that comprise a second subset of data that is less than all of the data in the first subset of data;

    receiving, into a memory via an interface controlled by a processor connected to a network in a computing environment, the second subset of data;

    evaluating, using the processor, a hash function stored in the memory to determine network storage locations or network retrieval locations, or both, for the encrypted data objects;

    storing, at a granular level that is less than all the data in the data set, multiple replica sets of the encrypted data objects across a plurality of different storage nodes included in the network storage locations according to the hash function, wherein each replica set of the encrypted data objects is stored across a respective cluster group of storage nodes from within the plurality of different storage nodes, and wherein the encrypted data objects stored across the plurality of different storage nodes are identified as replicas of data in the data set;

    retrieving, from the multiple replica sets of the encrypted data sets stored across their respective cluster groups of storage nodes, a preferred replica set of the encrypted data objects stored on a preferred cluster group of storage nodes in the network retrieval locations according to the hash function and an additional selection criteria including data freshness of the preferred replica set of the encrypted data objects, wherein the storage nodes comprising the preferred cluster group share the same hash function, and wherein data freshness includes a storage time of a respective encrypted data object;

    determining a hash seed used to recreate the hash function for an identified time and storing the hash seed for an identified time, at the plurality of different storage nodes included in the network storage locations, wherein the hash seed for the identified time is used to recreate the hash function for the identified time;

    wherein the hash seed is a random function based on an initial seed; and

    when a data object request comprises a request to store the encrypted data objects, recording write accesses that occur in the computing environment until the encrypted data objects are stored; and

    when the data object request comprises a request to retrieve the encrypted data objects, playing back the recording of write accesses until a restore completes.
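
The placement, retrieval, and recording steps recited above can be illustrated with a minimal Python sketch. It is not the patented implementation. Every name in it (`seed_for_time`, `place`, `Store`, `WriteJournal`) is hypothetical, and so are the choices of SHA-256, an integer epoch standing in for the "identified time", and one node per cluster group per object. Encryption is out of scope here: the blobs are assumed to be already-encrypted bytes.

```python
import hashlib
import random
from dataclasses import dataclass, field


def seed_for_time(initial_seed: int, epoch: int) -> int:
    """Derive the hash seed for an epoch from an initial seed (assumed scheme).

    The same (initial_seed, epoch) pair always yields the same seed, so the
    hash function in force at that time can be recreated later.
    """
    rng = random.Random(initial_seed)
    seed = rng.getrandbits(64)
    for _ in range(epoch):
        seed = rng.getrandbits(64)
    return seed


def place(key: str, seed: int, cluster_group: list) -> str:
    """Pick the node in one cluster group that holds `key` under this seed."""
    digest = hashlib.sha256(f"{seed}:{key}".encode()).digest()
    return cluster_group[int.from_bytes(digest[:8], "big") % len(cluster_group)]


@dataclass
class Store:
    cluster_groups: list              # one replica set per cluster group
    initial_seed: int
    nodes: dict = field(default_factory=dict)  # node -> {key: (stored_at, blob)}

    def put(self, key: str, blob: bytes, stored_at: float, epoch: int) -> None:
        """Write one replica of the object into every cluster group."""
        seed = seed_for_time(self.initial_seed, epoch)
        for group in self.cluster_groups:
            node = place(key, seed, group)
            self.nodes.setdefault(node, {})[key] = (stored_at, blob)

    def get(self, key: str, epoch: int) -> bytes:
        """Return the freshest replica, judged by its storage time."""
        seed = seed_for_time(self.initial_seed, epoch)
        candidates = []
        for group in self.cluster_groups:
            node = place(key, seed, group)
            entry = self.nodes.get(node, {}).get(key)
            if entry is not None:
                candidates.append(entry)
        if not candidates:
            raise KeyError(key)
        return max(candidates, key=lambda e: e[0])[1]


class WriteJournal:
    """Records write accesses during a backup and replays them after a restore."""

    def __init__(self) -> None:
        self.entries = []

    def record(self, key: str, value) -> None:
        self.entries.append((key, value))

    def replay(self, state: dict) -> dict:
        """Apply the recorded writes, in order, on top of restored state."""
        for key, value in self.entries:
            state[key] = value
        return state
```

In this sketch a caller would `put` an object under the current epoch and later `get` it with the same epoch, which recreates the same seed and therefore the same node choice in each cluster group. Writes that arrive while the backup is in flight go to `WriteJournal.record` and are applied with `replay` once the restore has produced a base state.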
