×

Geographically-distributed file system using coordinated namespace replication over a wide area network

  • US 9,495,381 B2
  • Filed: 03/31/2014
  • Issued: 11/15/2016
  • Est. Priority Date: 01/12/2005
  • Status: Active Grant
First Claim
Patent Images

1. A cluster of nodes comprising computing devices configured to implement a single geographically-distributed file system, the cluster comprising:

  • a first data center, comprising;

    a plurality of first DataNode computing devices, each configured to store data blocks of client files;

    a plurality of first local persistent storages;

    a plurality of first NameNode computing devices, each configured to update a state of a namespace of the cluster and each configured to store the updated state of the namespace in a first local persistent storage of the plurality of first local persistent storages;

    a second data center that is geographically remote from and coupled to the first data center by a wide area network, the second data center comprising;

    a plurality of second DataNode computing devices, each configured to store data blocks of client files;

    a plurality of second local persistent storages;

    a plurality of second NameNode computing devices, each configured to update the state of the namespace of the cluster and each configured to store the updated state of the namespace in a second local persistent storage of the plurality of second local persistent storages;

    wherein the plurality of first and second NameNode computing devices are configured to update the state of the namespace responsive to data blocks being written to the plurality of first and second DataNode computing devices; and

    a coordination engine process spanning the plurality of first NameNode computing devices and the plurality of second NameNode computing devices, the coordination engine process being configured to coordinate updates to the state of the namespace stored by the plurality of first and second NameNode computing devices such that the state of the namespace is maintained consistent across the first and second data centers of the cluster,wherein the coordination engine process is configured to receive proposals from the first and second plurality of NameNode computing devices to update the state of the namespace and to generate, in response, an ordered set of agreements that specifies an order in which the plurality of first and second NameNode computing devices are to update their respective stored state of the namespace, and wherein the plurality of first and second NameNode computing devices are configured to delay updates to the state of the namespace until the ordered set of agreements is received from the coordination engine process.

View all claims
  • 3 Assignments
Timeline View
Assignment View
    ×
    ×