
Systems and methods for distributed system scanning

  • US 8,214,334 B2
  • Filed: 07/15/2010
  • Issued: 07/03/2012
  • Est. Priority Date: 10/21/2005
  • Status: Active Grant
First Claim

1. A method for handling a failure in a distributed file system, the method comprising:

  • for at least one file of a plurality of files, storing the file across one or more memory devices of a plurality of nodes in a network, at least some of the files distributed in blocks across multiple nodes;

    creating a data block location information structure associated with the file and listing, for all of the blocks in the file, the memory devices and physical addresses where the blocks are stored;

    creating a consolidated device list which is associated with the file and which comprises an abbreviated listing of each of the one or more memory devices that are listed in the data block location information structure, but which does not include a duplicate listing of a particular memory device for all of the blocks in the file that are stored on the particular memory device, wherein the device list is separate from and includes less information than the data block location information structure;

    storing metadata data structures across the plurality of nodes, at least two of the metadata data structures stored on different nodes, each of the metadata data structures referencing information about a corresponding one of the plurality of files, and the metadata data structures comprising the data block location information structure and the consolidated device list of the corresponding file;

    detecting that one of the memory devices in the network has failed;

    using one or more processors, commanding the nodes to search each consolidated device list stored in each of their respective metadata data structures to determine whether the failed memory device is used to store information referenced by their respective metadata data structures, wherein searching each consolidated device list allows the nodes to make the determination without searching through the data block location information structure, which includes more information than the consolidated device list; and

    receiving an indication from the nodes of which metadata data structures reference information stored on the failed device.
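
The claimed steps can be illustrated with a minimal sketch. It is not the patented implementation. It assumes an in-memory model, and every name in it (`BlockLocation`, `FileMetadata`, `Node`, `handle_device_failure`, the device and node identifiers) is hypothetical. The sketch shows a per-file block location structure, a consolidated device list derived from it with duplicates removed, and a failure search that consults only the consolidated list.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class BlockLocation:
    """One entry of the block location structure: device plus physical address."""
    device_id: str
    address: int


@dataclass
class FileMetadata:
    """Hypothetical metadata structure for one file."""
    name: str
    block_locations: list  # full listing, one entry per block
    device_list: frozenset = field(init=False)  # consolidated, no duplicates

    def __post_init__(self):
        # Each device appears once, however many blocks it holds.
        self.device_list = frozenset(loc.device_id for loc in self.block_locations)


@dataclass
class Node:
    """Hypothetical node holding some of the metadata structures."""
    node_id: str
    metadata: list = field(default_factory=list)

    def find_affected(self, failed_device):
        # Only the consolidated device list is searched; the per-block
        # location structure is never walked.
        return [m.name for m in self.metadata if failed_device in m.device_list]


def handle_device_failure(nodes, failed_device):
    """Command every node to search its device lists and collect the replies."""
    return {node.node_id: node.find_affected(failed_device) for node in nodes}


nodes = [
    Node("node-1", [FileMetadata("a.txt", [
        BlockLocation("dev-1", 0),
        BlockLocation("dev-1", 4096),
        BlockLocation("dev-2", 0),
    ])]),
    Node("node-2", [FileMetadata("b.txt", [BlockLocation("dev-3", 0)])]),
]
report = handle_device_failure(nodes, "dev-1")
print(report)
```

In this sketch the device list for `a.txt` holds two entries while its block location structure holds three, so the membership test touches less data than a scan of every block would.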

  • 12 Assignments