×

Data locality in data integration applications

  • US 9,462,058 B1
  • Filed: 02/23/2016
  • Issued: 10/04/2016
  • Est. Priority Date: 11/19/2015
  • Status: Expired due to Fees
First Claim
Patent Images

1. A computer-implemented method comprising:

  • accessing a configuration file;

    identifying a logical node, said logical node being associated with one or more source stages;

    identifying one or more file block components, said one or more file block components comprising a retrieval target for at least one of said one or more source stages and being stored on a distributed file system;

    identifying one or more physical nodes;

    determining, for each of said one or more physical nodes, a degree value;

    identifying one or more qualified physical nodes from said one or more physical nodes having said degree value of one or more;

    creating a preferred physical node table, said preferred physical node table comprising, for each of said one or more qualified physical nodes, an identifying indication and an indication of said degree value;

    sorting said preferred physical node table based on said degree value associated with each of said one or more qualified physical nodes;

    determining a candidate preferred physical node based on each said degree value;

    determining whether said candidate preferred physical node is available for allocation to said logical node;

    responsive to said candidate preferred physical node being available for allocation to said logical node, allocating said candidate preferred physical node to said logical node; and

    responsive to said candidate preferred physical node not being available for allocation to said logical node;

    marking said candidate preferred physical node as unavailable for allocation to said logical node; and

    determining an alternative candidate preferred physical node based on each said degree value.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×