×

Techniques for data retrieval in a distributed computing environment

  • US 10,049,159 B2
  • Filed: 03/18/2014
  • Issued: 08/14/2018
  • Est. Priority Date: 03/15/2013
  • Status: Active Grant
First Claim
Patent Images

1. At least one non-transitory machine-readable storage medium comprising instructions that, when executed by a processor, cause the processor to:

  • receive a data request at a computing node of a distributed computing system to store a set of data, the computing node including a subset of data of the set of data;

    sort the subset of data into a defined order via one or more data sorting sessions;

    iteratively search among the sorted subset of data for a targeted beginning location of a data range that is responsive to the data request, each iteration to;

    receive a proposed value of one or more proposed values from one of a plurality of computing nodes of the distributed computing system, the proposed value used to search for the beginning location of the data range,utilize a binary search to locate a local insert location for the proposed value, the local insert location local to the subset of data of the computing node, the binary search to;

    determine a first sum value of locally stored values indicating a number of locally stored values that are less than the proposed value stored on the computing node,determine a second sum value of globally stored values less than the proposed value, wherein the globally stored values are values stored on the distributed computing system and the second sum value to include a summation of a number of values stored on other computing nodes of the distributed computing system and the first sum value,determine whether the second sum value is less than, equal to, or greater than the targeted beginning location,based on determining the second sum value is less than the targeted beginning location, the local insert location is set as a current lower bound of a bracketed range, andbased on determining the second sum value is greater than the targeted beginning location, the local insert location is set as an upper bound of the bracketed range, andbased on determining that the second sum value equals the targeted beginning location, set the local insert location to be the targeted beginning location, and cease searching among the sorted subset of data,share, by the computing node, a location of a greatest local value known to be less than the proposed value with the other computing nodes within the distributed system,cease searching among the sorted subset of data when the bracketed range equals zero, anddetermine, during at least one of the iterations, one of the one or more proposed values and share the one of the one or more proposed values with other computing nodes within the distributed computing system; and

    forward, from the computing node, data responsive to the search to another computing node of the distributed computing system to be merged with one or more other data from one or more other computing nodes.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×