×

Data caching for distributed execution computing

  • US 8,229,968 B2
  • Filed: 03/26/2008
  • Issued: 07/24/2012
  • Est. Priority Date: 03/26/2008
  • Status: Active Grant
First Claim
Patent Images

1. A method, comprising:

  • storing a plurality of subgraphs of a directed acyclic graph (DAG) in a remote global data storage of a distributed storage system;

    processing multiple subgraphs of the plurality of subgraphs in multiple computing devices of a DAG distributed execution engine, each of the multiple computing devices produces a vertex based on a corresponding subgraph and corresponding input values;

    processing a first subgraph of the plurality of subgraphs from the remote global data storage with associated input values in a computing device of the DAG distributed execution engine in a first iteration by at least producing a vertex using the first subgraph and the associated input values, and aggregating the vertex with vertices produced by the multiple computing devices to generate first output values;

    storing a second subgraph that is a copy of the first subgraph into a local cache of the computing device via a network that connects the local cache to the remote global data storage; and

    processing the second subgraph with the first output values to generate a second output values in response to determining that the computing device is to process the first subgraph in a second iteration,wherein the DAG represents a sparse matrix of linkage information between web pages, each of the associated input values represents a rank value of a source web page to associated destination web pages, and each output value represents a rank value of the source web page to other source web pages.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×