×

CACHE MANAGEMENT FOR MAP-REDUCE APPLICATIONS

  • US 20160062900A1
  • Filed: 08/18/2015
  • Published: 03/03/2016
  • Est. Priority Date: 08/29/2014
  • Status: Active Grant
First Claim
Patent Images

1. A method for managing a cache for a MapReduce application on a distributed file system, the method comprising:

  • receiving, by a computer, a map request for a MapReduce application on a distributed file system that includes one or more storage medium;

    receiving, by the computer, parameters for processing the map request, the parameters including a total data size to be processed, a size of each data record, and a number of map requests executing simultaneously;

    determining, by the computer, a cache size for processing the map request, wherein the cache size is determined based on the received parameters for processing the map request and a machine learning model for a map request cache size;

    reading, by the computer, based on the determined cache size, data from the one or more storage medium of the distributed file system into the cache;

    processing, by the computer, the map request; and

    writing, by the computer, an intermediate result data of the map request processing into the cache, based on the determined cache size.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×