Method and system for window-based churn handling in data cache
First Claim
Patent Images
1. A computer-implemented method for controlling access to a cache memory device, the method comprising:
- receiving a request from a client for writing a data segment associated with a file object stored to a storage system;
in response to the request, writing the data segment to one of a plurality of storage units of the storage system;
determining, in the storage system, a hierarchy of file regions of the file object based on access frequency and access recency of the file regions of the file object, wherein a first portion of the hierarchy of file regions is designated as top hot file regions based on a first percentage;
calculating a miss rate of a solid state device (SSD) operating as a cache memory device based on a plurality of previous requests from the client;
automatically adjusting, based on the calculated miss rate of the SSD, the first percentage to a second percentage, the second percentage used to determine a second portion of the hierarchy of file regions as top hot file regions of the file object;
determining whether an amount of access to a solid state device (SSD) operating as a cache memory device exceeds an access threshold based on a time window distribution, wherein the access threshold is a desired average of writes per day (WPD) to the SSD over a predetermined period of time; and
throttling access to the SSD for caching data in response to determining that the amount of access to the SSD has exceeded the access threshold, including writing the data segment to the SSD in response to determining that the data segment is associated with a top hot file region in the second portion of the hierarchy of file regions, and discarding the data segment in response to determining that the data segment is not associated with a top hot file region in the second portion of the hierarchy of file regions.
7 Assignments
0 Petitions
Accused Products
Abstract
According to some embodiment, a backup storage system receives a request from a client for writing a data segment associated with a file object stored to a storage system. In response to the request, the system writes the data segment to one of a plurality of storage units of the storage system. The system determines whether an amount of access to a solid state device (SSD) operating as a cache memory device exceeds an access threshold based on a time window distribution. The system throttles access to the SSD for caching data in response to determining that the amount of access to the SSD has exceeded the access threshold.
-
Citations
22 Claims
-
1. A computer-implemented method for controlling access to a cache memory device, the method comprising:
-
receiving a request from a client for writing a data segment associated with a file object stored to a storage system; in response to the request, writing the data segment to one of a plurality of storage units of the storage system; determining, in the storage system, a hierarchy of file regions of the file object based on access frequency and access recency of the file regions of the file object, wherein a first portion of the hierarchy of file regions is designated as top hot file regions based on a first percentage; calculating a miss rate of a solid state device (SSD) operating as a cache memory device based on a plurality of previous requests from the client; automatically adjusting, based on the calculated miss rate of the SSD, the first percentage to a second percentage, the second percentage used to determine a second portion of the hierarchy of file regions as top hot file regions of the file object; determining whether an amount of access to a solid state device (SSD) operating as a cache memory device exceeds an access threshold based on a time window distribution, wherein the access threshold is a desired average of writes per day (WPD) to the SSD over a predetermined period of time; and throttling access to the SSD for caching data in response to determining that the amount of access to the SSD has exceeded the access threshold, including writing the data segment to the SSD in response to determining that the data segment is associated with a top hot file region in the second portion of the hierarchy of file regions, and discarding the data segment in response to determining that the data segment is not associated with a top hot file region in the second portion of the hierarchy of file regions. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A non-transitory machine-readable medium having instructions stored therein, which when executed by a processor, cause the processor to perform operations for controlling access to a cache memory device, the operations comprising:
-
receiving a request from a client for writing a data segment associated with a file object stored to a storage system; in response to the request, writing the data segment to one of a plurality of storage units of the storage system; determining, in the storage system, a hierarchy of file regions of the file object based on access frequency and access recency of the file regions of the file object, wherein a first portion of the hierarchy of file regions is designated as top hot file regions based on a first percentage; calculating a miss rate of a solid state device (SSD) operating as a cache memory device based on a plurality of previous requests from the client; automatically adjusting, based on the calculated miss rate of the SSD, the first percentage to a second percentage, the second percentage used to determine a second portion of the hierarchy of file regions as top hot file regions of the file object; determining whether an amount of access to a solid state device (SSD) operating as a cache memory device exceeds an access threshold based on a time window distribution, wherein the access threshold is a desired average of writes per day (WPD) to the SSD over a predetermined period of time; and throttling access to the SSD for caching data in response to determining that the amount of access to the SSD has exceeded the access threshold, including writing the data segment to the SSD in response to determining that the data segment is associated with a top hot file region in the second portion of the hierarchy of file regions, and discarding the data segment in response to determining that the data segment is not associated with a top hot file region in the second portion of the hierarchy of file regions. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
-
17. A data processing system, comprising:
-
a processor; and a memory coupled to the processor to store instructions, which when executed by the processor, cause the processor to perform operations, the operations comprising; receiving a request from a client for writing a data segment associated with a file object stored to a storage system; in response to the request, writing the data segment to one of a plurality of storage units of the storage system; determining, in the storage system, a hierarchy of file regions of the file object based on access frequency and access recency of the file regions of the file object, wherein a first portion of the hierarchy of file regions is designated as top hot file regions based on a first percentage; calculating a miss rate of a solid state device (SSD) operating as a cache memory device based on a plurality of previous requests from the client; automatically adjusting, based on the calculated miss rate of the SSD, the first percentage to a second percentage, the second percentage used to determine a second portion of the hierarchy of file regions as top hot file regions of the file object; determining whether an amount of access to a solid state device (SSD) operating as a cache memory device exceeds an access threshold based on a time window distribution, wherein the access threshold is a desired average of writes per day (WPD) to the SSD over a predetermined period of time; and throttling access to the SSD for caching data in response to determining that the amount of access to the SSD has exceeded the access threshold, including writing the data segment to the SSD in response to determining that the data segment is associated with a top hot file region in the second portion of the hierarchy of file regions, and discarding the data segment in response to determining that the data segment is not associated with a top hot file region in the second portion of the hierarchy of file regions. - View Dependent Claims (18, 19, 20, 21, 22)
-
Specification