Method and system for file relocation
First Claim
1. A computer-implemented method, comprising:
reading history information on access requests for each file of a plurality of files from a memory;
obtaining an access request frequency for each file of the plurality of files;
defining a plurality of time periods;
associating each file of the plurality of files with only one of the plurality of defined time periods in which the respective file was access requested the largest number of times, thereby defining a unique set of files associated with each of the defined time periods, wherein each file appears only once in all of the sets of files associated with the defined time periods;
classifying the set of files associated with each of the defined time periods into a plurality of groups of files, each group of files comprising a plurality of files with similar access request frequencies;
relocating each group of files, each group being relocated to one of the storage devices, wherein the access request history information includes a list of access requests containing identification information of each file accessed, the access requests listed in the order that they are received; and
determining the quantity of simultaneous access requests that each of the plurality of files has been requested to be accessed with a concerned file, wherein access requests within a defined time period surrounding an access request of the concerned file are regarded as being accessed simultaneously with the concerned file;
and further comprising;
(a) calculating a co-occurrence vector for each file to be relocated, the co-occurrence vector containing an access request frequency component, the access request frequency component comprising the frequency that each file to be relocated was access requested simultaneously with the concerned file;
(b) determining a reference co-occurrence vector, the reference co-occurrence vector containing an access request frequency component of a most requested file of the plurality of files to be relocated;
(c) calculating an inner product of the co-occurrence vector of each file of the plurality of files to be relocated;
(d) classifying files having an inner product equal to or greater than a predetermined value into a group, the group including the most requested file that contains the reference co-occurrence vector; and
(e) acquiring K, where K is the number of storage devices into which files are to be relocated; and
(f) repeating steps b through d for ungrouped files K−1 number of times, on condition that there is a file whose inner product with the reference co-occurrence vector is equal to or greater than the predetermined value.
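As an illustration of steps (a) through (f), the following is a minimal Python sketch, not the patented implementation. It assumes the access history is a plain list of file identifiers in arrival order, and that "simultaneous" means within `window` positions of the concerned file's request. The names `cooccurrence_vectors`, `inner`, `group_files`, `window` and `threshold` are hypothetical. Two further assumptions go beyond the claim text: each vector carries a self component equal to the file's own request count (so that two files always requested together have a non-zero inner product), and any files left over after K−1 passes form the final group.

```python
from collections import Counter, defaultdict


def cooccurrence_vectors(log, window=1):
    """Step (a): build one co-occurrence vector per file.

    log is a list of file ids in the order the requests were received.
    Component j of file i's vector counts requests for j that fall within
    `window` positions of a request for i.
    """
    vectors = defaultdict(Counter)
    for i, concerned in enumerate(log):
        # Assumption: the self component is the file's own request count.
        vectors[concerned][concerned] += 1
        lo, hi = max(0, i - window), min(len(log), i + window + 1)
        for j in range(lo, hi):
            if j != i and log[j] != concerned:
                vectors[concerned][log[j]] += 1
    return vectors


def inner(u, v):
    """Step (c): inner product of two sparse vectors stored as Counters."""
    return sum(count * v[f] for f, count in u.items())


def group_files(log, k, threshold, window=1):
    """Steps (b) to (f): build up to k groups, one per storage device."""
    freq = Counter(log)
    vectors = cooccurrence_vectors(log, window)
    ungrouped = set(freq)
    groups = []
    for _ in range(k - 1):
        if not ungrouped:
            break
        # Step (b): the most requested ungrouped file supplies the reference
        # vector; sorting first makes the tie-break deterministic.
        ref = max(sorted(ungrouped), key=lambda f: freq[f])
        # Step (d): files whose inner product reaches the threshold join it.
        group = {f for f in ungrouped
                 if inner(vectors[f], vectors[ref]) >= threshold}
        group.add(ref)
        groups.append(group)
        ungrouped -= group
    if ungrouped:
        # Assumption: whatever remains becomes the last group.
        groups.append(ungrouped)
    return groups
```

With the log `["a", "b", "a", "b", "c", "d", "c", "d"]`, `k=2` and `threshold=10`, the sketch places `a` and `b` in one group and `c` and `d` in the other. The threshold depends on the scale of the counts, so a real system would likely need to tune or normalise it.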
Abstract
A method and system for file relocation is provided. A file relocation system is connected to multiple storage devices. The system obtains the number of times that each of the files stored on the multiple storage devices is requested simultaneously with a concerned file. The relocation system then classifies the files into multiple groups, such that files requested simultaneously are put together into one group. Files classified into the same group are then relocated to a single storage device. The relocation system then switches a storage device not accessed for a certain period of time into a power-saving mode.
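The last two steps of the abstract (one group per storage device, then power-saving for idle devices) can be sketched as follows. This is a hedged illustration only: the function names, the use of numeric device indices, and the `idle_limit` parameter are assumptions, and the actual data movement and the device power commands are outside its scope.

```python
def plan_relocation(groups):
    """Map each file to the index of the device that will hold its group.

    groups is a list of sets of file ids; group n goes to device n
    (an assumed, simplest-possible placement rule).
    """
    return {f: dev for dev, group in enumerate(groups) for f in group}


def devices_to_power_down(last_access, now, idle_limit):
    """Return the devices that have been idle for at least idle_limit.

    last_access maps a device index to the time of its latest request,
    in the same units as `now` and `idle_limit`.
    """
    return sorted(dev for dev, t in last_access.items()
                  if now - t >= idle_limit)
```

Because co-requested files share a device, requests tend to concentrate on a few devices at a time, which is what gives the remaining devices a chance to stay idle long enough to qualify.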
12 Claims
1. A computer-implemented method, comprising:
reading history information on access requests for each file of a plurality of files from a memory;
obtaining an access request frequency for each file of the plurality of files;
defining a plurality of time periods;
associating each file of the plurality of files with only one of the plurality of defined time periods in which the respective file was access requested the largest number of times, thereby defining a unique set of files associated with each of the defined time periods, wherein each file appears only once in all of the sets of files associated with the defined time periods;
classifying the set of files associated with each of the defined time periods into a plurality of groups of files, each group of files comprising a plurality of files with similar access request frequencies;
relocating each group of files, each group being relocated to one of the storage devices, wherein the access request history information includes a list of access requests containing identification information of each file accessed, the access requests listed in the order that they are received; and
determining the quantity of simultaneous access requests that each of the plurality of files has been requested to be accessed with a concerned file, wherein access requests within a defined time period surrounding an access request of the concerned file are regarded as being accessed simultaneously with the concerned file;
and further comprising;
(a) calculating a co-occurrence vector for each file to be relocated, the co-occurrence vector containing an access request frequency component, the access request frequency component comprising the frequency that each file to be relocated was access requested simultaneously with the concerned file;
(b) determining a reference co-occurrence vector, the reference co-occurrence vector containing an access request frequency component of a most requested file of the plurality of files to be relocated;
(c) calculating an inner product of the co-occurrence vector of each file of the plurality of files to be relocated;
(d) classifying files having an inner product equal to or greater than a predetermined value into a group, the group including the most requested file that contains the reference co-occurrence vector; and
(e) acquiring K, where K is the number of storage devices into which files are to be relocated; and
(f) repeating steps b through d for ungrouped files K−1 number of times, on condition that there is a file whose inner product with the reference co-occurrence vector is equal to or greater than the predetermined value.
Dependent claims: 2, 3, 4
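The time-period steps of claim 1 (associating each file with the single period in which it was requested most often, then splitting each period's set by request frequency) might look like the sketch below. It assumes each history entry carries a timestamp alongside the file identifier and that the caller supplies a `period_of` function mapping a timestamp to a period label; both are assumptions, as are the function names, the tie-break rule, and the equal-sized split used to approximate "similar access request frequencies".

```python
from collections import Counter, defaultdict


def peak_period_sets(accesses, period_of):
    """Assign every file to the one period in which it was requested most.

    accesses is an iterable of (file_id, timestamp) pairs.
    Returns {period label: set of file ids}; each file appears exactly once.
    """
    per_file = defaultdict(Counter)
    for file_id, ts in accesses:
        per_file[file_id][period_of(ts)] += 1
    sets = defaultdict(set)
    for file_id, counts in per_file.items():
        # Assumption: ties go to the smallest period label.
        best = min(counts, key=lambda p: (-counts[p], p))
        sets[best].add(file_id)
    return dict(sets)


def split_by_frequency(files, freq, n_groups):
    """Split one period's files into groups of neighbouring frequency.

    Files are ranked by request count and cut into contiguous chunks,
    a simple stand-in for the claim's frequency-based classification.
    """
    ranked = sorted(files, key=lambda f: (-freq[f], f))
    if not ranked:
        return []
    size = -(-len(ranked) // n_groups)  # ceiling division
    return [ranked[i:i + size] for i in range(0, len(ranked), size)]
```

For instance, with a `period_of` that labels hours 8 to 19 as `"day"` and all others as `"night"`, a file requested twice by day and once at night lands only in the `"day"` set.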
5. A computer program product for controlling input and output of a plurality of storage devices, the computer program product comprising a computer readable storage medium having program instructions embodied therewith, wherein the computer readable storage medium is not a transitory signal per se, the program instructions executable by a computer to cause the computer to perform a method comprising:
reading, by the computer, history information on access requests for each file of a plurality of files from a memory of the controller;
obtaining, by the computer, an access request frequency for each file of the plurality of files;
defining, by the computer, a plurality of time periods;
associating, by the computer, each file of the plurality of files with only one of the plurality of defined time periods in which the respective file was access requested the largest number of times, thereby defining a unique set of files associated with each of the defined time periods, wherein each file appears only once in all of the sets of files associated with the defined time periods;
classifying, by the computer, the set of files associated with each of the defined time periods into a plurality of groups of files, each group of files comprising a plurality of files with similar access request frequencies;
relocating, by the computer, each group of files, each group being relocated to one of the storage devices, wherein the access request history information includes a list of access requests containing identification information of each file accessed, the access requests listed in the order that they are received;
determining, by the computer, the quantity of simultaneous access requests that each of the plurality of files has been requested to be accessed with a concerned file, wherein access requests within a defined time period surrounding an access request of the concerned file are regarded as being accessed simultaneously with the concerned file; and
(a) calculating, by the computer, a co-occurrence vector for each file to be relocated, the co-occurrence vector containing an access request frequency component, the access request frequency component comprising the frequency that each file to be relocated was access requested simultaneously with the concerned file;
(b) determining, by the computer, a reference co-occurrence vector, the reference co-occurrence vector containing an access request frequency component of a most requested file of the plurality of files to be relocated;
(c) calculating, by the computer, an inner product of the co-occurrence vector of each file of the plurality of files to be relocated;
(d) classifying, by the computer, files having an inner product equal to or greater than a predetermined value into a group, the group including the most requested file that contains the reference co-occurrence vector; and
(e) acquiring, by the computer, K, where K is the number of storage devices into which files are to be relocated; and
(f) repeating, by the computer, steps b through d for ungrouped files K−1 number of times, on condition that there is a file whose inner product with the reference co-occurrence vector is equal to or greater than the predetermined value.
Dependent claims: 6, 7, 8
9. An apparatus, comprising:
a history information storage configured to store history information of access requests to each of the plurality of files;
a counter configured to obtain an access request frequency for each file of the plurality of files;
a time period defining unit configured to define a plurality of time periods;
an associating unit configured to associate each file of the plurality of files with only one of the plurality of defined time periods in which the respective file was access requested the largest number of times, thereby defining a unique set of files associated with each of the defined time periods, wherein each file appears only once in all of the sets of files associated with the defined time periods;
a classifier configured to classify the set of files associated with each of the defined time periods into a plurality of groups of files, each group of files comprising a plurality of files with similar access request frequencies; and
a relocating unit configured to relocate each group of files, each group being relocated to one of the storage devices, wherein the access request history information includes a list of access requests containing identification information of each file accessed, the access requests listed in the order that they are received;
a unit configured to determine the quantity of simultaneous access requests that each of the plurality of files has been requested to be accessed with a concerned file, wherein access requests within a defined time period surrounding an access request of the concerned file are regarded as being accessed simultaneously with the concerned file;
a co-occurrence vector calculating unit configured to calculate a co-occurrence vector for each file to be relocated, the co-occurrence vector containing an access request frequency component, the access request frequency component comprising the frequency that each file to be relocated was access requested simultaneously with the concerned file;
a co-occurrence vector determining unit configured to determine a reference co-occurrence vector, the reference co-occurrence vector containing an access request frequency component of a most requested file of the plurality of files to be relocated;
a judging unit configured to calculate an inner product of the co-occurrence vector of each file of the plurality of files to be relocated, wherein the classifier is configured to classify files having an inner product equal to or greater than a predetermined value into a group, the group including the most requested file that contains the reference co-occurrence vector.
Dependent claims: 10, 11, 12
Specification