Shared file system predictive storage techniques
First Claim
1. A method of predictive data storage for a node device in a data distribution system, the method comprising using at least one processor to perform operations of:
- scoring each particular file system element of a plurality of file system elements of a collection of the data distribution system, the scoring using a calculated probability that a user of the node device is likely to interact with the particular file system element;
calculating an on-demand subset of the collection based upon the scores of the plurality of file system elements, wherein the on-demand subset includes fewer elements than the collection, wherein calculating the on-demand subset includes selecting a combination of the plurality of file system elements that results in a highest combined score of selected file system elements given a file system element size constraint on a maximum total size of the elements in the on-demand subset; and
causing at least one file system element of the on-demand subset to be placed in local storage of the node device in response to a determination that the at least one file system element of the on-demand subset is not already in the local storage of the node device.
4 Assignments
0 Petitions
Accused Products
Abstract
Disclosed in some examples are predictive storage techniques for use in a distributed data system. The predictive storage techniques may be used to manage locally stored elements of a shared data collection, such as the storage of files on nodes of the distributed data system that are limited in local storage space. The predictive storage techniques may achieve a balance between consumption of local resources and timely access of important elements in the shared data collection. For example, the predictive storage techniques may be used for keeping or pre-caching certain items of a collection that are determined as likely to be used in local storage for convenient access, and allowing access the remaining items on request over a network.
-
Citations
21 Claims
-
1. A method of predictive data storage for a node device in a data distribution system, the method comprising using at least one processor to perform operations of:
-
scoring each particular file system element of a plurality of file system elements of a collection of the data distribution system, the scoring using a calculated probability that a user of the node device is likely to interact with the particular file system element; calculating an on-demand subset of the collection based upon the scores of the plurality of file system elements, wherein the on-demand subset includes fewer elements than the collection, wherein calculating the on-demand subset includes selecting a combination of the plurality of file system elements that results in a highest combined score of selected file system elements given a file system element size constraint on a maximum total size of the elements in the on-demand subset; and causing at least one file system element of the on-demand subset to be placed in local storage of the node device in response to a determination that the at least one file system element of the on-demand subset is not already in the local storage of the node device. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A system for predictive data storage for a node device in a data distribution system, the system comprising:
-
a prediction module comprising circuitry to; score each particular file system element of a plurality of file system elements of a collection of the data distribution system, the score based upon a calculated probability that a user of the node device is likely to interact with the particular file system element; a control module comprising circuitry to; calculate an on-demand subset of the collection based upon the scores of the plurality of file system elements, wherein the on-demand subset includes fewer elements than the collection, wherein calculating the on-demand subset includes selecting a combination of the plurality of file system elements that results in a highest combined score of selected file systems elements given a file system element size constraint on a maximum total size of the elements in the on-demand subset; and cause at least one file system element of the on-demand subset to be placed in local storage of the node responsive to a determination that the at least one file system element of the on-demand subset is not already in the local storage of the node device. - View Dependent Claims (10, 11, 12, 13)
-
-
14. A non-transitory machine-readable medium, for predictive data storage for a node device in a data distribution system, the machine-readable medium including instructions, which when performed by the machine, cause the machine to perform operations of:
-
scoring each particular file system element of a plurality of file system elements of a collection of the data distribution system, the scoring using a calculated probability that a user of the node device is likely to interact with the particular file system element; calculating an on-demand subset of the collection based upon the scores of the plurality of file system elements, wherein the on-demand subset includes fewer elements than the collection, wherein calculating the on-demand subset includes selecting a combination of the plurality of file system elements that results in a highest combined score of selected file system elements given a file system element size constraint on a maximum total size of the elements in the on-demand subset; and causing at least one file system element of the on-demand subset to be placed in local storage of the node device in response to a determination that the at least one file system element of the on-demand subset is not already in the local storage of the node device. - View Dependent Claims (15, 16, 17, 18, 19, 20, 21)
-
Specification