Classification of data objects in a distributed file system based on application creation and/or access information
First Claim
1. A computer-implemented method comprising:
- retrieving, by a cloud-based server, data objects from a distributed file system;
processing, by the cloud-based server, the data objects to determine applications on one or more remote computing devices that created or accessed respective ones of the data objects;
classifying the data objects according to corresponding applications that created or accessed the data objects resulting in application classification information, the application classification information associating each data object with a set of applications that created or accessed the data object;
storing the application classification information in a cloud-based memory system associated with the cloud-based server;
identifying modified data objects in the distributed file based system;
accessing, by the cloud-based server, a synchronization priority scheme for a particular remote computing device, wherein the synchronization priority scheme indicates a priority for synchronizing the modified data objects to the particular remote computing device; and
modifying, by the cloud-based server, the synchronization priority scheme based on the application classification information and a plurality of applications installed on the particular remote computing device,wherein a modified data object is deprioritized in the synchronization priority scheme when the plurality of applications installed on the particular remote computing device and the associated set of applications that created or accessed the modified data object are disjoint, wherein a regular modified data object is batched in a first synchronization batch, wherein a deprioritized modified data object is batched in a second synchronization batch, wherein data objects in the second synchronization batch are synchronized to the particular remote computing device based on a type of an access network, and wherein data objects in the first synchronization batch are synchronized to the particular remote computing device regardless of the type of the access network.
4 Assignments
0 Petitions
Accused Products
Abstract
Technology is disclosed for classification of data objects (e.g., files) in a distributed file system based on application creation and/or access information. In some embodiments, data objects of a user are stored across multiple computing devices associated with a user, e.g., mobile computing devices such as a tablet, a laptop, or a smartphone, and/or at server systems, e.g., cloud-based storage servers, in a distributed cloud-based storage system. The technology described tracks the applications that create or touch particular data objects in the distributed file system and classifies the data objects according to these applications resulting in various application classifications. For example, a classification engine can process data objects in the distributed file system to identify an application that created the data object and/or applications that have accessed the data object. The classification information can be saved by, for example, a cloud-based storage server.
5 Citations
16 Claims
-
1. A computer-implemented method comprising:
-
retrieving, by a cloud-based server, data objects from a distributed file system; processing, by the cloud-based server, the data objects to determine applications on one or more remote computing devices that created or accessed respective ones of the data objects; classifying the data objects according to corresponding applications that created or accessed the data objects resulting in application classification information, the application classification information associating each data object with a set of applications that created or accessed the data object; storing the application classification information in a cloud-based memory system associated with the cloud-based server; identifying modified data objects in the distributed file based system; accessing, by the cloud-based server, a synchronization priority scheme for a particular remote computing device, wherein the synchronization priority scheme indicates a priority for synchronizing the modified data objects to the particular remote computing device; and modifying, by the cloud-based server, the synchronization priority scheme based on the application classification information and a plurality of applications installed on the particular remote computing device, wherein a modified data object is deprioritized in the synchronization priority scheme when the plurality of applications installed on the particular remote computing device and the associated set of applications that created or accessed the modified data object are disjoint, wherein a regular modified data object is batched in a first synchronization batch, wherein a deprioritized modified data object is batched in a second synchronization batch, wherein data objects in the second synchronization batch are synchronized to the particular remote computing device based on a type of an access network, and wherein data objects in the first synchronization batch are synchronized to the particular remote computing device regardless of the type of the access network. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A computer-implemented method comprising:
-
retrieving, by a cloud-based server, data objects from a distributed file system; processing, by the cloud-based server, the data objects to determine applications on one or more remote computing devices that created or accessed respective ones of the data objects; classifying the data objects according to corresponding applications that created or accessed the data objects resulting in application classification information, wherein classifying a particular data object comprises marking, linking or tagging the particular data object with a corresponding application that created or accessed the particular data object; identifying modified data objects in the distributed file based system; accessing, by the cloud-based server, a synchronization priority scheme for a particular remote computing device, wherein the synchronization priority scheme indicates a priority for synchronizing the modified data objects to the particular remote computing device; modifying, by the cloud-based server, the synchronization priority scheme based on the application classification information and a plurality of applications installed on the particular remote computing device; and synchronizing, by the cloud-based server, the particular remote computing device based on the modified synchronization priority scheme, wherein a modified data object is deprioritized in the synchronization priority scheme when the plurality of applications installed on the particular remote computing device and the associated set of applications that created or accessed the modified data object are disjoint, wherein a regular modified data object is batched in a first synchronization batch, wherein a deprioritized modified data object is batched in a second synchronization batch, wherein data objects in the second synchronization batch are synchronized to the particular remote computing device based on a type of an access network, and wherein data objects in the first synchronization batch are synchronized to the particular remote computing device regardless of the type of the access network. - View Dependent Claims (10, 11)
-
-
12. A cloud-based server system comprising:
-
a processor; a memory system having instructions stored thereon, wherein the instructions, when executed by the processor, cause the cloud-based system to; retrieve data objects from a distributed file system; process the data objects to determine applications on one or more remote computing devices that created or accessed respective ones of the data objects; classify the data objects according to corresponding applications that created or accessed the data objects resulting in application classification information, the application classification information associating each data object with a set of applications that created or accessed the data object; store the application classification information in a cloud-based memory system associated with the cloud-based server; access, by the cloud-based server, a synchronization priority scheme for a particular remote computing device, wherein the synchronization priority scheme indicates a priority for synchronizing the modified data objects to the particular remote computing device; modify, by the cloud-based server, the synchronization priority scheme based on the application classification information and a plurality of applications installed on the particular remote computing device; and synchronize, by the cloud-based server, the particular remote computing device based on the modified synchronization priority scheme, wherein a modified data object is deprioritized in the synchronization priority scheme when the plurality of applications installed on the particular remote computing device and the associated set of applications that created or accessed the modified data object are disjoint, wherein a regular modified data object is batched in a first synchronization batch, wherein a deprioritized modified data object is batched in a second synchronization batch, wherein data objects in the second synchronization batch are synchronized to the particular remote computing device based on a type of an access network, and wherein data objects in the first synchronization batch are synchronized to the particular remote computing device regardless of the type of the access network. - View Dependent Claims (13, 14)
-
-
15. A non-transitory computer-readable storage medium storing instructions to be implemented by a computer system having a processor, wherein the instructions, when executed by the processor, cause the computer system to:
-
retrieve data objects from a distributed file system; process the data objects to determine applications on one or more remote computing devices that created or accessed respective ones of the data objects; classify the data objects according to corresponding applications that created or accessed the data objects resulting in application classification information, the application classification information associating each data object with a set of applications that created or accessed the data object; store the application classification information in a cloud-based memory system associated with the computer system; identify modified data objects in the distributed file based system; access a synchronization priority scheme for a particular remote computing device, wherein the synchronization priority scheme indicates a priority for synchronizing the modified data objects to the particular remote computing device; modify the synchronization priority scheme based on the application classification information and a plurality of applications installed on the particular remote computing device; and synchronize the particular remote computing device based on the modified synchronization priority scheme, wherein a modified data object is deprioritized in the synchronization priority scheme when the plurality of applications installed on the particular remote computing device and the associated set of applications that created or accessed the modified data object are disjoint, wherein a regular modified data object is batched in a first synchronization batch, wherein a deprioritized modified data object is batched in a second synchronization batch, wherein data objects in the second synchronization batch are synchronized to the particular remote computing device based on a type of an access network, and wherein data objects in the first synchronization batch are synchronized to the particular remote computing device regardless of the type of the access network. - View Dependent Claims (16)
-
Specification