Determining a likelihood of a resource experiencing a problem based on telemetry data
First Claim
1. A method comprising:
- defining a feature set, an individual feature in the feature set being related to telemetry information associated with a monitored resource, the monitored resource comprising at least one of an application, a device, or a network of devices;
receiving, via a network at a first processing node of first computing infrastructure and from a plurality of other processing nodes of a plurality of other computing infrastructures, a plurality of local models that individually comprise a set of local model parameters computed via stochastic gradient descent (SGD) based at least in part on a training data subset that includes multiple data instances of the feature set and, for each data instance of the feature set, a label indicating whether the monitored resource or a user of the monitored resource experiences a problem with respect to performance or completion of one or more operations or tasks, wherein the plurality of local models and the sets of local model parameters comprised therein are computed in parallel by the plurality of other processing nodes based at least in part on a set of starting model parameters;
receiving, at the first processing node and from the plurality of other processing nodes, a plurality of symbolic representations associated with the plurality of local models, wherein an individual symbolic representation associated with an individual local model is computed to represent how an adjustment to the set of starting model parameters affects the set of local model parameters computed for the individual local model by shifting the set of starting model parameters to a known set of starting model parameters associated with an output of another local model;
combining, at the first processing node using the plurality of symbolic representations to honor sequential dependencies of SGD, the plurality of local models received from the plurality of other processing nodes with a local model computed at the first processing node, the combining generating a global model that includes a set of global model parameters, the global model configured to determine, given a new data instance of the feature set, a likelihood of another monitored resource or another user of the other monitored resource experiencing the problem;
generating, at the first processing node, the new data instance of the feature set based on new telemetry data associated with the other monitored resource; and
determining, using the global model and the new data instance of the feature set, the likelihood of the other monitored resource or the other user of the other monitored resource experiencing the problem.
1 Assignment
0 Petitions
Accused Products
Abstract
Described herein is a system that transmits and combines local models, that individually comprise a set of local parameters computed via stochastic gradient descent (SGD), into a global model that comprises a set of global model parameters. The local models are computed in parallel at different geographic locations along with symbolic representations. Network transmission of the local models and the symbolic representations, rather than transmission of the large training data subsets processed to compute the local models and symbolic representations, conserves resources and decreases latency. The global model can then be used as a model to determine a likelihood of a monitored resource or a user of the monitored resource experiencing a problem with respect to performance or completion of one or more operations. The system can also implement an action to assist in resolving or avoiding the problem.
40 Citations
20 Claims
-
1. A method comprising:
-
defining a feature set, an individual feature in the feature set being related to telemetry information associated with a monitored resource, the monitored resource comprising at least one of an application, a device, or a network of devices; receiving, via a network at a first processing node of first computing infrastructure and from a plurality of other processing nodes of a plurality of other computing infrastructures, a plurality of local models that individually comprise a set of local model parameters computed via stochastic gradient descent (SGD) based at least in part on a training data subset that includes multiple data instances of the feature set and, for each data instance of the feature set, a label indicating whether the monitored resource or a user of the monitored resource experiences a problem with respect to performance or completion of one or more operations or tasks, wherein the plurality of local models and the sets of local model parameters comprised therein are computed in parallel by the plurality of other processing nodes based at least in part on a set of starting model parameters; receiving, at the first processing node and from the plurality of other processing nodes, a plurality of symbolic representations associated with the plurality of local models, wherein an individual symbolic representation associated with an individual local model is computed to represent how an adjustment to the set of starting model parameters affects the set of local model parameters computed for the individual local model by shifting the set of starting model parameters to a known set of starting model parameters associated with an output of another local model; combining, at the first processing node using the plurality of symbolic representations to honor sequential dependencies of SGD, the plurality of local models received from the plurality of other processing nodes with a local model computed at the first processing node, the combining generating a global model that includes a set of global model parameters, the global model configured to determine, given a new data instance of the feature set, a likelihood of another monitored resource or another user of the other monitored resource experiencing the problem; generating, at the first processing node, the new data instance of the feature set based on new telemetry data associated with the other monitored resource; and determining, using the global model and the new data instance of the feature set, the likelihood of the other monitored resource or the other user of the other monitored resource experiencing the problem. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A system comprising:
-
one or more processing units; and a computer-readable medium having encoded thereon computer-executable instructions to cause the one or more processing units to; receive, via a network at a first processing node and from a plurality of other processing nodes, a plurality of local models that individually comprise a set of local model parameters computed via stochastic gradient descent (SGD) based at least in part on a training data subset that includes multiple data instances of a feature set and, for each data instance of the feature set, a label indicating whether the monitored resource or a user of the monitored resource experiences a problem with respect to performance or completion of one or more operations or tasks, wherein the plurality of local models and the sets of local model parameters comprised therein are computed in parallel by the plurality of other processing nodes based at least in part on a set of starting model parameters; receive, at the first processing node and from the plurality of other processing nodes, a plurality of symbolic representations associated with the plurality of local models, wherein an individual symbolic representation associated with an individual local model is computed to represent how an adjustment to the set of starting model parameters affects the set of local model parameters computed for the individual local model by shifting the set of starting model parameters to a known set of starting model parameters associated with an output of another local model; combine, at the first processing node using the plurality of symbolic representations to honor sequential dependencies of SGD, the plurality of local models received from the plurality of other processing nodes with a local model computed at the first processing node, the combining generating a global model that includes a set of global model parameters, the global model configured to determine, given a new data instance of the feature set, a likelihood of another monitored resource or another user of the other monitored resource experiencing the problem; generate the new data instance of the feature set based on new telemetry data associated with the other monitored resource; and determine, using the global model and the new data instance of the feature set, the likelihood of the other monitored resource or the other user of the other monitored resource experiencing the problem. - View Dependent Claims (16, 17, 18, 19)
-
-
20. A system comprising:
-
one or more processing units; and a computer-readable medium having encoded thereon computer-executable instructions to cause the one or more processing units to; compute a local model that comprises a set of local model parameters computed via stochastic gradient descent (SGD) based at least in part on a training data subset that includes multiple data instances of a feature set and, for each data instance of the feature set, a label indicating whether the monitored resource or a user of the monitored resource experiences a problem with respect to performance or completion of one or more operations or tasks; compute a symbolic representation associated with the local model, wherein the symbolic representation comprises a matrix that represents how an adjustment to a set of starting model parameters affects the set of local model parameters computed for the local model by shifting the set of starting model parameters to a known set of starting model parameters associated with an output of another local model; reduce a size of the matrix by projecting the matrix from a first dimensional space to a second dimensional space of smaller dimension; and transmit the local model and the symbolic representation to processing nodes over a network to enable a global model to be generated, the global model useable to determine, given a new data instance of the feature set, a likelihood of another monitored resource or another user of the other monitored resource experiencing the problem.
-
Specification