NEURAL NETWORK TRAINING UTILIZING SPECIALIZED LOSS FUNCTIONS
First Claim
1. A method, comprising:
- receiving, by a computer system, a training dataset comprising a plurality of images, wherein each image of the training dataset is associated with an identifier of a class of a set of classes;
computing, by a neural network, a plurality of feature vectors, wherein each feature vector of the plurality of feature vectors represents an image of the training dataset in a space of image features;
computing, for the training dataset, a value of a loss function reflecting a plurality of probabilities, wherein each probability of the plurality of probabilities characterizes a hypothesis associating an image of the training dataset with a class associated with the image by the training dataset, wherein the loss function further reflects a plurality of distances, wherein each distance of the plurality of distances is computed in the space of image features between a feature vector representing an image of the training dataset and a center of a class associated with the image by the training dataset; and
adjusting one or more parameters of the neural network based on the value of the loss function.
1 Assignment
0 Petitions
Accused Products
Abstract
Systems and methods for neural network training utilizing specialized loss functions. An example method comprises: receiving, by a computer system, a training dataset comprising a plurality of images, wherein each image of the training dataset is associated with an identifier of a class of a set of classes; computing, by a neural network, a plurality of feature vectors, wherein each feature vector of the plurality of feature vectors represents an image of the training dataset in a space of image features; computing, for the training dataset, a value of a loss function reflecting a plurality of probabilities, wherein each probability of the plurality of probabilities characterizes a hypothesis associating an image of the training dataset with a class associated with the image by the training dataset, wherein the loss function further reflects a plurality of distances, wherein each distance of the plurality of distances is computed in the space of image features between a feature vector representing an image of the training dataset and a center of a class associated with the image by the training dataset; and adjusting one or more parameters of the neural network based on the value of the loss function.
-
Citations
20 Claims
-
1. A method, comprising:
-
receiving, by a computer system, a training dataset comprising a plurality of images, wherein each image of the training dataset is associated with an identifier of a class of a set of classes; computing, by a neural network, a plurality of feature vectors, wherein each feature vector of the plurality of feature vectors represents an image of the training dataset in a space of image features; computing, for the training dataset, a value of a loss function reflecting a plurality of probabilities, wherein each probability of the plurality of probabilities characterizes a hypothesis associating an image of the training dataset with a class associated with the image by the training dataset, wherein the loss function further reflects a plurality of distances, wherein each distance of the plurality of distances is computed in the space of image features between a feature vector representing an image of the training dataset and a center of a class associated with the image by the training dataset; and adjusting one or more parameters of the neural network based on the value of the loss function. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system, comprising:
-
a memory; a processor, coupled to the memory, the processor configured to; receive a training dataset comprising a plurality of images, wherein each image of the training dataset is associated with an identifier of a class of a set of classes; compute, by a neural network, a plurality of feature vectors, wherein each feature vector of the plurality of feature vectors represents an image of the training dataset in a space of image features; compute, for the training dataset, a value of a loss function reflecting a plurality of probabilities, wherein each probability of the plurality of probabilities characterizes a hypothesis associating an image of the training dataset with a class associated with the image by the training dataset, wherein the loss function further reflects a plurality of distances, wherein each distance of the plurality of distances is computed in the space of image features between a feature vector representing an image of the training dataset and a center of a class associated with the image by the training dataset; and adjust one or more parameters of the neural network based on the value of the loss function. - View Dependent Claims (12, 13, 14, 15)
-
-
16. A computer-readable non-transitory storage medium comprising executable instructions that, when executed by a computer system, cause the computer system to:
-
receive a training dataset comprising a plurality of images, wherein each image of the training dataset is associated with an identifier of a class of a set of classes; compute, by a neural network, a plurality of feature vectors, wherein each feature vector of the plurality of feature vectors represents an image of the training dataset in a space of image features; compute, for the training dataset, a value of a loss function reflecting a plurality of probabilities, wherein each probability of the plurality of probabilities characterizes a hypothesis associating an image of the training dataset with a class associated with the image by the training dataset, wherein the loss function further reflects a plurality of distances, wherein each distance of the plurality of distances is computed in the space of image features between a feature vector representing an image of the training dataset and a center of a class associated with the image by the training dataset; and adjust one or more parameters of the neural network based on the value of the loss function. - View Dependent Claims (17, 18, 19, 20)
-
Specification