Generating a machine learning model for objects based on augmenting the objects with physical properties
First Claim
1. A device, comprising:
- one or more memories; and
one or more processors, communicatively coupled to the one or more memories, to;
receive images of a video stream, three-dimensional models for objects in the images, and physical property data for the objects;
map the three-dimensional models and the physical property data to the objects in the images to generate augmented data sequences with the objects;
apply different physical properties, of the physical property data, to the objects in the augmented data sequences, based on an augmentation policy, to generate augmented data sequences with different applied physical properties;
train a machine learning model based on the images of the video stream to generate a first trained machine learning model;
train the machine learning model, based on the augmented data sequences with the different applied physical properties, to generate a second trained machine learning model;
compare the first trained machine learning model and the second trained machine learning model;
determine whether the second trained machine learning model is optimized based on a result of comparing the first trained machine learning model and the second trained machine learning model; and
provide the second trained machine learning model and the different applied physical properties when the second trained machine learning model is optimized.
1 Assignment
0 Petitions
Accused Products
Abstract
A device receives images of a video stream, models for objects in the images, and physical property data for the objects, and maps the models and the physical property data to the objects in the images to generate augmented data sequences. The device applies different physical properties to the objects in the augmented data sequences to generate augmented data sequences with different applied physical properties, and trains a machine learning (ML) model based on the images to generate a first trained ML model. The device trains the ML model, based on the augmented data sequences with the different applied physical properties, to generate a second trained ML model, and compares the first trained ML model and the second trained ML model. The device determines whether the second trained ML model is optimized based on the comparison, and provides the second trained ML model when optimized.
-
Citations
20 Claims
-
1. A device, comprising:
-
one or more memories; and one or more processors, communicatively coupled to the one or more memories, to; receive images of a video stream, three-dimensional models for objects in the images, and physical property data for the objects; map the three-dimensional models and the physical property data to the objects in the images to generate augmented data sequences with the objects; apply different physical properties, of the physical property data, to the objects in the augmented data sequences, based on an augmentation policy, to generate augmented data sequences with different applied physical properties; train a machine learning model based on the images of the video stream to generate a first trained machine learning model; train the machine learning model, based on the augmented data sequences with the different applied physical properties, to generate a second trained machine learning model; compare the first trained machine learning model and the second trained machine learning model; determine whether the second trained machine learning model is optimized based on a result of comparing the first trained machine learning model and the second trained machine learning model; and provide the second trained machine learning model and the different applied physical properties when the second trained machine learning model is optimized. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A non-transitory computer-readable medium storing instructions, the instructions comprising:
one or more instructions that, when executed by one or more processors, cause the one or more processors to; receive images of a video stream, three-dimensional models for objects in the images, and physical property data for the objects, the images of the video stream including metadata that identifies at least two of; the images of the video stream, the objects in the images, classes associated with the objects, boundary boxes for the images, coordinates associated with the objects in the images, or names of the objects, the three-dimensional models including at least two of; three-dimensional representations of the objects, three-dimensional coordinates associated with the objects, normal vectors associated with the objects, or the names of the objects, the physical property data including at least two of; the names of the objects, information associated with deformations of the objects, information associated with gravities for the objects, information associated with rotations of the objects, information associated with renderings of the objects, or information associated with collisions of the objects; map the three-dimensional models and the physical property data to the objects in the images to generate augmented data sequences with the objects; apply different physical properties, of the physical property data, to the objects in the augmented data sequences to generate augmented data sequences with different applied physical properties; train a machine learning model based on the images of the video stream to generate a first machine learning model; train the machine learning model, based on the augmented data sequences with the different applied physical properties, to generate a second machine learning model; test the first machine learning model and the second machine learning model to generate first test results and second test results, respectively; determine whether the second machine learning model is optimized based on comparing the first test results and the second test results; and utilize the second machine learning model and the different applied physical properties, when the second machine learning model is optimized, to make a prediction. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
15. A method, comprising:
-
receiving, by a device, images of a video stream, three-dimensional models for objects in the images, and physical property data for the objects; associating, by the device, the three-dimensional models and the physical property data with the objects in the images to generate augmented data sequences with the objects; receiving, by the device, an augmentation policy; applying, by the device and based on an augmentation policy, different physical properties, of the physical property data, to the objects in the augmented data sequences in order to generate augmented data sequences with different applied physical properties; training, by the device, a machine learning model based on the images of the video stream to generate a first trained machine learning model; training, by the device, the machine learning model, based on the augmented data sequences with the different applied physical properties, to generate a second trained machine learning model; testing, by the device, the first trained machine learning model and the second trained machine learning model to generate first test results and second test results, respectively; determining, by the device, whether the second trained machine learning model is optimized based on whether the second test results are within a predetermined threshold of the first test results; and providing, by the device, the second trained machine learning model and the different applied physical properties when the second trained machine learning model is optimized. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification