Image annotation based on label consensus
First Claim
1. A computer-implemented method executed by one or more processors, the method comprising:
- receiving, by the one or more processors, an initial data set comprising a plurality of images, each image from the plurality of images being associated with a set of labels, wherein each label in the set of labels is assigned to the image of the plurality of images by an initial model, the initial model being trained for a particular ground-truth label;
for each image in the plurality of images in the initial data set;
providing, by the one or more processors, a list of categories associated with the image based on the set of labels assigned to the image by the initial model, anddetermining, by the one or more processors, a primary category of the image based on the list of categories;
determining, by the one or more processors, a category of the ground-truth label, the category having been specified for the ground-truth label of the initial model;
comparing, by the one or more processors, the category of the ground-truth label to primary categories of respective images in the plurality of images in the initial data set;
selecting, by the one or more processors, a revised data set, wherein the revised data set includes only images of the initial data set that are associated with a respective primary category that is the same as the category of the ground-truth label; and
providing, by the one or more processors, the revised data set to retrain the initial model to provide a revised model.
2 Assignments
0 Petitions
Accused Products
Abstract
Implementations include actions of receiving an initial data set including a plurality of images, each image being associated with a set of labels, wherein each label in the set of labels is assigned to a respective image of the plurality of images by an initial model, the initial model being specific to a ground-truth label; for each image in the plurality of images: providing a list of categories associated with a respective image based on a respective set of labels, and determining a primary category of the respective image based on the list of categories; determining a category of the ground-truth label; and providing a revised data set based on the initial data set by comparing the category to primary categories of respective images in the plurality of images, the initial model being trained based on the revised data set to provide a revised model.
-
Citations
18 Claims
-
1. A computer-implemented method executed by one or more processors, the method comprising:
-
receiving, by the one or more processors, an initial data set comprising a plurality of images, each image from the plurality of images being associated with a set of labels, wherein each label in the set of labels is assigned to the image of the plurality of images by an initial model, the initial model being trained for a particular ground-truth label; for each image in the plurality of images in the initial data set; providing, by the one or more processors, a list of categories associated with the image based on the set of labels assigned to the image by the initial model, and determining, by the one or more processors, a primary category of the image based on the list of categories; determining, by the one or more processors, a category of the ground-truth label, the category having been specified for the ground-truth label of the initial model; comparing, by the one or more processors, the category of the ground-truth label to primary categories of respective images in the plurality of images in the initial data set; selecting, by the one or more processors, a revised data set, wherein the revised data set includes only images of the initial data set that are associated with a respective primary category that is the same as the category of the ground-truth label; and providing, by the one or more processors, the revised data set to retrain the initial model to provide a revised model. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A system comprising:
-
a data store for storing data; and one or more processors configured to interact with the data store, the one or more processors being further configured to perform operations comprising; receiving an initial data set comprising a plurality of images, each image from the plurality of images being associated with a set of labels, wherein each label in the set of labels is assigned to the image of the plurality of images by an initial model, the initial model being trained for a particular ground-truth label; for each image in the plurality of images in the initial data set; providing a list of categories associated with the image based on the set of labels assigned to the image by the initial model, and determining a primary category of the respective image based on the list of categories; determining a category of the ground-truth label, the category having been specified for the ground-truth label of the initial model; comparing the category of the ground-truth label to primary categories of respective images in the plurality of images in the initial data set; selecting a revised data set, wherein the revised data set includes only images of the initial data set that are associated with a respective primary category that is the same as the category of the ground-truth label; and providing the revised data set to retrain the initial model to provide a revised model. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. A computer readable medium storing instructions that, when executed by one or more processors, cause the one or more processors to perform operations comprising:
-
receiving an initial data set comprising a plurality of images, each image from the plurality of images being associated with a set of labels, wherein each label in the set of labels is assigned to the image of the plurality of images by an initial model, the initial model being trained for a particular ground-truth label in the initial data set; for each image in the plurality of images; providing a list of categories associated with the image based on the set of labels assigned to the image by the initial model, and determining a primary category of the respective image based on the list of categories; determining a category of the ground-truth label, the category having been specified for the ground-truth label of the initial model; comparing the category of the ground-truth label to primary categories of respective images in the plurality of images in the initial data set; selecting a revised data set, wherein the revised data set includes only images of the initial data set that are associated with a respective primary category that is the same as the category of the ground-truth label; and providing the revised data set to retrain the initial model to provide a revised model. - View Dependent Claims (14, 15, 16, 17, 18)
-
Specification