User interface for context labeling of multimedia items
First Claim
1. A system for facilitating training of a neural network to associate context information with multimedia items, the system comprising:
- a computer system comprising one or more processors programmed with computer program instructions that, when executed, cause the computer system to;
provide multimedia items to a neural network to cause the neural network to predict one or more labels for the multimedia items, the neural network predicting a label as a corresponding label for first and second multimedia items of the multimedia items;
assign the first and second multimedia items to a group based on the neural network predicting the label as a corresponding label for the first and second multimedia items,generate, based on the predicted label, a task related to the predicted label, the task soliciting a user to indicate one or more of the multimedia items that are relevant to the predicted label;
cause the first multimedia item, the second multimedia item, and the task to be presented together on a user interface at a same time based on the assignment of the first and second multimedia items to the group;
obtain, via the user interface, a user response to the task, the user response comprising (i) a first user indication of the first multimedia item as being relevant to the predicted label and (ii) a second user indication of the second multimedia item as being relevant or not relevant to the predicted label; and
provide, to the neural network, the first and second user indications to cause the neural network to be updated based on the first and second user indications.
1 Assignment
0 Petitions
Accused Products
Abstract
Context information may be associated with multimedia items. One or more multimedia items may be obtained from a repository, the multimedia items(s) including one or more of an image, a video, audio, a text file, combinations thereof, and/or other considerations. Predicted context information may be associated with individual ones of the multimedia items. Labels and/or other context information associated with multimedia item(s) may be stored as metadata associated with the multimedia item(s). A user interface may be configured to display one or more of the obtained multimedia items and the predicted context information associated with the one or more obtained multimedia items. Entry and/or selection may be obtained from one or more users of an addition, removal, correction, and/or confirmation of the predicted context information for individual ones of the multimedia items and/or groups of multimedia items displayed in the user interfaces.
-
Citations
19 Claims
-
1. A system for facilitating training of a neural network to associate context information with multimedia items, the system comprising:
a computer system comprising one or more processors programmed with computer program instructions that, when executed, cause the computer system to; provide multimedia items to a neural network to cause the neural network to predict one or more labels for the multimedia items, the neural network predicting a label as a corresponding label for first and second multimedia items of the multimedia items; assign the first and second multimedia items to a group based on the neural network predicting the label as a corresponding label for the first and second multimedia items, generate, based on the predicted label, a task related to the predicted label, the task soliciting a user to indicate one or more of the multimedia items that are relevant to the predicted label; cause the first multimedia item, the second multimedia item, and the task to be presented together on a user interface at a same time based on the assignment of the first and second multimedia items to the group; obtain, via the user interface, a user response to the task, the user response comprising (i) a first user indication of the first multimedia item as being relevant to the predicted label and (ii) a second user indication of the second multimedia item as being relevant or not relevant to the predicted label; and provide, to the neural network, the first and second user indications to cause the neural network to be updated based on the first and second user indications. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
11. A method for facilitating training of a prediction model to associate context information with multimedia items, the method being implemented by one or more processors executing computer program instructions that, when executed, perform the method, the method comprising:
-
providing multimedia items to a prediction model to cause the prediction model to predict one or more labels for the multimedia items, the prediction model predicting a label as a corresponding label for first and second multimedia items of the multimedia items; assigning the first and second multimedia items to a group based on the prediction model predicting the label as a corresponding label for the first and second multimedia items, generating, based on the predicted label, a task related to the predicted label, the task soliciting a user to indicate one or more of the multimedia items that are relevant to the predicted label; causing the first multimedia item, the second multimedia item, and the task to be presented together on a user interface at a same time based on the assignment of the first and second multimedia items to the group; obtaining, via the user interface, a user response to the task, the user response comprising (i) a first user indication of the first multimedia item as being relevant to the predicted label and (ii) a second user indication of the second multimedia item as being relevant or not relevant to the predicted label; and providing, to the prediction model, the first and second user indications to cause the prediction model to be updated based on the first and second user indications. - View Dependent Claims (12, 13, 14, 15)
-
-
16. A system for facilitating training of a prediction model to associate context information with multimedia items, the system comprising:
a computer system comprising one or more processors programmed with computer program instructions that, when executed, cause the computer system to; provide multimedia items to a prediction model to cause the prediction model to predict one or more labels for the multimedia items, the prediction model predicting a label as a corresponding label for first and second multimedia items of the multimedia items; assign the first and second multimedia items to a group based on the prediction model predicting the label as a corresponding label for the first and second multimedia items generate, based on the predicted label, a task related to the predicted label, the task soliciting a user to indicate one or more of the multimedia items that are relevant to the predicted label; cause the first multimedia item, the second multimedia item, and the task to be presented together on a user interface at a same time based on the assignment of the first and second multimedia items to the group; obtain, via the user interface, a user response to the task, the user response comprising (i) a first user indication of the first multimedia item as being relevant to the predicted label and (ii) a second user indication of the second multimedia item as being relevant or not relevant to the predicted label; and provide, to the prediction model, the first and second user indications to cause the prediction model to be updated based on the first and second user indications. - View Dependent Claims (17, 18, 19)
Specification