Structured labeling to facilitate concept evolution in machine learning
First Claim
Patent Images
1. A computer-implemented method for multimedia content labeling, the method comprising:
- providing a plurality of multimedia content items for presentation at a graphical user interface (“
GUI”
), the plurality of multimedia content items comprising at least one of web pages or images;
providing a concept prompt for presentation at the GUI, the concept prompt identifying a concept and prompting a user to indicate whether the plurality of multimedia content items are associated with the concept;
providing a plurality of categories for presentation at the GUI, the plurality of categories having labels indicating user responses to the concept prompt;
receiving an indication of a first user input, the first user input indicating an association between a first multimedia content item of the plurality of multimedia content items and a first category of the plurality of categories, the first category having a first label indicating a user response to the concept prompt for the first multimedia content item;
forming a first group that is presented within the first category, the first group including the first multimedia content item;
receiving a user-supplied tag that describes the first group and providing the user-supplied tag for presentation in association with the first group;
determining that a second multimedia content item is uncategorized and, based on the determination that the second multimedia content item is uncategorized and on at least one of an item-to-group similarity or an item-to-item similarity, generating a suggestion that the second multimedia content item be associated with the first group within the first category, wherein the at least one of the item-to-group similarity or the item-to-item similarity is determined based on at least one of a shortest link, a cosine similarity metric, or a term frequency-inverse document frequency;
providing the suggestion for presentation at the GUI; and
training a machine-learning algorithm with data indicating the association between the first multimedia content item and the first category to provide an improved machine-learning algorithm.
2 Assignments
0 Petitions
Accused Products
Abstract
A system, method, and media are provided for generating a structured labeling graphical user interface. The user interface receives user input that associates multimedia content with categories. The user input may include user-supplied tags that further define the category for the multimedia content. The user-supplied tags are rendered proximate to the categories. In turn, a database logs user events to store, among other things, the categories, the user-supplied tags, time associated with completing the user-supplied tags, and time for associating multimedia content with the categories or tags.
118 Citations
20 Claims
-
1. A computer-implemented method for multimedia content labeling, the method comprising:
-
providing a plurality of multimedia content items for presentation at a graphical user interface (“
GUI”
), the plurality of multimedia content items comprising at least one of web pages or images;providing a concept prompt for presentation at the GUI, the concept prompt identifying a concept and prompting a user to indicate whether the plurality of multimedia content items are associated with the concept; providing a plurality of categories for presentation at the GUI, the plurality of categories having labels indicating user responses to the concept prompt; receiving an indication of a first user input, the first user input indicating an association between a first multimedia content item of the plurality of multimedia content items and a first category of the plurality of categories, the first category having a first label indicating a user response to the concept prompt for the first multimedia content item; forming a first group that is presented within the first category, the first group including the first multimedia content item; receiving a user-supplied tag that describes the first group and providing the user-supplied tag for presentation in association with the first group; determining that a second multimedia content item is uncategorized and, based on the determination that the second multimedia content item is uncategorized and on at least one of an item-to-group similarity or an item-to-item similarity, generating a suggestion that the second multimedia content item be associated with the first group within the first category, wherein the at least one of the item-to-group similarity or the item-to-item similarity is determined based on at least one of a shortest link, a cosine similarity metric, or a term frequency-inverse document frequency; providing the suggestion for presentation at the GUI; and training a machine-learning algorithm with data indicating the association between the first multimedia content item and the first category to provide an improved machine-learning algorithm. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. One or more computer storage devices having computer-executable instructions embodied thereon that, when executed, perform a method of generating a structured labeling graphical user interface (“
- GUI”
), the method comprising;providing a concept prompt for presentation at a GUI, the concept prompt identifying a concept and prompting a user to indicate whether multimedia content items are associated with the concept, the GUI having at least two portions, a first portion for displaying the multimedia content items and a second portion for assigning the multimedia content items to one of at least two categories, the at least two categories having labels indicating user responses to the concept prompt, and each of the at least two categories including a plurality of groups that are presented within the respective category; based on a plurality of inputs provided by the user, associating each of the multimedia content items displayed in the first portion with a group of the plurality of groups included within one of the at least two categories in the second portion of the GUI; analyzing the multimedia content items associated with a first group of the plurality of groups, wherein analyzing the multimedia content items comprises determining search terms associated with the multimedia content items; based on the analysis, automatically generating a summary for the first group, the summary comprising at least one of the search terms as a suggested description for the multimedia content items included in the first group; providing the summary for output; based on at least one of an item-to-group similarity or an item-to-item similarity, providing a suggestion to associate an uncategorized multimedia content item with a particular group, wherein the at least one of the item-to-group similarity or the item-to-item similarity is determined based on at least one of a shortest link, a cosine similarity metric, or a term frequency-inverse document frequency; and training a machine-learning algorithm with data indicating the associations between the multimedia content items and the plurality of groups. - View Dependent Claims (14, 15, 19, 20)
- GUI”
-
16. A computer system comprising:
-
one or more processors; and computer storage memory having stored thereon computer-executable instructions that, when executed by the one or more processors, implement a method comprising; providing a plurality of multimedia content items for presentation at a graphical user interface (“
GUI”
);providing a concept prompt for presentation at the GUI, the concept prompt identifying a concept and prompting a user to indicate whether the plurality of multimedia content items are associated with the concept; providing a plurality of categories for presentation at the GUI, the plurality of categories having labels indicating user responses to the concept prompt; receiving an indication of a first user input, the first user input indicating an association between a first multimedia content item of the plurality of multimedia content items and a first group that is presented within a first category of the plurality of categories, the first category having a first label indicating a user response to the concept prompt for the first multimedia content item; providing a user-supplied tag for presentation in association with the first group; based on at least one of an item-to-group similarity or an item-to-item similarity, generating a suggestion that a second multimedia content item be associated with the first group, wherein the at least one of the item-to-group similarity or the item-to-item similarity is determined based on at least one of a shortest link, a cosine similarity metric, or a term frequency-inverse document frequency; providing the suggestion for presentation at the GUI; and training a machine-learning algorithm with data indicating the association between the first multimedia content item and the first category. - View Dependent Claims (17, 18)
-
Specification