INTERACTIVE CONCEPT EDITING IN COMPUTER-HUMAN INTERACTIVE LEARNING
First Claim
1. One or more computer-readable media having embodied thereon computer-usable instructions that, when executed, facilitate a method of interactively generating dictionaries for machine learning, the method comprising:
- presenting a user interface for generating a dictionary that includes a list of one or both of words or n-grams that define a concept usable as a feature for training a classifier;
presenting on the user interface a positive-example field configured to receive user-input words or n-grams that are positive examples of the concept, wherein the positive examples are received from one or more ofA) a typed entry orB) a selection of one or more suggested words or n-grams fromone or more suggestion-set fields; and
presenting on the user interface the one or more suggestion-set fields configured to display one or more system-generated lists that contain suggested words or n-grams that are selectable for inclusion in the positive-example field.
3 Assignments
0 Petitions
Accused Products
Abstract
A collection of data that is extremely large can be difficult to search and/or analyze. Relevance may be dramatically improved by automatically classifying queries and web pages in useful categories, and using these classification scores as relevance features. A thorough approach may require building a large number of classifiers, corresponding to the various types of information, activities, and products. Creation of classifiers and schematizers is provided on large data sets. Exercising the classifiers and schematizers on hundreds of millions of items may expose value that is inherent to the data by adding usable meta-data. Some aspects include active labeling exploration, automatic regularization and cold start, scaling with the number of items and the number of classifiers, active featuring, and segmentation and schematization.
41 Citations
20 Claims
-
1. One or more computer-readable media having embodied thereon computer-usable instructions that, when executed, facilitate a method of interactively generating dictionaries for machine learning, the method comprising:
-
presenting a user interface for generating a dictionary that includes a list of one or both of words or n-grams that define a concept usable as a feature for training a classifier; presenting on the user interface a positive-example field configured to receive user-input words or n-grams that are positive examples of the concept, wherein the positive examples are received from one or more of A) a typed entry or B) a selection of one or more suggested words or n-grams from one or more suggestion-set fields; and presenting on the user interface the one or more suggestion-set fields configured to display one or more system-generated lists that contain suggested words or n-grams that are selectable for inclusion in the positive-example field. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. One or more computer-readable media having embodied thereon computer-usable instructions that, when executed, facilitate a method of interactively generating dictionaries for machine learning, the method comprising:
-
presenting a user interface for generating dictionaries, wherein a dictionary includes a list of n-grams that define a concept, and wherein the dictionary is usable as a feature for training a classifier; presenting, on the user interface, a positive-example input field configured to receive user-input n-grams that are positive examples of the concept; presenting on the user interface one or more suggestion-set fields configured to display one or more system-generated lists of suggested n-grams; receiving one or more user-input n-grams that are positive examples of the concept; generating a first set of suggested n-grams that represent a first generalized concept based on the one or more user-input positive examples; and presenting the first set of suggested n-grams in a first suggestion-set field on the user interface. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16, 17)
-
-
18. One or more computer-readable media having embodied thereon computer-usable instructions that, when executed, facilitate a method of interactively generating dictionaries for machine learning, the method comprising:
-
generating an interface for editing dictionaries, wherein a dictionary includes a list of words that define a concept usable as a feature for training a classifier; presenting on the interface a positive-example input field configured to receive user-input words that are positive examples of the concept; presenting on the user interface a negative-example input field configured to receive user-input words that are negative examples of the concept; presenting on the user interface a suggestion-set field configured to display a system-generated list of suggested words, wherein the list of suggested words represents a generalized concept based on words in one or both of the positive-example input field or the negative-example field; receiving one or more user-input words that are positive or negative examples of the concept, wherein the one or more user-input words are received from one or more of A) a typed entry, or B) a selection of one or more suggested words from the suggestion-set field; generating a set of suggested words that represent a generalized concept based on the one or more user-input positive or negative examples; presenting the set of suggested words in the suggestion-set field on the user interface; receiving a user selection of a first suggested word from the set of suggested words; including the first suggested word in the positive-example field or the negative-example field; refining the set of suggested words based at least on the first suggested word that was included in the positive-example field or the negative-example field; presenting the refined set of suggested words in the first suggestion-set field; receiving an indication that the user has finished editing the dictionary; and saving the contents of the positive-example input field in a dictionary. - View Dependent Claims (19, 20)
-
Specification