Adaptive data classification for data mining
First Claim
1. A computer-implemented method comprising:
- using a programmed digital computer to automatically perform steps comprising;
receiving input from a user defining a classification;
determining a sample set of unstructured data based on the classification defined by the user;
analyzing the sample set of unstructured data to determine a classification mapping that maps attributes of the sample set of unstructured data to class labels for the classification;
indexing attributes of a set of data objects in a second set of unstructured data;
mapping one or more data objects in the set of data objects to the class label based on the classification mapping that maps indexed attributes of the one or more data objects to the class label;
receiving an ad-hoc query from the one or more users;
providing relevant results from the unstructured data based on the classification mapping of the indexed attributes of the one or more data objects and the query;
determining feedback based on the user'"'"'s response to an interaction with results determined using the class label, wherein the user'"'"'s response comprises explicit feedback received from one or more users who are viewing results determined using the classification mapping, and the explicit feedback comprises an indication that a result is misclassified in the classification; and
adapting the classification mapping based on an analysis of the misclassified result received as feedback by adjusting the sample set used to determine the classification mapping to the class label.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and system for adaptive classification during information retrieval from unstructured data are provided. The method includes receiving input from a user defining a classification. A sample set of unstructured data based on the user defined classification defined is determined. The sample set of unstructured data is analyzed to determine a classification mapping that maps attributes of the sample set of unstructured data to class labels for the classification. The attributes of a set of data objects in a second set of unstructured data are indexed and one or more data objects in the set of data objects are mapped to the class label based on the classification mapping. Feedback based on the user'"'"'s response to an interaction with results is determined using the class label. Finally, adaptive classification mapping is performed based on analysis of feedback by adjusting the sample set of data objects.
-
Citations
14 Claims
-
1. A computer-implemented method comprising:
using a programmed digital computer to automatically perform steps comprising; receiving input from a user defining a classification; determining a sample set of unstructured data based on the classification defined by the user; analyzing the sample set of unstructured data to determine a classification mapping that maps attributes of the sample set of unstructured data to class labels for the classification; indexing attributes of a set of data objects in a second set of unstructured data; mapping one or more data objects in the set of data objects to the class label based on the classification mapping that maps indexed attributes of the one or more data objects to the class label; receiving an ad-hoc query from the one or more users; providing relevant results from the unstructured data based on the classification mapping of the indexed attributes of the one or more data objects and the query; determining feedback based on the user'"'"'s response to an interaction with results determined using the class label, wherein the user'"'"'s response comprises explicit feedback received from one or more users who are viewing results determined using the classification mapping, and the explicit feedback comprises an indication that a result is misclassified in the classification; and adapting the classification mapping based on an analysis of the misclassified result received as feedback by adjusting the sample set used to determine the classification mapping to the class label. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
8. An apparatus comprising:
-
one or more processors; and logic encoded in one or more tangible media for execution by the one or more processors and when executed operable to; receive input from a user defining a classification; determine a sample set of unstructured data based on the classification defined by the user; analyze the sample set of unstructured data to determine a classification mapping that maps attributes of the sample set of unstructured data to class labels for the classification; index attributes of a set of data objects in a second set of unstructured data; map one or more data objects in the set of data objects to the class label based on the classification mapping that maps indexed attributes of the one or more data objects to the class label; receive an ad-hoc query from the one or more users; provide the results from the relevant unstructured data based on the classification mapping of the indexed attributes of the one or more data objects and the query; determine feedback based on the user'"'"'s response to an interaction with results determined using the class label, wherein the feedback comprises explicit feedback received from one or more users who are viewing results determined using the classification mapping, and the explicit feedback comprises an indication that a result is misclassified in the classification; and adapt the classification mapping based on an analysis of the misclassified result received as feedback by adjusting the sample set used to determine the classification mapping to the class label. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
Specification