AUTOMATED CLASSIFICATION ALGORITHM COMPRISING AT LEAST ONE INPUT-INVARIANT PART
First Claim
1. A method for performing classification on classifiable electronic data, the method comprising:
- obtaining the classifiable electronic data via a communication network; and
identifying classifications of a plurality of data elements in the classifiable electronic data using a classification algorithm comprising at least one input-invariant part and at least one classification part, the at least one classification part incorporating user input concerning classification of at least one data element of the plurality of data elements.
2 Assignments
0 Petitions
Accused Products
Abstract
A classification algorithm is separated into one or more input-invariant parts and one or more input-dependent classification parts. The input-invariant parts of the classification algorithm capture the underlying and unchanging relationships between the plurality of data elements being operated upon by the classification algorithm, whereas the one or more classification parts embody the probabilistic labeling of the data elements according to the various classifications. For any given iteration, a user'"'"'s input is used to modify at least one classification part of the algorithm. Recalculated classification parts (i.e., updated classification results) are determined based on computationally simple combinations of the one or more modified classification parts and the one or more input-invariant parts. Preferably, a graphical user interface is used to solicit user input. In this manner, wait times between user feedback iterations can be dramatically reduced, thereby making application of active learning to classification tasks a practical reality.
-
Citations
20 Claims
-
1. A method for performing classification on classifiable electronic data, the method comprising:
-
obtaining the classifiable electronic data via a communication network; and identifying classifications of a plurality of data elements in the classifiable electronic data using a classification algorithm comprising at least one input-invariant part and at least one classification part, the at least one classification part incorporating user input concerning classification of at least one data element of the plurality of data elements. - View Dependent Claims (2, 3, 4)
-
-
5. In a system based on a classification algorithm operating upon classifiable electronic data, a method for incorporating user input concerning classification of a plurality of data elements in the classifiable electronic data, the method comprising:
-
calculating a first input-invariant part of the classification algorithm; receiving user input concerning classification of at least one data element of the plurality of data elements; modifying a first classification part of the classification algorithm based on the user input to provide a modified first classification part; and calculating a recalculated first classification part based on the modified first classification part and the first input-invariant part, wherein the recalculated first classification part comprises first classification probabilities for at least a portion of the plurality of data elements. - View Dependent Claims (6, 7, 8, 9, 10)
-
-
11. An apparatus for performing classification based on a classification algorithm operating upon classifiable electronic data, and incorporating user input concerning classification of a plurality of data elements in the classifiable electronic data, the apparatus comprising:
-
an input-invariant calculation component operative to calculate a first input-invariant part of the classification algorithm; a user input component operative to receive user input concerning classification of at least one data element of the plurality of data elements; and a classification calculation component, in communication with the input-invariant calculation component and the user input component, operative to modify a first classification part of the classification algorithm based, on the user input to provide a modified first classification part, and to calculate a recalculated first classification part based on the modified first classification part and the first input-invariant part, wherein the first classification part comprises first classification probabilities for at least a portion of the plurality of data elements. - View Dependent Claims (12, 13, 14, 15, 16)
-
-
17. An apparatus for performing classification based on a classification algorithm operating upon classifiable electronic data, and incorporating user input concerning classification of a plurality of data elements in the classifiable electronic data, the apparatus comprising:
-
at least one processor; and at least one storage device, coupled to the at least one processor, comprising processor-executable instructions that, when executed by the at least one processor, cause the at least one processor to; calculate a first input-invariant part of the classification algorithm; receive user input concerning classification of at least one data element of the plurality of data elements; modify a first classification part of the classification algorithm based on the user input to provide a modified first classification part; and calculate a recalculated first classification part based on the modified first classification part and the first input-invariant part, wherein the recalculated first classification part comprises first classification probabilities for at least a portion of the plurality of data elements. - View Dependent Claims (18, 19, 20)
-
Specification