ENHANCED MAX MARGIN LEARNING ON MULTIMODAL DATA MINING IN A MULTIMEDIA DATABASE
First Claim
1. A method for multimodal data mining, comprising:
- defining a multimodal data set comprising image information;
representing image information of a data object as a set of feature vectors in a feature space;
clustering in the feature space to group similar features;
associating a non-image representation with a respective image data object based on the clustering;
determining a joint feature representation of a respective data object as a mathematical weighted combination of a set of components of the joint feature representation;
optimizing a weighting for a plurality of components of the mathematical weighted combination with respect to a prediction error between a predicted classification and a training classification; and
employing the mathematical weighted combination for automatically classifying a new data object.
2 Assignments
0 Petitions
Accused Products
Abstract
Multimodal data mining in a multimedia database is addressed as a structured prediction problem, wherein mapping from input to the structured and interdependent output variables is learned. A system and method for multimodal data mining is provided, comprising defining a multimodal data set comprising image information; representing image information of a data object as a set of feature vectors in a feature space; clustering in the feature space to group similar features; associating a non-image representation with a respective image data object based on the clustering; determining a joint feature representation of a respective data object as a mathematical weighted combination of a set of components of the joint feature representation; optimizing a weighting for a plurality of components of the mathematical weighted combination with respect to a prediction error between a predicted classification and a training classification; and employing the mathematical weighted combination for automatically classifying a new data object.
-
Citations
16 Claims
-
1. A method for multimodal data mining, comprising:
-
defining a multimodal data set comprising image information; representing image information of a data object as a set of feature vectors in a feature space; clustering in the feature space to group similar features; associating a non-image representation with a respective image data object based on the clustering; determining a joint feature representation of a respective data object as a mathematical weighted combination of a set of components of the joint feature representation; optimizing a weighting for a plurality of components of the mathematical weighted combination with respect to a prediction error between a predicted classification and a training classification; and employing the mathematical weighted combination for automatically classifying a new data object. - View Dependent Claims (2)
-
-
3. A system for multimodal data mining, comprising:
-
an input adapted to receive a multimodal data set comprising image information; an automated processor, adapted for; representing image information of a data object as a set of feature vectors in a feature space; clustering in the feature space to group similar features; associating a non-image representation with a respective image data object based on the clustering; determining a joint feature representation of a respective data object as a mathematical weighted combination of a set of components of the joint feature representation; optimizing a weighting for a plurality of components of the mathematical weighted combination with respect to a prediction error between a predicted classification and a training classification; and an output from the automated processor, adapted to communicate a classification of a new data object based on the mathematical weighted combination. - View Dependent Claims (4)
-
-
5. A data processing method, comprising:
-
(a) receiving a dual variable set μ
comprising labeled examples;(b) decomposing μ
into two partitions, μ
B and μ
N;(c) iteratively solving with an automated data processor, using the variables in μ
B, for each member of the set;
min ½
μ
TDμ
−
μ
TS
s.t. Aμ
C
μ
B′
0,μ
N=0while there exists μ
i,y ε
μ
B such that μ
i,y =0, move that variable to partition μ
N;while there exists μ
i,y ε
μ
N satisfying the condition
∃
i, Σy μ
i,y <
C
∃
μ
i,y ε
μ
N α
TΦ
i,yi ,y −
l(y ,yi)<
0′move that variable to partition μ
B and if no such μ
i,y ε
μ
N exists, ceasing an iteration; and(d) outputting at least one of;
(a) the members of the partitions and (b) an identification of the members of the partitions. - View Dependent Claims (6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16)
-
Specification