Data classification method and data classification device
Abstract
A separation surface set storage part stores information defining a plurality of separation surfaces which separate a feature space into at least one known class region, respectively corresponding to at least one known class, and an unknown class region. Each of the at least one known class region is separated from the outside region by more than one of the plurality of separation surfaces, which do not intersect each other. A data classification apparatus determines a classification of classification target data whose inner product in the feature space is calculable by calculating to which region, of the at least one known class region and the unknown class region determined by the information stored in the separation surface set storage part, the classification target data belongs. A method and apparatus for data classification are thereby provided which can simultaneously perform identification and outlier classification with high reliability in the same procedure.
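The region test described in the abstract can be illustrated with a minimal sketch. It rests on several assumptions that are not stated in the source: each known class is bounded by exactly two parallel surfaces (one simple way for surfaces not to intersect), the surfaces are expressed through kernel inner products with stored training points, and a target that falls in no known class region is labeled "unknown". The names `ClassRegion`, `classify`, `linear_kernel`, and `rbf_kernel` are hypothetical and chosen for illustration only.

```python
import math
from dataclasses import dataclass
from typing import Callable, List, Sequence

Vector = Sequence[float]
Kernel = Callable[[Vector, Vector], float]


def linear_kernel(x: Vector, y: Vector) -> float:
    """Plain inner product; stands in for any kernel whose value is calculable."""
    return sum(a * b for a, b in zip(x, y))


def rbf_kernel(x: Vector, y: Vector, gamma: float = 1.0) -> float:
    """Gaussian kernel, an inner product in an implicit feature space."""
    return math.exp(-gamma * sum((a - b) ** 2 for a, b in zip(x, y)))


@dataclass
class ClassRegion:
    """Assumed stored form of one known class region (hypothetical layout).

    The region is the slab lower <= f(x) <= upper, where
    f(x) = sum_i weights[i] * kernel(support[i], x).
    The surfaces f(x) = lower and f(x) = upper never intersect.
    """
    label: str
    support: List[Vector]
    weights: List[float]
    lower: float
    upper: float

    def score(self, x: Vector, kernel: Kernel) -> float:
        return sum(w * kernel(s, x) for w, s in zip(self.weights, self.support))

    def contains(self, x: Vector, kernel: Kernel) -> bool:
        return self.lower <= self.score(x, kernel) <= self.upper


def classify(x: Vector, regions: List[ClassRegion],
             kernel: Kernel = linear_kernel) -> str:
    """Return the label of the first region containing x, else 'unknown'."""
    for region in regions:
        if region.contains(x, kernel):
            return region.label
    return "unknown"


# Two illustrative regions: class A covers 1 <= x[0] <= 2,
# class B covers -2 <= x[0] <= -1 (under the linear kernel).
regions = [
    ClassRegion("A", [(1.0, 0.0)], [1.0], 1.0, 2.0),
    ClassRegion("B", [(1.0, 0.0)], [-1.0], 1.0, 2.0),
]
```

With these illustrative regions, `classify((1.5, 0.0), regions)` falls between the two surfaces of class A, while a point such as `(5.0, 0.0)` lies outside every slab and is reported as unknown. This is how a single procedure can both identify a known class and flag an outlier, though the actual stored representation and tie-breaking between overlapping regions are not specified in the text above.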
17 Claims
1. A data classification apparatus including a computer, the data classification apparatus comprising:
a separation surfaces set storage unit configured to store information defining a plurality of separation surfaces which separate a feature space into at least one known class region respectively corresponding to at least one known class and an unknown class region, wherein each of the at least one known class region is separated from outside region by more than one of the plurality of separation surfaces which do not intersect to each other;
a classification unit configured to determine a classification of a classification target data whose inner product in the feature space is calculable by calculating to which region of the at least one known class region and the unknown class region determined by the information stored in the separation surface set storage unit the classification target data belongs; and
a separation surface set calculation unit configured to calculate the plurality of separation surfaces based on;
a plurality of training data respectively classified into any of the at least one known class and whose inner product in the feature space is calculable; and
a classification of each of the plurality of training data, to store the information which defines the plurality of separation surfaces in the separation surface set storage unit,
wherein the separation surface set calculation unit is configured to calculate the plurality of separation surfaces by setting minimization of a classification error of the plurality of training data, minimization of a complexity of the plurality of separation surfaces, and minimization of an area of each of the at least one known class region as optimization target, and
wherein the optimization target is targeted to solve either one of the following optimization problems;
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10
11. A data classification method comprising:
inputting classification target data whose inner product in a feature space is calculable;
inputting a plurality of separation surfaces which separate the feature space into at least one known class region respectively corresponding to at least one known class and an unknown class region from a separation surface set storage part, wherein each of the at least one known class region is separated from outside region by more than one of the plurality of separation surfaces which do not intersect to each other;
classifying the classification target data by calculating to which region of the at least one known class region and the unknown class region the classification target data belongs;
calculating the plurality of separation surfaces based on;
a plurality of training data respectively classified into any of the at least one known class and whose inner product in the feature space is calculable; and
a classification of each of the plurality of training data, to store the information which defines the plurality of separation surfaces in the separation surface set storage part,
wherein in the calculating, the plurality of separation surfaces are calculated by setting minimization of a classification error of the plurality of training data, minimization of a complexity of the plurality of separation surfaces, and minimization of an area of each of the at least one known class region as optimization target, and
wherein the optimization target is targeted to solve either one of the following optimization problems;
Dependent claims: 12, 13, 14
15. A separation surface set calculation apparatus including a computer, the separation surface set calculation apparatus comprising:
a training data storage device configured to store a plurality of training data whose inner product in a feature space is calculable and respectively classified into any of at least one known class;
a separation surface set calculation device configured to calculate a plurality of separation surfaces which separate the feature space into at least one known class region respectively corresponding to the at least one known class and an unknown class region, based on;
the plurality of training data stored in the training data storage device, and
a classification of each of the plurality of training data,
wherein each of the at least one known class region is separated from outside region by more than one of the plurality of separation surfaces which do not intersect to each other; and
a separation surface set storage device configured to store information defining the plurality of separation surfaces,
wherein the separation surface set calculation device is configured to calculate the plurality of separation surfaces by setting minimization of a classification error of the plurality of training data, minimization of a complexity of the plurality of separation surfaces, and minimization of an area of each of the at least one known class region as optimization target, and
wherein the optimization target is targeted to solve either one of the following optimization problems;
Dependent claims: 16, 17
Specification