Method, program and apparatus for generating two-class classification/prediction model
First Claim
1. A program for generating a two-class classification/prediction model, said program causing a computer to perform a process comprising:
- a) preparing as training data a sample set that contains a plurality of samples belonging to a first class and a plurality of samples belonging to a second class;
b) generating, by performing discriminant analysis on said sample set, a first discriminant function having a high classification characteristic for said first class and a second discriminant function having a high classification characteristic for said second class;
c) by classifying said sample set using said first and second discriminant functions, isolating any sample whose classification results by said first and second discriminant functions do not match;
d) repeating said b) and c) by using a new sample set which is formed by grouping together any sample isolated in said c); and
e) causing said d) to stop when the number of samples each of whose classification results do not match in said c) has decreased to or below a predetermined value or when the number of repetitions or processing time for repetitions has reached or exceeded a predetermined value.
1 Assignment
0 Petitions
Accused Products
Abstract
A method includes: a) preparing as training data a sample set that contains a plurality of samples belonging to a first class and a plurality of samples belonging to a second class; b) generating, by performing discriminant analysis on the sample set, a first discriminant function having a high classification characteristic for the first class and a second discriminant function having a high classification characteristic for the second class; c) by classifying the sample set using the first and second discriminant functions, isolating any sample whose classification results by the first and second discriminant functions do not match; d) forming a new sample set by grouping together any sample thus isolated, and repeating b) and c) by using the new sample set; and e) causing d) to stop when the number of samples each of whose classification results do not match in c) has decreased to or below a predetermined value.
15 Citations
20 Claims
-
1. A program for generating a two-class classification/prediction model, said program causing a computer to perform a process comprising:
-
a) preparing as training data a sample set that contains a plurality of samples belonging to a first class and a plurality of samples belonging to a second class; b) generating, by performing discriminant analysis on said sample set, a first discriminant function having a high classification characteristic for said first class and a second discriminant function having a high classification characteristic for said second class; c) by classifying said sample set using said first and second discriminant functions, isolating any sample whose classification results by said first and second discriminant functions do not match; d) repeating said b) and c) by using a new sample set which is formed by grouping together any sample isolated in said c); and e) causing said d) to stop when the number of samples each of whose classification results do not match in said c) has decreased to or below a predetermined value or when the number of repetitions or processing time for repetitions has reached or exceeded a predetermined value. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A method for generating a two-class classification/prediction model, comprising:
-
a) preparing as training data a sample set that contains a plurality of samples belonging to a first class and a plurality of samples belonging to a second class; b) generating, by performing discriminant analysis on said sample set, a first discriminant function having a high classification characteristic for said first class and a second discriminant function having a high classification characteristic for said second class; c) by classifying said sample set using said first and second discriminant functions, isolating any sample whose classification results by said first and second discriminant functions do not match; d) repeating said b) and c) by using a new sample set which is formed by grouping together any sample isolated in said c); and e) causing said d) to stop when the number of samples each of whose classification results do not match in said c) has decreased to or below a predetermined value or when the number of repetitions or processing time for repetitions has reached or exceeded a predetermined value, and wherein said first and second discriminant functions determined in said b) are set up as a classification/prediction model for samples of unknown classes. - View Dependent Claims (7, 8, 9, 10)
-
-
11. A method for generating a chemical toxicity prediction model, comprising:
-
a) preparing as training data a sample set that contains a plurality of chemicals belonging to a first class and a plurality of chemicals belonging to a second class, wherein said chemicals in said first class have a specific kind of toxicity and said chemicals in said second class do not have said toxicity; b) generating, by performing discriminant analysis on said sample set, a first discriminant function having a high classification characteristic for said first class and a second discriminant function having a high classification characteristic for said second class; c) by classifying said sample set using said first and second discriminant functions, isolating any chemical whose classification results by said first and second discriminant functions do not match; d) repeating said b) and c) by using a new sample set which is formed by grouping together any chemical isolated in said c); and e) causing said d) to stop when the number of chemicals each of whose classification results do not match in said c) has decreased to or below a predetermined value or when the number of repetitions or processing time for repetitions has reached or exceeded a predetermined value, and wherein said first and second discriminant functions determined in said b) after completion of said e) are set up as a classification/prediction model for chemicals of unknown classes. - View Dependent Claims (12, 13, 14)
-
-
15. An apparatus for generating a two-class classification/prediction model, comprising:
-
an input device which enters as training data a sample set that contains a plurality of samples belonging to a first class and a plurality of samples belonging to a second class; a discriminant function generating device which generates, by performing discriminant analysis on said sample set, a first discriminant function having a high classification characteristic for said first class and a second discriminant function having a high classification characteristic for said second class; a classification result comparing device which classifies said sample set by using said first and second discriminant functions, and isolates any sample whose classification results by said first and second discriminant functions do not match; and a control device which forms a new sample set by grouping together any sample isolated by said classification result comparing device, and causes said discriminant function generating device and said classification result comparing device to operate repeatedly, and wherein said control device causes said repeating operation to stop when the number of samples each of whose classification results do not match in said classification result comparing device has decreased to or below a predetermined value or when the number of repetitions or processing time for repetitions has reached or exceeded a predetermined value. - View Dependent Claims (16)
-
-
17. An apparatus for generating a chemical toxicity prediction model, comprising:
-
an input device which enters as training data a sample set that contains a plurality of chemicals belonging to a first class and a plurality of chemicals belonging to a second class, wherein said chemicals in said first class have a specific kind of toxicity and said chemicals in said second class do not have said toxicity; a discriminant function generating device which generates, by performing discriminant analysis on said sample set, a first discriminant function having a high classification characteristic for said first class and a second discriminant function having a high classification characteristic for said second class; a classification result comparing device which classifies said sample set by using said first and second discriminant functions, and isolates any chemical whose classification results by said first and second discriminant functions do not match; and a control device which forms a new sample set by grouping together any chemical isolated by said classification result comparing device, and causes said discriminant function generating device and said classification result comparing device to operate repeatedly, and wherein said control device causes said repeating operation to stop when the number of chemicals each of whose classification results do not match in said classification result comparing device has decreased to or below a predetermined value or when the number of repetitions or processing time for repetitions has reached or exceeded a predetermined value. - View Dependent Claims (18, 19)
-
-
20. A classification/prediction method for samples of unknown classes, comprising:
generating a plurality of discriminant function sets each containing a first discriminant function and a second discriminant function, wherein said plurality of discriminant function sets are each generated by carrying out a) preparing as training data a sample set that contains a plurality of samples belonging to a first class and a plurality of samples belonging to a second class, b) generating, by performing discriminant analysis on said sample set, said first discriminant function which has a high classification characteristic for said first class and said second discriminant function which has a high classification characteristic for said second class; c) by classifying said sample set using said first and second discriminant functions, isolating any sample whose classification results by said first and second discriminant functions do not match; d) repeating said b) and c)by using a new sample set which is formed by grouping together any sample isolated in said c); and e) causing said d) to stop when the number of samples each of whose classification results do not match in said c) has decreased to or below a predetermined value or when the number of repetitions or processing time for repetitions has reached or exceeded a predetermined value; obtaining classification results by applying said first and second discriminant functions contained in a first generated one of said plurality of discriminant function sets to said samples of unknown classes; and sequentially applying said discriminant function sets in order of generation to said samples of unknown classes until said obtained classification results match, wherein the class indicated by said classification results when said classification results match is predicted to be the class to which a corresponding one of said samples of unknown classes belongs.
Specification