Reducing the size of a training set for classification
First Claim
1. A method for reducing the size of a design data set, comprising:
- computing a decision boundary which separates a first group of data patterns in a training data set from a second group of data patterns in the training data set;
for each data pattern in the training data set, determining if removing the data pattern from the training data set substantially affects the resulting decision boundary; and
if so, marking the data pattern as a key pattern; and
removing all data patterns that are not marked as key patterns to produce a reduced training data set which represents the decision boundary.
2 Assignments
0 Petitions
Accused Products
Abstract
A system that reduces the size of a design data set. During this design data set reduction operation, the system computes a decision boundary which separates a first group of data patterns in a training data set from a second group of data patterns in the training data set. For each data pattern in the training data set, the system determines if removing the data pattern from the training data set substantially affects the resulting decision boundary. If so, the system marks the data pattern as a key pattern. The system then removes all data patterns that are not marked as key patterns to produce a reduced training data set which represents the decision boundary.
-
Citations
19 Claims
-
1. A method for reducing the size of a design data set, comprising:
-
computing a decision boundary which separates a first group of data patterns in a training data set from a second group of data patterns in the training data set;
for each data pattern in the training data set, determining if removing the data pattern from the training data set substantially affects the resulting decision boundary; and
if so, marking the data pattern as a key pattern; and
removing all data patterns that are not marked as key patterns to produce a reduced training data set which represents the decision boundary. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A computer-readable storage medium storing instructions that when executed by a computer cause the computer to perform a method for reducing the size of a design data set, the method comprising:
-
computing a decision boundary which separates a first group of data patterns in a training data set from a second group of data patterns in the training data set;
for each data pattern in the training data set, determining if removing the data pattern from the training data set substantially affects the resulting decision boundary; and
if so, marking the data pattern as a key pattern; and
removing all data patterns that are not marked as key patterns to produce a reduced training data set which represents the decision boundary. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. An apparatus that reduces the size of a design data set, comprising:
-
a computing mechanism configured to compute a decision boundary which separates a first group of data patterns in a training data set from a second group of data patterns in the training data set;
a pattern-reduction mechanism configured to;
for each data pattern in the training data set, to determine if removing the data pattern from the training data set substantially affects the resulting decision boundary; and
if so, to mark the data pattern as a key pattern; and
toremove all data patterns that are not marked as key patterns to produce a reduced training data set which represents the decision boundary. - View Dependent Claims (14, 15, 16, 17, 18, 19)
-
Specification