Failure feedback system for enhancing machine learning accuracy by synthetic data generation
First Claim
1. A non-transitory computer-accessible medium having stored thereon computer-executable instructions, wherein, when a computer arrangement executes the instructions, the computer arrangement is configured to perform procedures comprising:
- (a) receiving at least one dataset, wherein the at least one dataset includes a plurality of data types;
(b) determining if at least one misclassification is generated during a training of at least one model on the at least one dataset by determining if one of the data types is misclassified;
(c) assigning a classification score to each of the data types after the training of the at least one model;
(d) generating at least one synthetic dataset based on the at least one misclassification;
(e) determining if the at least one misclassification is generated during the training of the at least one model on the at least one synthetic dataset based on the assigned classification score being below a particular threshold; and
(f) iterating procedures (d) and (e) until the at least one misclassification is no longer determined during the training of the at least one model.
1 Assignment
0 Petitions
Accused Products
Abstract
An exemplary system, method, and computer-accessible medium can include, for example, (a) receiving a dataset(s), (b) determining if a misclassification(s) is generated during a training of a model(s) on the dataset(s), (c) generating a synthetic dataset(s) based on the misclassification(s), and (d) determining if the misclassification(s) is generated during the training of the model(s) on the synthetic dataset(s). The dataset(s) can include a plurality of data types. The misclassification(s) can be determined by determining if one of the data types is misclassified. The dataset(s) can include an identification of each of the data types in the dataset(s).
6 Citations
14 Claims
-
1. A non-transitory computer-accessible medium having stored thereon computer-executable instructions, wherein, when a computer arrangement executes the instructions, the computer arrangement is configured to perform procedures comprising:
-
(a) receiving at least one dataset, wherein the at least one dataset includes a plurality of data types; (b) determining if at least one misclassification is generated during a training of at least one model on the at least one dataset by determining if one of the data types is misclassified; (c) assigning a classification score to each of the data types after the training of the at least one model; (d) generating at least one synthetic dataset based on the at least one misclassification; (e) determining if the at least one misclassification is generated during the training of the at least one model on the at least one synthetic dataset based on the assigned classification score being below a particular threshold; and (f) iterating procedures (d) and (e) until the at least one misclassification is no longer determined during the training of the at least one model. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A non-transitory computer-accessible medium having stored thereon computer-executable instructions, wherein, when a computer arrangement executes the instructions, the computer arrangement is configured to perform procedures comprising:
-
(a) receiving at least one dataset including an identification of a plurality of data types in the at least one dataset; (b) determining if at least one misclassification of at least one particular data type of the data types is generated during a training of at least one model on the at least one dataset; (c) assign a classification score to each of the data types; (d) generating at least one synthetic dataset based on the misclassified at least one particular data type, wherein the at least one synthetic dataset includes more of the at least one particular data type than the at least one dataset; (e) determining if the at least one misclassification is generated during the training of the at least one model on the at least one synthetic dataset based on the assigned classification score being below a particular threshold; (f) iterating procedures (d) and (e) until the at least one misclassification is no longer determined during the training of the at least one model. - View Dependent Claims (11)
-
-
12. A method, comprising:
-
(a) receiving at least one dataset, wherein the at least one dataset includes a plurality of data types; (b) determining if at least one misclassification is generated during a training of at least one model on the at least one dataset by determining if one of the data types is misclassified; (c) assigning a classification score to each of the data types after the training of the at least one model; (d) sending a request for at least one synthetic dataset based on the misclassification; (e) receiving the at least one synthetic dataset; (f) determining if the at least one misclassification is generated during the training of the at least one model on the at least one synthetic dataset based on the assigned classification score being below a particular threshold; and (g) using a computer hardware arrangement, iterating procedures (d)-(f) until the at least one misclassification is no longer determined during the training of the at least one model. - View Dependent Claims (13, 14)
-
Specification