Data model generation using generative adversarial networks
First Claim
1. A method for generating data models, comprising:
- receiving, by a model optimizer from an interface, a data model generation request;
provisioning, by the model optimizer, computing resources with a data model;
generating, by a dataset generator, a synthetic dataset for training the data model using a generative network of a generative adversarial network, the generative network trained to generate output data differing at least a predetermined amount from a reference dataset according to a similarity metric;
generating, by the model optimizer, at least one of a statistical correlation score between the synthetic dataset and the reference dataset, a data similarity score between the synthetic dataset and the reference dataset, and a data quality score for the synthetic dataset;
training, by the computing resources, the data model using the synthetic dataset, wherein training the data model using the synthetic dataset comprises determining that the synthetic dataset satisfies a criterion concerning the at least one of the statistical correlation score between the synthetic dataset and the reference dataset, the data similarity score between the synthetic dataset and the reference dataset, and the data quality score for the synthetic dataset;
evaluating, by the model optimizer, performance criteria of the data model;
storing, by the model optimizer in a model storage, the data model and metadata of the data model based on the evaluation of the performance criteria of the data model; and
processing production data using the data model.
1 Assignment
0 Petitions
Accused Products
Abstract
Methods for generating data models using a generative adversarial network can begin by receiving a data model generation request by a model optimizer from an interface. The model optimizer can provision computing resources with a data model. As a further step, a synthetic dataset for training the data model can be generated using a generative network of a generative adversarial network, the generative network trained to generate output data differing at least a predetermined amount from a reference dataset according to a similarity metric. The computing resources can train the data model using the synthetic dataset. The model optimizer can evaluate performance criteria of the data model and, based on the evaluation of the performance criteria of the data model, store the data model and metadata of the data model in a model storage. The data model can then be used to process production data.
-
Citations
19 Claims
-
1. A method for generating data models, comprising:
-
receiving, by a model optimizer from an interface, a data model generation request; provisioning, by the model optimizer, computing resources with a data model; generating, by a dataset generator, a synthetic dataset for training the data model using a generative network of a generative adversarial network, the generative network trained to generate output data differing at least a predetermined amount from a reference dataset according to a similarity metric; generating, by the model optimizer, at least one of a statistical correlation score between the synthetic dataset and the reference dataset, a data similarity score between the synthetic dataset and the reference dataset, and a data quality score for the synthetic dataset; training, by the computing resources, the data model using the synthetic dataset, wherein training the data model using the synthetic dataset comprises determining that the synthetic dataset satisfies a criterion concerning the at least one of the statistical correlation score between the synthetic dataset and the reference dataset, the data similarity score between the synthetic dataset and the reference dataset, and the data quality score for the synthetic dataset; evaluating, by the model optimizer, performance criteria of the data model; storing, by the model optimizer in a model storage, the data model and metadata of the data model based on the evaluation of the performance criteria of the data model; and processing production data using the data model. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A cloud computing system for generating data models, comprising:
-
at least one processor; and at least one non-transitory memory storing instructions that, when executed by the at least one processor, cause the cloud computing system to perform operations comprising; receiving, by a model optimizer from an interface, a data model generation request; provisioning, by the model optimizer, computing resources with a data model; generating, by a generative network of a generative adversarial network, a synthetic dataset for training the data model; generating, by the model optimizer, at least one of a statistical correlation score between the synthetic dataset and the reference dataset, a data similarity score between the synthetic dataset and the reference dataset, and a data quality score for the synthetic dataset; training, by the computing resources, the data model using the synthetic dataset, wherein training the data model using the synthetic dataset comprises determining that the synthetic dataset satisfies a criterion concerning the at least one of the statistical correlation score between the synthetic dataset and the reference dataset, the data similarity score between the synthetic dataset and the reference dataset, and the data quality score for the synthetic dataset; evaluating, by the model optimizer, performance criteria of the data model; storing, by the model optimizer in a model storage, the data model and metadata of the data model based on the evaluation of the performance criteria of the data model; and processing production data using the data model. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19)
-
Specification