Manage Generators
You need to be the owner or have Editor access to a Generator to modify it.
Generator Statuses
The status of a Generator indicates its current state.
Status | Description | Next actions |
---|---|---|
A Generator is created with a default or modified configuration. Training is not started. | • Start training • Clone configuration • Delete | |
Generator training is queued until cluster resources become available. | • Share • Clone configuration • Cancel training • Delete | |
Generator training is ongoing. | • Share • Clone configuration • Cancel training • Delete | |
A trained Generator was cloned to improve its quality with further training. Training not started. | • Start training • Share • Clone configuration • Delete | |
Generator has completed training successfully and can now generate synthetic datasets. | • Generate data • Share • Export to file • Clone configuration • Continue training • Delete | |
Generator training started and then failed. | • Share • Clone configuration • Delete | |
Generator training was canceled while in progress. | • Share • Clone configuration • Delete |
Clone a Generator
If you want to reuse the data and model configuration from an existing Generator, you can clone it. All previously added data as well as the model and training configuration are copied to the new Generator.
Before you start, keep in mind:
- Cloning is available only for generators that use a database or a cloud storage connector as a data source.
- You cannot clone generators with uploaded files because the uploaded data is deleted after the generator training completes.
Clone a Generator on the MOSTLY AI Platform, by following the steps below.
Steps
- Clone a Generator directly from the Generators page.
- From the Generators page, click the kebab menu of a Generator, and select Clone configuration.
- Clone a Generator after you open it.
- From the Generators page, click a Generator to open it.
- Open the actions menu by clicking the Generator name.
- Click Clone configuration.
Result
A new Generator with the name of the original Generator with the prefix Clone - prepended to it is created.
What’s next
You can now use the data and model configuration from the previous generator and make any necessary changes before starting its training.
Continue training
There may be cases where you need to improve the quality of a Generator by resuming its training from the current weights of the model. In such cases, you can use the Continue training option.
Prerequisites
- The Generator you want to improve must have already completed training successfully. You cannot improve Generators with Failed or Canceled status.
- You can only continue training Generators that use a database or a cloud storage bucket as a data source. Uploaded files used for generator training are deleted immediately after training and therefore cannot be used to continue generator training.
- The source data must be available in the associated Connector entity.
- You must have the Editor role of the selected Generator.
You can continue Generator training from the Platform or with the Synthetic Data SDK.
To continue Generator training, follow these steps.
Steps
- Continue the training of a Generator in one of two ways.
- From the Generators page, click the kebab menu of a Generator, and select Continue training.
- From the Generator page, open the action menu by clicking the Generator name, and select Continue training.
The Generator is cloned with the status
CONTINUE
. You can now configure the model and training options. The Generator name is prefixed with Continue training - followed by the name of the original Generator.
- (Optional) Click a table to expand its model and training configuration to adjust as needed.
- Click Continue training to start the Generator training.
Result
MOSTLY AI fetches the original data from the data source and continues the training from the already saved model weights. The Generator status is updated to CONTINUE
.
What’s next
You can use the newly trained generator to generate a new synthetic dataset or probe it for immediately generated samples.
Share a Generator
You can share a generator with members of your organization, see Manage Resources.
Consider the following when sharing a Generator:
- A Generator is a standalone entity. When sharing a Generator, it is not necessary to share the Connector associated with it in order to generate synthetic data.
- Unlisted Generators cannot be cloned for continued training. If your downstream users expect to continue Generator training, they must be granted Editor access to the Generator.
- If you pass the ownership of a Generator to an organization, people in the organization can generate data with it. If members of this organization wish to continue training with the Generator, you must also grant access to the underlying Connector entity as well.
Delete a Generator
A Generator consists of a dedicated generative AI model for each table of the source dataset. Depending on the size of your original data, it can take a long time to train a new one.
- Delete a Generator in one of two ways.
- From the Generators page, click the kebab menu of a generator, and select Delete.
- On the page of the selected Generator, open the action menu by clicking the Generator name, and select Delete.
Result
The Generator has been deleted.