Tutorials

Tutorials

The tutorial repository provides walkthroughs for exploring, evaluating, and validating synthetic data. Tutorials can be executed locally by cloning the repository and running the notebooks in Jupyter Lab, or accessed via Google Colab to run in a managed cloud environment. Each tutorial demonstrates a distinct capability of the synthetic data platform.

TutorialColab LinkGitHub Link

Getting started with the SDK

Google Colab iconView Notebook

Validate synthetic data via Train-Synthetic-Test-Real

Google Colab iconView Notebook

Explore the size vs. accuracy trade-off for synthetic data

Google Colab iconView Notebook

Differentially private synthetic data

Google Colab iconView Notebook

Rebalance synthetic datasets for data augmentation

Google Colab iconView Notebook

Conditionally generate synthetic (geo) data

Google Colab iconView Notebook

Explain AI with synthetic data

Google Colab iconView Notebook

Generate fair synthetic data

Google Colab iconView Notebook

Generate synthetic text via a fast LSTM model trained from scratch

Google Colab iconView Notebook

Generate synthetic text via a pre-trained Large Language Model

Google Colab iconView Notebook

Perform multi-table synthesis

Google Colab iconView Notebook

Analyse star-schema correlations

Google Colab iconView Notebook

Develop a fake or real discriminator with Synthetic Data

Google Colab iconView Notebook

Close gaps in your data with Smart Imputation

Google Colab iconView Notebook

Calculate accuracy and privacy metrics for Quality Assurance

Google Colab iconView Notebook

Enrich Sensitive Data with LLMs using Synthetic Replicas

Google Colab iconView Notebook

MOSTLY AI vs. SDV comparison: single-table scenario

Google Colab iconView Notebook

MOSTLY AI vs. SDV comparison: sequential scenario

Google Colab iconView Notebook