Optimal number of synthetic samples for tabular augmentation
Determine the optimal size $N_{\text{syn}}$ of the synthetic dataset used to augment the training set in classification tasks on tabular data.
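One naive empirical baseline is to treat $N_{\text{syn}}$ as a hyperparameter and choose it by validation accuracy. The sketch below illustrates only that selection loop and does not resolve the open problem. Everything in it is an assumption made for illustration: the toy data from `make_data`, the per-class Gaussian sampler `sample_synthetic` (a placeholder, not TabEBM), the nearest-centroid classifier, and the candidate grid.

```python
import random
import statistics


def make_data(n_per_class, rng):
    """Toy two-class, two-feature data (hypothetical stand-in for a real table)."""
    rows = []
    for label, center in ((0, (0.0, 0.0)), (1, (1.5, 1.5))):
        for _ in range(n_per_class):
            rows.append(([rng.gauss(c, 1.0) for c in center], label))
    return rows


def class_stats(rows):
    """Per-class list of (mean, stdev) pairs, one pair per feature."""
    stats = {}
    for label in {y for _, y in rows}:
        cols = list(zip(*[x for x, y in rows if y == label]))
        stats[label] = [(statistics.mean(c), statistics.stdev(c)) for c in cols]
    return stats


def sample_synthetic(stats, n_syn, rng):
    """Placeholder generator: independent Gaussians per class (NOT TabEBM)."""
    labels = sorted(stats)
    out = []
    for i in range(n_syn):
        label = labels[i % len(labels)]  # cycle through classes for balance
        out.append(([rng.gauss(m, s) for m, s in stats[label]], label))
    return out


def fit_centroids(rows):
    """Nearest-centroid 'training': one mean vector per class."""
    centroids = {}
    for label in {y for _, y in rows}:
        cols = zip(*[x for x, y in rows if y == label])
        centroids[label] = [statistics.mean(c) for c in cols]
    return centroids


def predict(centroids, x):
    """Return the class whose centroid is closest in squared Euclidean distance."""
    return min(
        centroids,
        key=lambda c: sum((a - b) ** 2 for a, b in zip(x, centroids[c])),
    )


def accuracy(centroids, rows):
    return sum(predict(centroids, x) == y for x, y in rows) / len(rows)


def sweep_n_syn(train, val, candidates, seed=0):
    """Score each candidate N_syn on the validation set and return the best one."""
    rng = random.Random(seed)
    stats = class_stats(train)  # generator is fitted on real training rows only
    scores = {}
    for n_syn in candidates:
        augmented = train + sample_synthetic(stats, n_syn, rng)
        scores[n_syn] = accuracy(fit_centroids(augmented), val)
    best = max(scores, key=scores.get)
    return best, scores


rng = random.Random(42)
train, val = make_data(10, rng), make_data(100, rng)
best, scores = sweep_n_syn(train, val, candidates=[0, 20, 100, 500])
print(best, scores)
```

The scores on this toy data carry no significance. A grid search of this kind is also expensive and depends on the dataset and the downstream classifier, which is part of why a principled choice of $N_{\text{syn}}$ is still missing.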
References
The optimal $N_{\text{syn}}$ remains an open problem for tabular data.
— TabEBM: A Tabular Data Augmentation Method with Distinct Class-Specific Energy-Based Models
(2409.16118 - Margeloiu et al., 2024) in Data augmentation setup, Section 3 (Experiments)