Effectiveness of data augmentation on predictive performance
Determine whether class-imbalance data augmentation techniques (such as SMOTE, ADASYN, SVMSMOTE, Random Over-Sampling, Random Under-Sampling, and Cluster Centroids) truly improve the predictive performance of machine-learning classifiers on real-world datasets, rather than merely appearing to help due to evaluation biases.
References
Therefore, there still exists uncertainty as to whether data augmentation can truly help improving prediction performance.
— Experimenting with an Evaluation Framework for Imbalanced Data Learning (EFIDL)
(2301.10888 - Li et al., 2023) in Introduction