Cause of performance discrepancy for training with mixed original/random data
Determine whether the observed behavior—where TFC-TDF-UNet v3 trained with an original-mix sampling probability p=0.5 performs better on training-set original mixes but worse on validation-set original mixes compared to training solely with random mixes—is caused by repeated exposure to the same original mixtures during training, leading to overfitting.
References
We conjecture this is because the $p!=!0.5$ model has seen the same mixes too many times during training, which we explore in more detail in Section \ref{sec:effective}.
— Why does music source separation benefit from cacophony?
(2402.18407 - Jeon et al., 28 Feb 2024) in Section “Training Dynamics Comparison”