Quantify the Effect of Training Data Biases on Downstream Genomics Tasks
Determine the magnitude of the impact that experiment-specific and technology-specific biases and batch effects in functional genomics datasets have on downstream applications, specifically enhancer sequence prediction and genetic variant effect prediction using deep neural network models trained on DNA sequence data.
References
It is unclear how strongly this affects downstream applications, such as enhancer sequence or genetic variant effect prediction.
— Metadata-guided Feature Disentanglement for Functional Genomics
(2405.19057 - Rakowski et al., 29 May 2024) in Section: Introduction