Towards Robust Deep Active Learning for Scientific Computing (2201.12632v2)

Published 29 Jan 2022 in cs.LG and cs.AI

Abstract: Deep learning (DL) is revolutionizing the scientific computing community. To reduce the data gap, active learning has been identified as a promising solution for DL in this domain. However, the deep active learning (DAL) literature is dominated by image classification problems and pool-based methods. Here we investigate the robustness of pool-based DAL methods for scientific computing problems (dominated by regression), where deep neural networks (DNNs) are increasingly used. We show that modern pool-based DAL methods all share an untunable hyperparameter, termed the pool ratio and denoted $\gamma$, which is often assumed to be known a priori in the literature. We evaluate the performance of five state-of-the-art DAL methods on six benchmark problems under the more realistic assumption for scientific computing that $\gamma$ is \textit{not} known. Our results indicate that this assumption degrades the performance of modern DAL methods, which can then even perform worse than random sampling, creating significant uncertainty when they are used in real-world settings. To overcome this limitation, we propose, to our knowledge, the first query synthesis DAL method for regression, termed NA-QBC. NA-QBC removes the sensitive $\gamma$ hyperparameter, and we find that, on average, it outperforms the other DAL methods on our benchmark problems. Crucially, NA-QBC always outperforms random sampling, providing more robust performance benefits.
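As a concrete illustration of the pool ratio $\gamma$ the abstract refers to, the following minimal sketch shows one pool-based query-by-committee (QBC) step for 1-D regression. The function names, the toy committee, and the convention that the candidate pool holds $\gamma$ times the query budget are illustrative assumptions, not the paper's implementation of NA-QBC.

```python
import numpy as np

rng = np.random.default_rng(0)

def committee_disagreement(models, X):
    """QBC acquisition score for regression: variance across committee predictions."""
    preds = np.stack([m(X) for m in models])  # shape (n_models, n_points)
    return preds.var(axis=0)

def pool_based_qbc_step(models, n_query, gamma, rng):
    """One pool-based QBC step. The pool ratio gamma fixes the candidate
    pool size relative to the query budget: |pool| = gamma * n_query
    (an assumed convention for this sketch)."""
    pool = rng.uniform(-1.0, 1.0, size=int(gamma * n_query))  # candidate pool
    scores = committee_disagreement(models, pool)
    return pool[np.argsort(scores)[-n_query:]]  # label the most-disputed points

# Toy committee: three perturbed fits of the same 1-D target function.
models = [lambda x, a=a: np.sin(3 * x) + a * x for a in (0.0, 0.1, -0.1)]
print(pool_based_qbc_step(models, n_query=5, gamma=10, rng=rng))
```

The sketch makes the paper's concern visible: the chosen points depend on the size of the random candidate pool, so a poorly chosen $\gamma$ changes which queries are made. A query synthesis method like the proposed NA-QBC instead generates query points directly, removing the pool (and hence $\gamma$) from the loop.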
