Towards out-of-distribution generalization in large-scale astronomical surveys: robust networks learn similar representations (2311.18007v1)
Abstract: The generalization of ML models to out-of-distribution (OOD) examples remains a key challenge in extracting information from upcoming astronomical surveys. Interpretability approaches are a natural way to gain insights into the OOD generalization problem. We use Centered Kernel Alignment (CKA), a similarity measure metric of neural network representations, to examine the relationship between representation similarity and performance of pre-trained Convolutional Neural Networks (CNNs) on the CAMELS Multifield Dataset. We find that when models are robust to a distribution shift, they produce substantially different representations across their layers on OOD data. However, when they fail to generalize, these representations change less from layer to layer on OOD data. We discuss the potential application of similarity representation in guiding model design, training strategy, and mitigating the OOD problem by incorporating CKA as an inductive bias during training.
- Building Trustworthy Machine Learning Models for Astronomy. arXiv e-prints, art. arXiv:2111.14566, November 2021. doi: 10.48550/arXiv.2111.14566.
- Interpreting deep learning models for weak lensing. Phys. Rev. D, 102:123506, Dec 2020. doi: 10.1103/PhysRevD.102.123506. URL https://link.aps.org/doi/10.1103/PhysRevD.102.123506.
- Learning from the machine: interpreting machine learning algorithms for point- and extended-source classification. Monthly Notices of the Royal Astronomical Society, 481(3):4194–4205, 09 2018. ISSN 0035-8711. doi: 10.1093/mnras/sty2575. URL https://doi.org/10.1093/mnras/sty2575.
- John F. Wu. Connecting optical morphology, environment, and h i mass fraction for low-redshift galaxies using deep learning. The Astrophysical Journal, 900(2):142, sep 2020. doi: 10.3847/1538-4357/abacbb. URL https://dx.doi.org/10.3847/1538-4357/abacbb.
- Miles Cranmer. Interpretable Machine Learning for Science with PySR and SymbolicRegression.jl. arXiv e-prints, art. arXiv:2305.01582, May 2023. doi: 10.48550/arXiv.2305.01582.
- Understanding Robust Learning through the Lens of Representation Similarities. arXiv e-prints, art. arXiv:2206.09868, June 2022. doi: 10.48550/arXiv.2206.09868.
- The CAMELS Multifield Dataset: Learning the Universe’s Fundamental Parameters with Artificial Intelligence. arXiv e-prints, art. arXiv:2109.10915, September 2021a.
- The CAMELS Project: Cosmology and Astrophysics with Machine-learning Simulations. Astrophysical Journal, 915(1):71, July 2021b. doi: 10.3847/1538-4357/abf7ba.
- Similarity of Neural Network Representations Revisited. arXiv e-prints, art. arXiv:1905.00414, May 2019. doi: 10.48550/arXiv.1905.00414.
- Do wide and deep networks learn the same things? uncovering how neural network representations vary with width and depth, 2021.
- Multifield Cosmology with Artificial Intelligence. arXiv e-prints, art. arXiv:2109.09747, September 2021c. doi: 10.48550/arXiv.2109.09747.
- The camels multifield data set: Learning the universe’s fundamental parameters with artificial intelligence. The Astrophysical Journal Supplement Series, 259(2):61, apr 2022. doi: 10.3847/1538-4365/ac5ab0. URL https://dx.doi.org/10.3847/1538-4365/ac5ab0.