Papers
Topics
Authors
Recent
Search
2000 character limit reached

Towards out-of-distribution generalization in large-scale astronomical surveys: robust networks learn similar representations

Published 29 Nov 2023 in astro-ph.IM, astro-ph.GA, and cs.LG | (2311.18007v1)

Abstract: The generalization of ML models to out-of-distribution (OOD) examples remains a key challenge in extracting information from upcoming astronomical surveys. Interpretability approaches are a natural way to gain insights into the OOD generalization problem. We use Centered Kernel Alignment (CKA), a similarity measure metric of neural network representations, to examine the relationship between representation similarity and performance of pre-trained Convolutional Neural Networks (CNNs) on the CAMELS Multifield Dataset. We find that when models are robust to a distribution shift, they produce substantially different representations across their layers on OOD data. However, when they fail to generalize, these representations change less from layer to layer on OOD data. We discuss the potential application of similarity representation in guiding model design, training strategy, and mitigating the OOD problem by incorporating CKA as an inductive bias during training.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (12)
  1. Building Trustworthy Machine Learning Models for Astronomy. arXiv e-prints, art. arXiv:2111.14566, November 2021. doi: 10.48550/arXiv.2111.14566.
  2. Interpreting deep learning models for weak lensing. Phys. Rev. D, 102:123506, Dec 2020. doi: 10.1103/PhysRevD.102.123506. URL https://link.aps.org/doi/10.1103/PhysRevD.102.123506.
  3. Learning from the machine: interpreting machine learning algorithms for point- and extended-source classification. Monthly Notices of the Royal Astronomical Society, 481(3):4194–4205, 09 2018. ISSN 0035-8711. doi: 10.1093/mnras/sty2575. URL https://doi.org/10.1093/mnras/sty2575.
  4. John F. Wu. Connecting optical morphology, environment, and h i mass fraction for low-redshift galaxies using deep learning. The Astrophysical Journal, 900(2):142, sep 2020. doi: 10.3847/1538-4357/abacbb. URL https://dx.doi.org/10.3847/1538-4357/abacbb.
  5. Miles Cranmer. Interpretable Machine Learning for Science with PySR and SymbolicRegression.jl. arXiv e-prints, art. arXiv:2305.01582, May 2023. doi: 10.48550/arXiv.2305.01582.
  6. Understanding Robust Learning through the Lens of Representation Similarities. arXiv e-prints, art. arXiv:2206.09868, June 2022. doi: 10.48550/arXiv.2206.09868.
  7. The CAMELS Multifield Dataset: Learning the Universe’s Fundamental Parameters with Artificial Intelligence. arXiv e-prints, art. arXiv:2109.10915, September 2021a.
  8. The CAMELS Project: Cosmology and Astrophysics with Machine-learning Simulations. Astrophysical Journal, 915(1):71, July 2021b. doi: 10.3847/1538-4357/abf7ba.
  9. Similarity of Neural Network Representations Revisited. arXiv e-prints, art. arXiv:1905.00414, May 2019. doi: 10.48550/arXiv.1905.00414.
  10. Do wide and deep networks learn the same things? uncovering how neural network representations vary with width and depth, 2021.
  11. Multifield Cosmology with Artificial Intelligence. arXiv e-prints, art. arXiv:2109.09747, September 2021c. doi: 10.48550/arXiv.2109.09747.
  12. The camels multifield data set: Learning the universe’s fundamental parameters with artificial intelligence. The Astrophysical Journal Supplement Series, 259(2):61, apr 2022. doi: 10.3847/1538-4365/ac5ab0. URL https://dx.doi.org/10.3847/1538-4365/ac5ab0.
Citations (1)

Summary

Paper to Video (Beta)

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Collections

Sign up for free to add this paper to one or more collections.