Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
162 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Towards out-of-distribution generalization in large-scale astronomical surveys: robust networks learn similar representations (2311.18007v1)

Published 29 Nov 2023 in astro-ph.IM, astro-ph.GA, and cs.LG

Abstract: The generalization of ML models to out-of-distribution (OOD) examples remains a key challenge in extracting information from upcoming astronomical surveys. Interpretability approaches are a natural way to gain insights into the OOD generalization problem. We use Centered Kernel Alignment (CKA), a similarity measure metric of neural network representations, to examine the relationship between representation similarity and performance of pre-trained Convolutional Neural Networks (CNNs) on the CAMELS Multifield Dataset. We find that when models are robust to a distribution shift, they produce substantially different representations across their layers on OOD data. However, when they fail to generalize, these representations change less from layer to layer on OOD data. We discuss the potential application of similarity representation in guiding model design, training strategy, and mitigating the OOD problem by incorporating CKA as an inductive bias during training.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (12)
  1. Building Trustworthy Machine Learning Models for Astronomy. arXiv e-prints, art. arXiv:2111.14566, November 2021. doi: 10.48550/arXiv.2111.14566.
  2. Interpreting deep learning models for weak lensing. Phys. Rev. D, 102:123506, Dec 2020. doi: 10.1103/PhysRevD.102.123506. URL https://link.aps.org/doi/10.1103/PhysRevD.102.123506.
  3. Learning from the machine: interpreting machine learning algorithms for point- and extended-source classification. Monthly Notices of the Royal Astronomical Society, 481(3):4194–4205, 09 2018. ISSN 0035-8711. doi: 10.1093/mnras/sty2575. URL https://doi.org/10.1093/mnras/sty2575.
  4. John F. Wu. Connecting optical morphology, environment, and h i mass fraction for low-redshift galaxies using deep learning. The Astrophysical Journal, 900(2):142, sep 2020. doi: 10.3847/1538-4357/abacbb. URL https://dx.doi.org/10.3847/1538-4357/abacbb.
  5. Miles Cranmer. Interpretable Machine Learning for Science with PySR and SymbolicRegression.jl. arXiv e-prints, art. arXiv:2305.01582, May 2023. doi: 10.48550/arXiv.2305.01582.
  6. Understanding Robust Learning through the Lens of Representation Similarities. arXiv e-prints, art. arXiv:2206.09868, June 2022. doi: 10.48550/arXiv.2206.09868.
  7. The CAMELS Multifield Dataset: Learning the Universe’s Fundamental Parameters with Artificial Intelligence. arXiv e-prints, art. arXiv:2109.10915, September 2021a.
  8. The CAMELS Project: Cosmology and Astrophysics with Machine-learning Simulations. Astrophysical Journal, 915(1):71, July 2021b. doi: 10.3847/1538-4357/abf7ba.
  9. Similarity of Neural Network Representations Revisited. arXiv e-prints, art. arXiv:1905.00414, May 2019. doi: 10.48550/arXiv.1905.00414.
  10. Do wide and deep networks learn the same things? uncovering how neural network representations vary with width and depth, 2021.
  11. Multifield Cosmology with Artificial Intelligence. arXiv e-prints, art. arXiv:2109.09747, September 2021c. doi: 10.48550/arXiv.2109.09747.
  12. The camels multifield data set: Learning the universe’s fundamental parameters with artificial intelligence. The Astrophysical Journal Supplement Series, 259(2):61, apr 2022. doi: 10.3847/1538-4365/ac5ab0. URL https://dx.doi.org/10.3847/1538-4365/ac5ab0.
Citations (1)

Summary

We haven't generated a summary for this paper yet.