Papers
Topics
Authors
Recent
2000 character limit reached

Optimizing Photometric Redshift Training Sets I: Efficient Compression of the Galaxy Color-Redshift Relation with UMAP (2512.09032v1)

Published 9 Dec 2025 in astro-ph.GA

Abstract: Spectroscopic datasets are essential for training and calibrating photometric redshift (photo-$z$) methods. However, spectroscopic redshifts (spec-$z$'s) constitute a biased and sparse sampling of the photometric galaxy population, which creates difficulties for the common grid-based approach for mapping color to redshift using self-organizing maps (SOMs). Instead, we utilized the uniform manifold approximation and projection (UMAP) algorithm to compress a Rubin-Roman-like $ugrizyJH$ color space into a thin and densely-sampled manifold. Crucially, the manifold varies continuously and monotonically in redshift and specific star formation rate in roughly orthogonal directions. Using $\sim$110,000 COSMOS2020 many-band photo-$z$'s and $\sim$15,000 spec-$z$'s as representative and non-representative samples, respectively, we trained and tested redshift estimation from a SOM (SOM-$z$) and from nearest neighbors in UMAP space (UMAP-$k$NN-$z$). Compared to SOM-$z$, UMAP-$k$NN-$z$ exhibited smaller photo-$z$ scatter and fraction of outliers for the representative training set. When training with the highly biased spec-$z$ sample, UMAP-$k$NN-$z$ maintained similar performance, but the outlier fraction for SOM-$z$ increased by nearly three times. The physically-meaningful trends across the UMAP manifold allow for accurate redshift regression even in regions of color space sparsely populated by spectroscopic objects, which comprise nearly 25% of the photometric sample. This suggests that representative, spectroscopically-anchored training sets can be produced by interpolating between spectroscopic sources at the UMAP coordinates of photometric objects, maximizing the performance of photo-$z$ algorithms.

Summary

We haven't generated a summary for this paper yet.

Whiteboard

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Collections

Sign up for free to add this paper to one or more collections.

Tweets

Sign up for free to view the 1 tweet with 0 likes about this paper.