CLAMATO: Lyman‑α IGM Tomography in COSMOS
- CLAMATO is a pioneering high‑redshift survey that uses Lyman‑α forest absorption to reconstruct three-dimensional maps of the cosmic web at z between 2 and 3.
- The project employs advanced tomographic methods such as Wiener filtering and TARDIS-based forward modeling to resolve IGM structures on megaparsec scales.
- It integrates extensive spectroscopic data to correlate IGM features with galaxy distributions, providing new constraints on galaxy evolution during cosmic noon.
The COSMOS Lyman-Alpha Mapping And Tomography Observations (CLAMATO) project is a pioneering spectroscopic survey that uses the Lyman-α forest absorption in background galaxy and quasar spectra to reconstruct high-resolution three-dimensional maps of the intergalactic medium (IGM) and cosmic web at $2 < z < 3$. CLAMATO leverages the dense sampling capabilities of faint star-forming galaxies in the COSMOS field and advanced tomographic reconstruction methods to trace the IGM’s structure—voids, filaments, sheets, and nodes—on comoving scales of a few megaparsecs, providing new constraints on the relationship between galaxies and large-scale structure during the peak epoch of star formation.
1. Scientific Motivation and Survey Design
CLAMATO was conceived to overcome the sparse sightline limitations of traditional QSO-based Lyman-α forest studies by utilizing faint Lyman-break galaxies as alternative background sources. By doing so, CLAMATO achieves a differential sightline density exceeding at –25.0, resulting in mean transverse sightline separations of – in the COSMOS field at (Lee et al., 2013). This dense sightline distribution enables tomographic mapping of the IGM on megaparsec scales—small enough to resolve the cosmic web’s topology—across a contiguous area.
The survey strategy involves moderate-resolution (–$1300$) spectroscopy targeting both star-forming galaxies and quasars at $2.05 < z < 3.0$, using the Keck-I/LRIS instrument. In the first and second public data releases, CLAMATO obtained spectra for 240 (DR1) and then 320 (DR2) background sources over , covering a comoving volume of and achieving a spatial resolution of (Horowitz et al., 2021, Lee et al., 2017).
2. tomographic Reconstruction: Methods and Algorithms
The core of CLAMATO’s data analysis is the tomographic reconstruction of the three-dimensional IGM absorption field, , from noisy and irregularly spaced sightlines. The standard method employs a Wiener filter approach (Lee et al., 2014, Lee et al., 2017, Horowitz et al., 2021), modeling correlations through a Gaussian covariance:
where and are the line-of-sight and transverse correlation lengths, typically and ; is the variance of normalized flux fluctuations. The Wiener-reconstructed field is given by
where is the diagonal noise covariance matrix.
Advanced techniques, such as the Optimized Reconstruction with Constraints on Absorption (ORCA) (Li et al., 2021), replace the global matrix inversion by a multiscale, annealed optimization of a physically-motivated loss function:
Here, the physical clipping terms ensure the reconstructed transmission is bounded within physically allowed ranges, yielding improved recovery of extremal features such as voids and proto-clusters relative to Wiener filtering.
For dark matter field inference, CLAMATO utilizes a forward-modeling approach based on the TARDIS algorithm (Horowitz et al., 2019, Byrohl et al., 27 Sep 2024). This method reconstructs the initial density field by maximizing the joint likelihood of galaxy and Lyα forest data, then evolves the field using fast approximate gravity solvers, yielding models that—after calibration through the fluctuating Gunn-Peterson approximation (FGPA)—constrain the three-dimensional matter distribution and cosmic web classification (via the Hessian of the gravitational potential).
3. High-Redshift Cosmic Web: Mapping and Characterization
The dense CLAMATO data enable Mpc-scale mapping of the cosmic web at . The reconstructed IGM field at this epoch exhibits all canonical cosmic web elements: voids, sheets, filaments, and nodes (Horowitz et al., 2021, Krolewski et al., 2017). Cosmic web classification utilizes the eigenvalues of the deformation (tidal) tensor:
where is the gravitational potential sourced from the reconstructed density. Each spatial location is classified by the number of positive eigenvalues above a threshold, with nodes (+++), filaments (++–), sheets (+––), and voids (–––).
Quantitative cosmic web statistics—such as volume fractions, object counts, and eigenvector alignments—are validated against cosmological hydrodynamical (e.g., Nyx) and -body simulations. TARDIS-based reconstructions recover eigenvalues with correlations (for ELT-like quality), and ORCA improves structure recovery by 6% in void volume overlap relative to Wiener filtering.
CLAMATO also provides the first detection of cosmic voids at (Krolewski et al., 2017). Voids are identified using a spherical underdensity algorithm tuned to the Lyα transmission field, with selection thresholds calibrated on simulations to yield consistent volume fractions (18–19.5%). Stacked void profiles, radius functions, and redshift-space distortions are in close agreement with simulated predictions, enabling Alcock-Paczynski cosmological tests and investigation of modified gravity (Stark et al., 2015).
Protoclusters—overdense regions destined to evolve into massive galaxy clusters—are identified as large-scale absorption decrements spanning , robustly recovered at 90% purity and 75% completeness in idealized and mock reconstructions at CLAMATO’s sightline density (Stark et al., 2014).
4. Relationship to Galaxy Distributions and Galaxy Evolution
A key strength of CLAMATO is the ability to correlate 3D IGM structures with extensive galaxy redshift samples in COSMOS. Extensive cross-correlation analysis between CLAMATO’s Lyα forest maps and 1642 foreground galaxies (from CLAMATO, zCOSMOS-Deep, MOSDEF, VUDS, 3D-HST) measures the 3D cross-correlation function and constrains the galaxy bias as a function of stellar mass (Zhang et al., 9 Sep 2025). The estimator for the cross-correlation is:
Modeling the observed cross-power spectrum employs a redshift-space distortion formalism, accounting for line-of-sight velocity dispersion and systematic redshift offsets (notably, for UV absorption line-based redshifts). The observed "fingers-of-god" effect in MOSDEF (dispersions above ) reflects expected intra-halo velocity dispersions within overdense regions.
Splitting the galaxy sample by stellar mass, direct constraints on the stellar mass–halo mass relation (SHMR) at are achieved:
Median | Galaxy Bias | Typical |
---|---|---|
9.23 | 2.85 | 10.27 |
9.71 | 3.32 | 11.55 |
10.21 | 4.71 | 12.06 |
Comparison with Bolshoi-Planck simulations and IllustrisTNG finds good consistency for mid-/high-mass bins, but the low-mass bin indicates that low-mass galaxies may reside in halos 2 orders of magnitude less massive than some empirical models predict, suggesting elevated star formation efficiency in these halos at "cosmic noon".
Quantitative comparison of IGM voids and galaxy redshifts demonstrates that CLAMATO-identified voids are underdense in galaxies at significance compared to random regions (Krolewski et al., 2017). Visual and quantitative correlation of absorption peaks and known protoclusters further validates CLAMATO's ability to trace the environments of large-scale structure and galaxy evolution (Lee et al., 2017).
5. Synergy with Simulation and Modeling: TARDIS and cosmosTNG
The initial density fields reconstructed by CLAMATO have been used to set constrained initial conditions for the cosmosTNG hydrodynamical simulation suite (Byrohl et al., 27 Sep 2024). Using the TARDIS algorithm, the reconstructed density field from CLAMATO and zCOSMOS is evolved forward in AREPO with IllustrisTNG physics and baryonic mass resolution . Eight constrained realizations are simulated to sample stochasticity on small scales.
This approach allows for detailed environmental studies of the COSMOS field at :
- The simulated region exhibits an overabundance of massive halos and galaxies, especially in the zFIRE protocluster.
- The cosmic star formation rate density and the abundance of high-mass galaxies are higher and peak earlier relative to random-field simulations.
- The simulated galaxies display a lower quenched fraction at fixed mass by 20% for – compared to field expectations.
- Supermassive black holes are systematically overmassive by 0.5 dex at fixed stellar mass, consistent with JWST observations.
- Strong radial gradients in sSFR, gas fraction, and galaxy sizes with distance from protocluster centers reveal gas depletion and quenching consistent with environmental effects.
This demonstrates CLAMATO’s ability not only to map cosmic structure but also to supply physically-constrained environments for forward modeling of galaxy formation and feedback.
6. Data Releases, Community Resources, and Future Prospects
CLAMATO has released all reduced spectra, Lyα forest pixel data, and tomographic maps to the public (Lee et al., 2017, Horowitz et al., 2021). These releases include traditional figures, interactive 3D HTML visualizations, and VR panoramic explorations, enabling the community to analyze, validate, and extend the survey’s scientific reach.
Future directions indicated by the CLAMATO methodology and results include:
- Larger-area tomography (e.g., with Subaru-PFS or Maunakea Spectroscopic Explorer) to increase the void and protocluster sample sizes, improving cosmological constraints (AP test, growth rate, modified gravity).
- Cross-correlation with 21 cm experiments and intensity mapping surveys to jointly probe the ionized and neutral components of the high- IGM (Silva et al., 2012, Pullen et al., 2013).
- Application of improved inversion algorithms (TARDIS, ORCA) and denser background source sampling (with ELTs or high-multiplex facilities) to reduce systematics and allow physically richer IGM–galaxy studies.
- Synergistic use of simulations (e.g., cosmosTNG with CLAMATO initial conditions) to interpret multi-wavelength galaxy surveys and directly model the environmental impact on galaxy formation at cosmic noon.
CLAMATO thus stands as a template for IGM tomography, establishing both the methodology and empirical foundation for the next generation of cosmic web, IGM, and galaxy surveys at high redshift.