Papers
Topics
Authors
Recent
Search
2000 character limit reached

COLA-f: Cosmological Simulations & Conformal Prediction

Updated 19 November 2025
  • COLA-f is a dual-purpose method: in cosmology, it simulates scale-dependent structure growth for modified gravity, while in statistics it provides exact predictive inference via full-conformal allocation.
  • In cosmological simulations, COLA-f combines k-dependent Lagrangian perturbation theory with an efficient screening solver to reproduce matter power spectra and halo mass functions within 1–2% accuracy at a fraction of full N-body computational cost.
  • In predictive inference, COLA-f optimally allocates miscoverage to ensure exact finite-sample coverage, though its steep computational scaling restricts its use to smaller datasets or few classification labels.

The term COLA-f designates two distinct but unrelated advanced computational methods in contemporary research literature: (1) COLA-f in cosmological simulations, where it refers to the COmoving Lagrangian Acceleration method with scale-dependent growth and approximate modified gravity screening for efficient large-scale structure modeling; and (2) COLA-f in the context of conformal prediction, where it refers to the full-conformal α-allocation variant in predictive inference. Both domains employ the label COLA-f for methods involving the allocation or decomposition of key simulation or statistical resources to maximize accuracy and efficiency under practical constraints.

1. COLA-f in Cosmological Simulations: Scale-Dependent Growth with Screening

COLA-f, as introduced in "COLA with scale-dependent growth: applications to screened modified gravity models" (Winther et al., 2017), is a parallelized code that extends the COmoving Lagrangian Acceleration (COLA) formalism to simulations of cosmic structure formation in modified gravity models exhibiting scale-dependent linear and second-order growth. The primary innovation is integration of kk-dependent 1LPT/2LPT displacements combined with an approximate, efficient screening solver, making precision structure formation calculations tractable even for non-standard gravity scenarios such as f(R)f(R) gravity and nDGP.

In the COLA framework, particle positions are split as

x(q,τ)=q+ΨLPT(q,τ)+δx(q,τ)x(q, \tau) = q + \Psi_{\rm LPT}(q, \tau) + \delta x(q, \tau)

where ΨLPT\Psi_{\rm LPT} contains 2nd-order Lagrangian perturbation theory (2LPT) displacements and δx\delta x is the non-perturbative residual evolved by a low-cost N-body mesh integration. For models with scale-dependent linear growth, both 1LPT and 2LPT displacements are kk-dependent; the corresponding ODEs are solved on a grid of wavenumbers, and the structure formation is advanced with few time-steps via FFT-based operations. For f(R)f(R)-gravity, screening enters through a chameleon-type field equation, with a fast screening factor applied to suppress the fifth force in high-density regions.

Numerical validation against full N-body simulations in f(R)f(R) and nDGP shows that COLA-f reproduces key observables—the matter power spectrum and halo mass function boosts—at the $1$–2%2\% level up to f(R)f(R)0Mpc, for f(R)f(R)1 time steps, yet at f(R)f(R)2 lower computational cost than full-resolution N-body codes. This technical achievement enables routine execution of precision large ensemble simulations required for survey covariance, emulator construction, and mock galaxy catalog generation in modified gravity cosmologies (Winther et al., 2017).

2. Theoretical Foundations: Modified Gravity, LPT, and Screening

COLA-f is constructed for models where the linear growth function is f(R)f(R)3-dependent, typically due to scale-dependent modifications of gravity. Examples include f(R)f(R)4 gravity, characterized in the Hu–Sawicki form by an extra scalaron degree of freedom, whose mass f(R)f(R)5 determines the range of the fifth force:

f(R)f(R)6

Scale dependence propagates into both first- and second-order LPT displacements.

For f(R)f(R)7, the scalaron field equation and screening are handled by the approximation:

f(R)f(R)8

with f(R)f(R)9, yielding rapid suppression of the modified force where Newtonian potentials are deep, without requiring a full nonlinear multigrid solution.

To render the approach efficient, COLA-f replaces the full 3D integrals of 2LPT with an ansatz for the 2LPT kernel and computes all displacements on FFT grids. As a result, most computational effort is shifted to FFTs, domain decompositions, and local PM operations (Winther et al., 2017).

3. Numerical Scheme, Scalability, and Performance

The COLA-f algorithm is designed for distributed parallelism, dividing the computational domain using MPI. Each time-step involves:

  • Solving ODEs for x(q,Ï„)=q+ΨLPT(q,Ï„)+δx(q,Ï„)x(q, \tau) = q + \Psi_{\rm LPT}(q, \tau) + \delta x(q, \tau)0 and the approximate x(q,Ï„)=q+ΨLPT(q,Ï„)+δx(q,Ï„)x(q, \tau) = q + \Psi_{\rm LPT}(q, \tau) + \delta x(q, \tau)1,
  • FFT operations for displacement fields,
  • Density assignment and FFT-based gravity solves,
  • Application of the screening factor,
  • Leapfrog integration of the residual displacement and velocity.

Additional memory overhead arises from storing multiple x(q,τ)=q+ΨLPT(q,τ)+δx(q,τ)x(q, \tau) = q + \Psi_{\rm LPT}(q, \tau) + \delta x(q, \tau)2-space displacement arrays and per-particle LPT derivatives. Particle exchanges occur when Lagrangian displacements traverse subdomain boundaries, tracked via home-CPU IDs and initial coordinates.

Empirically,

  • 10 time steps provide Px(q,Ï„)=q+ΨLPT(q,Ï„)+δx(q,Ï„)x(q, \tau) = q + \Psi_{\rm LPT}(q, \tau) + \delta x(q, \tau)3 and halo mass function accuracy within x(q,Ï„)=q+ΨLPT(q,Ï„)+δx(q,Ï„)x(q, \tau) = q + \Psi_{\rm LPT}(q, \tau) + \delta x(q, \tau)4–x(q,Ï„)=q+ΨLPT(q,Ï„)+δx(q,Ï„)x(q, \tau) = q + \Psi_{\rm LPT}(q, \tau) + \delta x(q, \tau)5 for x(q,Ï„)=q+ΨLPT(q,Ï„)+δx(q,Ï„)x(q, \tau) = q + \Psi_{\rm LPT}(q, \tau) + \delta x(q, \tau)6/Mpc.
  • Increasing to 20–30 steps yields percent-level accuracy to x(q,Ï„)=q+ΨLPT(q,Ï„)+δx(q,Ï„)x(q, \tau) = q + \Psi_{\rm LPT}(q, \tau) + \delta x(q, \tau)7Mpc and across halo masses x(q,Ï„)=q+ΨLPT(q,Ï„)+δx(q,Ï„)x(q, \tau) = q + \Psi_{\rm LPT}(q, \tau) + \delta x(q, \tau)8–x(q,Ï„)=q+ΨLPT(q,Ï„)+δx(q,Ï„)x(q, \tau) = q + \Psi_{\rm LPT}(q, \tau) + \delta x(q, \tau)9.

When compared to full N-body, COLA-f achieves ΨLPT\Psi_{\rm LPT}0 speed-ups, with a 3-4ΨLPT\Psi_{\rm LPT}1 slowdown when scale-dependent growth and screening are included relative to standard COLA (Winther et al., 2017).

4. Accuracy, Validation, and Scientific Applications

Benchmarking against high-resolution N-body datasets shows:

  • Matter power spectrum ΨLPT\Psi_{\rm LPT}2 boosts for ΨLPT\Psi_{\rm LPT}3: accuracy within ΨLPT\Psi_{\rm LPT}4–ΨLPT\Psi_{\rm LPT}5 up to ΨLPT\Psi_{\rm LPT}6Mpc;
  • Halo mass function ratios within ΨLPT\Psi_{\rm LPT}7 (F5), slightly underestimating for lighter halos in F6 due to screening approximation limitations;
  • Velocity divergence spectra accurate at the ΨLPT\Psi_{\rm LPT}8–ΨLPT\Psi_{\rm LPT}9 level to δx\delta x0Mpc.
  • For nDGP, δx\delta x1 and halo function boosts are reproduced within δx\delta x2.

COLA-f enables the construction of large ensembles of mock galaxy, halo, and dark matter catalogs under δx\delta x3 or other scale-dependent gravity, crucial for covariance estimation, emulator calibration, and data analysis for large-scale structure experiments.

5. COLA-f in Conformal Prediction: Full-Conformal Aggregation

An unrelated, independent usage of COLA-f arises in predictive inference, denoting the full-conformal α-allocation method for constructing conformal prediction sets with exact finite-sample coverage (Xu et al., 15 Nov 2025). In this context, given δx\delta x4 nonconformity scores and a calibration dataset δx\delta x5, COLA-f allocates the total miscoverage δx\delta x6 across the δx\delta x7 sets so as to minimize average prediction set size, recalibrating the allocation vector δx\delta x8 for each possible label δx\delta x9 using the augmented sample of kk0 points.

The COLA-f set for a new input kk1 is

kk2

where each allocation kk3 is chosen to minimize mean set size, treating the candidate label kk4 symmetrically with the observed calibration responses. This exact finite-sample symmetry restores marginal coverage kk5, at the expense of kk6 computational cost, restricting practical use to small kk7 or small kk8.

Table: COLA-f Algorithmic and Empirical Characteristics (Xu et al., 15 Nov 2025)

Feature Implementation Empirical Performance
Coverage Guarantee Finite-sample marginal (kk9) Achieved for all f(R)f(R)0
Optimality Objective Minimize average set size Shorter sets for small f(R)f(R)1
Computational Scaling f(R)f(R)2 71–1793s per test sample for f(R)f(R)3–f(R)f(R)4
Recommended Use Case f(R)f(R)5, f(R)f(R)6 -

COLA-f in this sense is most useful where sample size is small and exact validity is required; for large-scale problems, sample-split (COLA-s) or asymptotic (COLA-e) methods are strongly preferred due to computational intractability (Xu et al., 15 Nov 2025).

6. Limitations and Future Prospects

Cosmological COLA-f:

  • The reliance on linear screening and approximate 2LPT kernels implies underestimation of higher-order statistics' MG signals, specifically in the deeply screened regime and the reduced bispectrum of dark matter (Fiorini et al., 2022).
  • The method does not resolve non-spherical screening or small-scale halo substructure, requiring external calibrations or empirical fits for high-fidelity galaxy/habitat population models (Fiorini et al., 2021).
  • Pushing to strongly non-linear or baryon-dominated (f(R)f(R)7Mpc) scales requires higher mesh resolution, symplectic integrators, or field-level emulation approaches.

Conformal Prediction COLA-f:

  • Exact finite-sample guarantees come at steep computational cost, limiting feasibility to classification problems with few labels or regression problems with coarse label grids and small calibration sets.
  • No non-asymptotic efficiency bounds are currently available; theoretical and algorithmic advances for more efficient full-conformal set aggregation remain open (Xu et al., 15 Nov 2025).

7. Summary of Impact and Usage

In cosmology, COLA-f has enabled percent-level simulation of structure formation in modified gravity models at orders-of-magnitude reduced computational cost, directly supporting mock catalog production, power-spectrum emulation, and forecast analyses for Stage IV galaxy surveys (Winther et al., 2017). In statistics, COLA-f offers a conceptually optimal but computationally intensive solution for combining multiple scoring rules in predictive inference, providing exact marginal coverage (Xu et al., 15 Nov 2025). Future work in both fields aims to relax computational constraints while retaining optimality or accuracy guarantees.

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to COLA-f.