COLA-f: Cosmological Simulations & Conformal Prediction

Updated 19 November 2025

COLA-f is a dual-purpose method: in cosmology, it simulates scale-dependent structure growth for modified gravity, while in statistics it provides exact predictive inference via full-conformal allocation.
In cosmological simulations, COLA-f combines k-dependent Lagrangian perturbation theory with an efficient screening solver to reproduce matter power spectra and halo mass functions within 1–2% accuracy at a fraction of full N-body computational cost.
In predictive inference, COLA-f optimally allocates miscoverage to ensure exact finite-sample coverage, though its steep computational scaling restricts its use to smaller datasets or few classification labels.

The term COLA-f designates two distinct but unrelated advanced computational methods in contemporary research literature: (1) COLA-f in cosmological simulations, where it refers to the COmoving Lagrangian Acceleration method with scale-dependent growth and approximate modified gravity screening for efficient large-scale structure modeling; and (2) COLA-f in the context of conformal prediction, where it refers to the full-conformal α-allocation variant in predictive inference. Both domains employ the label COLA-f for methods involving the allocation or decomposition of key simulation or statistical resources to maximize accuracy and efficiency under practical constraints.

1. COLA-f in Cosmological Simulations: Scale-Dependent Growth with Screening

COLA-f, as introduced in "COLA with scale-dependent growth: applications to screened modified gravity models" (Winther et al., 2017), is a parallelized code that extends the COmoving Lagrangian Acceleration (COLA) formalism to simulations of cosmic structure formation in modified gravity models exhibiting scale-dependent linear and second-order growth. The primary innovation is the integration of $k$-dependent 1LPT/2LPT displacements with an approximate, efficient screening solver, making precision structure formation calculations tractable even for non-standard gravity scenarios such as $f(R)$ gravity and nDGP.

In the COLA framework, particle positions are split as

$x(q, \tau) = q + \Psi_{\rm LPT}(q, \tau) + \delta x(q, \tau)$

where $\Psi_{\rm LPT}$ contains 2nd-order Lagrangian perturbation theory (2LPT) displacements and $\delta x$ is the non-perturbative residual evolved by a low-cost N-body mesh integration. For models with scale-dependent linear growth, both 1LPT and 2LPT displacements are $k$-dependent; the corresponding ODEs are solved on a grid of wavenumbers, and the structure formation is advanced in a few time steps via FFT-based operations. For $f(R)$ gravity, screening enters through a chameleon-type field equation, with a fast screening factor applied to suppress the fifth force in high-density regions.
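The split into an analytic trajectory plus a numerically evolved residual can be illustrated with a toy problem. The sketch below is not the COLA-f code: it assumes a single particle in a harmonic well (standing in for the particle-mesh force), uses a second-order Taylor expansion as a hypothetical stand-in for $\Psi_{\rm LPT}$, and evolves only the residual with a kick-drift-kick leapfrog. All function names are illustrative.

```python
import math

def lpt_like(q, t):
    """Hypothetical analytic trajectory: 2nd-order Taylor expansion of q*cos(t)."""
    return q * (1.0 - 0.5 * t * t)

def lpt_like_accel(q, t):
    """Second time derivative of lpt_like (constant for this toy choice)."""
    return -q

def force(x):
    """Toy force standing in for the mesh gravity solve (harmonic well)."""
    return -x

def evolve_with_split(q, t_end, n_steps):
    """Leapfrog (kick-drift-kick) for the residual: dx'' = F(x_lpt + dx) - x_lpt''."""
    dt = t_end / n_steps
    t, dx, dv = 0.0, 0.0, 0.0
    for _ in range(n_steps):
        acc = force(lpt_like(q, t) + dx) - lpt_like_accel(q, t)
        dv += 0.5 * dt * acc          # half kick
        dx += dt * dv                 # drift
        t += dt
        acc = force(lpt_like(q, t) + dx) - lpt_like_accel(q, t)
        dv += 0.5 * dt * acc          # half kick
    return lpt_like(q, t) + dx        # full position = analytic part + residual

q = 1.0
x_split = evolve_with_split(q, t_end=1.0, n_steps=10)
x_exact = q * math.cos(1.0)           # exact solution of x'' = -x, x(0)=q, x'(0)=0
print("error with 10 steps:", abs(x_split - x_exact))
```

Because the analytic part already carries most of the motion, the integrator only has to track a small correction, which is the reason a handful of steps can suffice in the cosmological setting.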

Numerical validation against full N-body simulations in $f(R)$ and nDGP shows that COLA-f reproduces key observables, namely the boosts of the matter power spectrum and halo mass function relative to ΛCDM, at the $1$–$2\%$ level into the mildly nonlinear regime with only a few tens of time steps, and at a substantially lower computational cost than full-resolution N-body codes. This technical achievement enables routine execution of the large ensembles of precision simulations required for survey covariance, emulator construction, and mock galaxy catalog generation in modified gravity cosmologies (Winther et al., 2017).

2. Theoretical Foundations: Modified Gravity, LPT, and Screening

COLA-f is constructed for models where the linear growth function is $k$-dependent, typically due to scale-dependent modifications of gravity. Examples include $f(R)$ gravity, characterized in the Hu–Sawicki form by an extra scalaron degree of freedom, whose mass $m(a)$ determines the range of the fifth force. In the linear regime the effective gravitational coupling takes the standard form

$\frac{G_{\rm eff}(k, a)}{G} = 1 + \frac{1}{3}\,\frac{k^2}{k^2 + a^2 m^2(a)}$

Scale dependence propagates into both first- and second-order LPT displacements.
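As an illustration of how scale dependence enters the first-order growth, the following sketch integrates the linear growth equation separately for each wavenumber. It is a minimal example under stated assumptions, not the COLA-f implementation: it assumes the standard linearized $f(R)$ coupling $G_{\rm eff}/G = 1 + \frac{1}{3} k^2 / (k^2 + a^2 m^2)$, the Hu–Sawicki $n=1$ scalaron mass, a flat background with $\Omega_m = 0.3$, and a hand-written RK4 integrator. Function names and parameter values are illustrative.

```python
import math

OMEGA_M, OMEGA_L = 0.3, 0.7       # assumed flat background (illustrative values)
H0_OVER_C = 1.0 / 2997.92458      # H0/c in h/Mpc

def scalaron_mass_sq(a, f_r0):
    """Hu-Sawicki (n=1) scalaron mass squared, in (h/Mpc)^2."""
    ratio = (OMEGA_M / a**3 + 4.0 * OMEGA_L) / (OMEGA_M + 4.0 * OMEGA_L)
    return H0_OVER_C**2 * (OMEGA_M + 4.0 * OMEGA_L) / (2.0 * abs(f_r0)) * ratio**3

def g_eff_ratio(k, a, f_r0):
    """Linear G_eff/G = 1 + (1/3) k^2 / (k^2 + a^2 m^2); k in h/Mpc."""
    return 1.0 + k * k / (3.0 * (k * k + a * a * scalaron_mass_sq(a, f_r0)))

def growth_rhs(x, d, dp, k, f_r0):
    """Growth ODE in x = ln a: D'' + (2 + dlnH/dx) D' = 1.5 Omega_m(a) (G_eff/G) D."""
    a = math.exp(x)
    matter = OMEGA_M / a**3
    e2 = matter + OMEGA_L                      # (H/H0)^2
    dlnh_dx = -1.5 * matter / e2
    source = 1.5 * (matter / e2) * g_eff_ratio(k, a, f_r0) * d
    return dp, source - (2.0 + dlnh_dx) * dp

def linear_growth(k, f_r0, a_ini=0.01, n_steps=400):
    """RK4 from a_ini to a = 1, starting on the matter-era growing mode D = a."""
    x = math.log(a_ini)
    h = -x / n_steps
    d, dp = a_ini, a_ini
    for _ in range(n_steps):
        k1 = growth_rhs(x, d, dp, k, f_r0)
        k2 = growth_rhs(x + h / 2, d + h / 2 * k1[0], dp + h / 2 * k1[1], k, f_r0)
        k3 = growth_rhs(x + h / 2, d + h / 2 * k2[0], dp + h / 2 * k2[1], k, f_r0)
        k4 = growth_rhs(x + h, d + h * k3[0], dp + h * k3[1], k, f_r0)
        d += h / 6 * (k1[0] + 2 * k2[0] + 2 * k3[0] + k4[0])
        dp += h / 6 * (k1[1] + 2 * k2[1] + 2 * k3[1] + k4[1])
        x += h
    return d

k_grid = [1e-4, 0.1, 1.0]                      # h/Mpc
growth_f5 = [linear_growth(k, f_r0=1e-5) for k in k_grid]
for k, d in zip(k_grid, growth_f5):
    print(f"k = {k:g} h/Mpc  D(a=1) = {d:.4f}")
```

On the largest scales the result approaches the ordinary scale-independent growth factor, while smaller scales grow faster once the wavenumber exceeds $a\,m(a)$.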

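For $f(R)$ gravity, the scalaron field equation and screening are handled by the approximation:

$\nabla^2 \varphi = a^2 m^2(a)\,\varphi + \frac{4\pi G a^2}{3}\,\delta\rho_m\,\epsilon_{\rm screen}(\Phi_N)$

with $\epsilon_{\rm screen}(\Phi_N) = \min\left[1, \left|\frac{3 f_R(a)}{2\Phi_N}\right|\right]$, where $\varphi$ is the fifth-force potential (normalized so that the total potential is $\Phi_N + \varphi$), $m(a)$ is the scalaron mass, $\Phi_N$ is the Newtonian potential, and $f_R(a)$ is the background scalaron value. This yields rapid suppression of the modified force where Newtonian potentials are deep, without requiring a full nonlinear multigrid solution.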
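A screening factor of this kind is a purely local function of the Newtonian potential, which is what makes it cheap. The sketch below assumes the thin-shell-motivated form $\min[1, |3 f_R(a) / (2\Phi_N)|]$ and the Hu–Sawicki background scaling $|f_R(a)| = |f_{R0}|\,(\bar{R}_0/\bar{R}(a))^{n+1}$; both are assumptions of this example rather than a transcription of the COLA-f source, and the function names are illustrative.

```python
OMEGA_M, OMEGA_L = 0.3, 0.7   # assumed flat background (illustrative values)

def f_r_background(a, f_r0, n=1):
    """|f_R| of the background in Hu-Sawicki gravity: |f_R0| * (R0 / R(a))^(n+1)."""
    ratio = (OMEGA_M + 4.0 * OMEGA_L) / (OMEGA_M / a**3 + 4.0 * OMEGA_L)
    return abs(f_r0) * ratio ** (n + 1)

def screening_factor(phi_newton, a, f_r0):
    """min(1, |3 f_R(a) / (2 Phi_N)|), with Phi_N in units of c^2."""
    if phi_newton == 0.0:
        return 1.0                      # no potential well: fifth force unscreened
    return min(1.0, abs(1.5 * f_r_background(a, f_r0) / phi_newton))

cluster_potential = -1e-5               # typical depth of a massive halo (order of magnitude)
print("F5:", screening_factor(cluster_potential, a=1.0, f_r0=1e-5))
print("F6:", screening_factor(cluster_potential, a=1.0, f_r0=1e-6))
```

With these illustrative numbers the same halo is unscreened for $|f_{R0}| = 10^{-5}$ and strongly screened for $|f_{R0}| = 10^{-6}$, in line with screening being more efficient for smaller $|f_{R0}|$.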

To render the approach efficient, COLA-f replaces the full 3D integrals of 2LPT with an ansatz for the 2LPT kernel and computes all displacements on FFT grids. As a result, most computational effort is shifted to FFTs, domain decompositions, and local PM operations (Winther et al., 2017).
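The Fourier-space construction of the first-order displacement can be sketched in one dimension. Each mode of the initial density field is multiplied by its own growth ratio and by $i k / k^2$, so that $\delta = -\partial_q \Psi$ holds mode by mode. The example below uses a direct sum over three modes rather than an FFT, and the growth ratio is a made-up placeholder function, not a solution of the growth ODE; all names and values are hypothetical.

```python
import math

def growth_ratio(k):
    """Hypothetical D1(k, a) / D1(k, a_ini); placeholder shape, not a real solution."""
    return 50.0 * (1.0 + 0.2 * k * k / (k * k + 0.05))

# (wavenumber, initial amplitude, phase) of each mode; values are illustrative
MODES = [(0.05, 1e-3, 0.3), (0.2, 2e-3, 1.1), (0.8, 1e-3, 2.0)]

def delta_linear(q):
    """Linearly evolved density contrast: every mode scaled by its own growth ratio."""
    return sum(growth_ratio(k) * amp * math.cos(k * q + ph) for k, amp, ph in MODES)

def psi_first_order(q):
    """First-order displacement, Psi_k = (i k / k^2) D1(k) delta_k, in real space."""
    return -sum(growth_ratio(k) * amp / k * math.sin(k * q + ph) for k, amp, ph in MODES)

q0, h = 3.7, 1e-4
divergence = (psi_first_order(q0 + h) - psi_first_order(q0 - h)) / (2.0 * h)
print("delta:", delta_linear(q0), " -dPsi/dq:", -divergence)
```

In a scale-independent model the growth ratio would factor out of the sum; with $k$-dependent growth it has to be applied inside the transform, which is why the displacement fields are recomputed on FFT grids.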

3. Numerical Scheme, Scalability, and Performance

The COLA-f algorithm is designed for distributed parallelism, dividing the computational domain using MPI. Each time-step involves:

Solving ODEs for the $k$-dependent first-order growth factor $D_1(k, a)$ and the approximate second-order growth factor $D_2(k, a)$,
FFT operations for displacement fields,
Density assignment and FFT-based gravity solves,
Application of the screening factor,
Leapfrog integration of the residual displacement and velocity.

Additional memory overhead arises from storing multiple $k$-space displacement arrays and per-particle LPT derivatives. Particle exchanges occur when Lagrangian displacements traverse subdomain boundaries, tracked via home-CPU IDs and initial coordinates.

Empirically,

About 10 time steps already recover the $P(k)$ and halo mass function boosts to within a few percent on mildly nonlinear scales.
Increasing to 20–30 steps yields percent-level accuracy to smaller scales and across the resolved halo mass range.

Compared to full N-body simulations, COLA-f achieves orders-of-magnitude speed-ups, with a 3–4$\times$ slowdown relative to standard COLA when scale-dependent growth and screening are included (Winther et al., 2017).

4. Accuracy, Validation, and Scientific Applications

Benchmarking against high-resolution N-body datasets shows:

Matter power spectrum $P(k)$ boosts for $f(R)$: accuracy within $1$–$2\%$ into the mildly nonlinear regime;
Halo mass function ratios at the percent level (F5), slightly underestimating for lighter halos in F6 due to screening approximation limitations;
Velocity divergence spectra accurate at the few-percent level on comparable scales.
For nDGP, $P(k)$ and halo mass function boosts are reproduced to similar accuracy.

COLA-f enables the construction of large ensembles of mock galaxy, halo, and dark matter catalogs under $f(R)$ or other scale-dependent gravity, crucial for covariance estimation, emulator calibration, and data analysis for large-scale structure experiments.

5. COLA-f in Conformal Prediction: Full-Conformal Aggregation

An unrelated, independent usage of COLA-f arises in predictive inference, denoting the full-conformal α-allocation method for constructing conformal prediction sets with exact finite-sample coverage (Xu et al., 15 Nov 2025). In this context, given $K$ nonconformity scores and a calibration dataset $\{(X_i, Y_i)\}_{i=1}^{n}$, COLA-f allocates the total miscoverage $\alpha$ across the $K$ sets so as to minimize average prediction set size, recalibrating the allocation vector $(\alpha_1, \dots, \alpha_K)$ for each possible label $y$ using the augmented sample of $n+1$ points.

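The COLA-f set for a new input $X_{n+1}$ is

$\hat{C}(X_{n+1}) = \left\{ y : y \in \bigcap_{k=1}^{K} \hat{C}_k\big(X_{n+1};\, \hat{\alpha}_k(y)\big) \right\}, \quad \sum_{k=1}^{K} \hat{\alpha}_k(y) = \alpha$

where $\hat{C}_k(\cdot\,; \alpha_k)$ denotes the prediction set built from the $k$-th score at miscoverage level $\alpha_k$, and each allocation $\hat{\alpha}(y) = (\hat{\alpha}_1(y), \dots, \hat{\alpha}_K(y))$ is chosen to minimize mean set size, treating the candidate label $y$ symmetrically with the observed calibration responses. This exact finite-sample symmetry restores marginal coverage of at least $1-\alpha$, at the expense of a computational cost that grows steeply with both the calibration size $n$ and the number of candidate labels, restricting practical use to small $n$ or small label sets.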
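The per-label recalibration can be made concrete with a small classification sketch. This is one plausible reading of the procedure under stated assumptions, not the algorithm of Xu et al.: it assumes $K = 2$ fixed score functions, searches the allocation $(\alpha_1, \alpha - \alpha_1)$ on a grid, measures set size on the augmented sample, and keeps a candidate label when it passes both thresholds. The score functions, data, and function names are hypothetical.

```python
import math

def threshold(scores, alpha_k):
    """ceil((1 - alpha_k) * m)-th smallest of the m augmented scores."""
    m = len(scores)
    rank = min(m, max(1, math.ceil((1.0 - alpha_k) * m)))
    return sorted(scores)[rank - 1]

def full_conformal_allocation_set(score_fns, calib, x_new, labels, alpha, grid=11):
    """Keep label y if it passes both thresholds under the allocation tuned for y."""
    kept = []
    for y in labels:
        aug = calib + [(x_new, y)]                  # hypothesised (n+1)-point sample
        true_scores = [[s(x, t) for x, t in aug] for s in score_fns]
        best = None
        for j in range(grid):                       # K = 2: scan alpha_1 on a grid
            a1 = alpha * j / (grid - 1)
            qs = [threshold(true_scores[0], a1),
                  threshold(true_scores[1], alpha - a1)]
            # mean size of the intersection set over all augmented inputs
            size = sum(
                sum(all(s(x, lab) <= q for s, q in zip(score_fns, qs)) for lab in labels)
                for x, _ in aug
            ) / len(aug)
            if best is None or size < best[0]:
                best = (size, qs)
        if all(s(x_new, y) <= q for s, q in zip(score_fns, best[1])):
            kept.append(y)
    return kept

# Hypothetical scores and calibration data for a three-class toy problem
scores = [lambda x, y: abs(x - y), lambda x, y: abs(x - y) / (1.0 + y)]
calib = [(0.1, 0), (0.2, 0), (0.15, 0), (0.9, 1), (1.1, 1), (1.2, 1),
         (1.8, 2), (2.1, 2), (2.25, 2)]
labels = [0, 1, 2]
set_loose = full_conformal_allocation_set(scores, calib, 0.05, labels, alpha=0.3)
set_strict = full_conformal_allocation_set(scores, calib, 0.05, labels, alpha=0.05)
print(set_loose, set_strict)
```

The nested loops over candidate labels, grid points, augmented inputs, and labels again show where the cost comes from; with $\alpha$ below $1/(n+1)$ no label can be rejected, so the set necessarily contains every label.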

Table: COLA-f Algorithmic and Empirical Characteristics (Xu et al., 15 Nov 2025)

Feature	Implementation	Empirical Performance
Coverage Guarantee	Finite-sample marginal ($\ge 1-\alpha$)	Achieved in all reported settings
Optimality Objective	Minimize average set size	Shorter sets for small $n$
Computational Scaling	Steep growth with calibration size $n$ and number of candidate labels	71–1793 s per test sample over the tested range
Recommended Use Case	Small $n$, few candidate labels	-

COLA-f in this sense is most useful where sample size is small and exact validity is required; for large-scale problems, sample-split (COLA-s) or asymptotic (COLA-e) methods are strongly preferred due to computational intractability (Xu et al., 15 Nov 2025).

6. Limitations and Future Prospects

Cosmological COLA-f:

The reliance on linear screening and approximate 2LPT kernels implies underestimation of higher-order statistics' MG signals, specifically in the deeply screened regime and the reduced bispectrum of dark matter (Fiorini et al., 2022).
The method does not resolve non-spherical screening or small-scale halo substructure, requiring external calibrations or empirical fits for high-fidelity galaxy/halo population models (Fiorini et al., 2021).
Pushing to strongly non-linear or baryon-dominated small scales requires higher mesh resolution, symplectic integrators, or field-level emulation approaches.

Conformal Prediction COLA-f:

Exact finite-sample guarantees come at steep computational cost, limiting feasibility to classification problems with few labels or regression problems with coarse label grids and small calibration sets.
No non-asymptotic efficiency bounds are currently available; theoretical and algorithmic advances for more efficient full-conformal set aggregation remain open (Xu et al., 15 Nov 2025).

7. Summary of Impact and Usage

In cosmology, COLA-f has enabled percent-level simulation of structure formation in modified gravity models at orders-of-magnitude reduced computational cost, directly supporting mock catalog production, power-spectrum emulation, and forecast analyses for Stage IV galaxy surveys (Winther et al., 2017). In statistics, COLA-f offers a conceptually optimal but computationally intensive solution for combining multiple scoring rules in predictive inference, providing exact marginal coverage (Xu et al., 15 Nov 2025). Future work in both fields aims to relax computational constraints while retaining optimality or accuracy guarantees.


References

Winther et al. (2017). COLA with scale-dependent growth: applications to screened modified gravity models.

Xu et al. (2025). Aggregating Conformal Prediction Sets via α-Allocation.

Fiorini et al. (2022). Studying large-scale structure probes of modified gravity with COLA.

Fiorini et al. (2021). Fast generation of mock galaxy catalogues in modified gravity models with COLA.
