Irregular Scaling Behaviors

Updated 8 July 2025

Irregular scaling behaviors are deviations from simple power-law relations, exhibiting non-monotonic, singular, or non-analytic dependencies across diverse systems.
They are identified through a combination of empirical observations and theoretical models, including finite-size analyses, multifractal spectra, and simulation diagnostics.
Recognizing these irregularities guides the refinement of predictive models in fields like phase transitions, disordered materials, and neural network scaling.

Irregular scaling behaviors refer to the empirically observed and theoretically motivated deviations from simple, predictable scaling laws that describe how certain physical, statistical, or computational quantities depend on system size, parameters, or other relevant metrics. These behaviors manifest as non-monotonic, non-power-law, non-universal, or even singular dependencies, challenging the classical expectation of regular, universal scaling and undermining the extrapolative power of classical scaling techniques in a wide variety of contexts in physics, mathematics, materials science, information theory, and machine learning.

1. Foundational Concepts and Manifestations of Irregular Scaling

Scaling laws characterize how key observables change as some system attribute (e.g., size, time, energy, or model capacity) is varied. Regular scaling behaviors—where a quantity $Y$ depends on some parameter $P$ via a simple law such as $Y \propto P^{\beta}$ —are ubiquitous in critical phenomena, urban science, and complexity theory. Irregular scaling arises when this dependence is modified by system-specific features: the scaling exponent becomes non-universal or varies continuously; corrections become dominant; the scaling is non-analytic or even singular; or the metric for scaling itself becomes ambiguous due to system heterogeneity.

Such irregularities are documented in:

Systems displaying essential singularities or scaling functions parameterized by the initial condition tail exponents (Chen et al., 2020);
Downstream performance in LLMs, showing emergence, inverse, or nonmonotonic scaling (Lourie et al., 1 Jul 2025);
Disordered or aperiodic systems, where multifractal spectra and nonmonotonic crossovers emerge in scaling observables (Jagannathan et al., 2018);
Non-Euclidean (e.g., hyperbolic) spaces, where classical analytic scaling theorems fail, and leading-order expectations are replaced or suppressed (Huang et al., 2024);
Systems subject to size-dependent or marginal perturbations, producing amplitude-modulated scaling without modifying the leading exponents (Turban, 2023).

Irregular scaling behaviors thus reflect and reveal the failure points of universality, the breakdown of equilibrium assumptions, or the critical impact of model-specific symmetry, boundary, disorder, or coupling properties.

2. Theoretical Origins and Model Systems

a) Criticality and Coalescence Processes

Irregular scaling often emerges near critical manifolds. The coalescence equation studied in (Chen et al., 2020) presents a family of scaling functions $F(\xi)$ satisfying

$f(x, t) = \frac{1}{(t + t_0)^2} F\left(\frac{x}{t + t_0}\right)$

with

$\xi F'(\xi) + 2 F(\xi) + F'(\xi) + \frac{1}{2}\int_0^\xi F(y) F(\xi - y) dy = 0.$

The large- $\xi$ tail of $F(\xi)$ depends on an initial-condition-sensitive exponent, and the order parameter’s singularity is of essential type:

$\mathcal{F}_{\infty} \sim \exp(-\Delta^{-1/(α-2)+o(1)})$

This is non-analytic and “irregular,” as the same system with different initial tail exponent $\alpha$ yields different singular behaviors—a clear departure from standard scaling.

b) Pattern Formation with Continuous Symmetry

Spatiotemporal chaotic systems with continuous symmetry (e.g., the Nikolaevskiy model (Wittenberg et al., 2010)) can display “anomalous scaling” in the amplitude and correlation times of long-wave (Goldstone) modes. For example, while multiple-scale analysis might predict a scaling $\langle|U_0|\rangle \sim r$ for the large-scale mode amplitude, simulations reveal $\langle|U_0|\rangle \sim r^{7/8}$ . Inclusion of higher-order corrections to amplitude equations is required to restore agreement with observations, and the anomalous divergence of correlation times (scaling as $\varepsilon^{-\delta}$ , $\delta\sim 4/3$ ) signals that leading-order reductions may overlook asymptotically dominant, next-order effects.

c) Disordered Systems and Nonmonotonic Crossover

Disordered 1D quasicrystals display nonmonotonic (“re-entrant”) scaling in the inverse participation ratio (IPR): for certain critical states, the IPR first decreases (delocalization) before increasing with disorder (localization), as shown in (Jagannathan et al., 2018). This is traced analytically to perturbative level repulsion and is strongly determined by renormalization-group path dependence, indicating the essential role of hierarchical or aperiodic underlying structure in irregular scaling.

3. Dependence on System-Specific Metrics and Non-Universality

A hallmark of irregular scaling is the breakdown of universality: not only are exponents non-universal, but even the appropriate scaling variables differ.

Graph Neural Networks: In graph learning, the traditional count of graphs is replaced by total edge number, as the data volume for scaling is more faithfully measured by edge count due to high variability in graph sizes (Liu et al., 2024). The empirically observed scaling law,

$s = -a (X + c)^b + s_\infty,$

only captures performance accurately when $X$ is the total number of edges. Additionally, the model depth (number of GNN layers) significantly influences the scaling law, in sharp contrast to regular (and deep) neural scaling in computer vision and NLP, revealing a new axis of irregularity.

Nanocrystalline Plasticity: Scaling exponents for avalanche distributions (e.g., $\tau$ in $p(E) \sim E^{-\tau} e^{-E/E_c}$ ) vary non-universally with quenched disorder and system size (Zhang et al., 2020). The exponents change abruptly across a brittle-to-ductile transition, and whether the disorder is elastically compatible or incompatible determines if irregular (non-universal) scaling persists.
Radial Basis Functions (RBFs): The optimal scaling parameter for RBF interpolation is not universally determined by the data: for analytic kernels, only band-limited functions retain scalability across all $\varepsilon$ (Larsson et al., 2022). In general, the error can have a complex or even indeterminate dependence on the scale parameter, as the flat limit may or may not be optimal depending on both the function and the data.

4. Off-Equilibrium and Dynamical Irregular Scaling

In systems driven across a phase transition by slow, time-dependent external parameters, off-equilibrium scaling behavior can be markedly irregular:

First-Order and Continuous Transitions: In both first-order (Potts model) (Panagopoulos et al., 2015) and continuous (O(N) vector model) (Pelissetto et al., 2015) transitions, scaling observables (renormalized magnetization, energy density, or hysteresis area) obey scaling forms with non-analytic dependence on the driving timescale, system size, and geometry-boundary exponents. These forms introduce multi-variable scaling functions,

$m_r(t, t_s, L) \simeq f_m(u, w),\quad u = t_s^{\kappa}/L,\quad w = t/t_s^{\kappa_t},$

whose irregularity reflects the interface dynamics, boundary-induced criticality, and system history. This framework generalizes the Kibble-Zurek mechanism and reveals that scaling may only emerge under certain (mixed boundary) conditions rather than universally.

5. Non-Predictability and Meta-Analysis in Machine Learning

Extensive meta-analysis in downstream scaling of large pre-trained models reveals that only a minority of tasks (39%) show regular, linear scaling under monotonic transformation of pretraining loss (Lourie et al., 1 Jul 2025). In contrast, the majority of tasks evidence:

Emergence: Abrupt appearance of capability above a scale threshold;
Inverse scaling: Performance degradation with increased model scale;
Sensitivity to experimental setup: Different validation data or seemingly minor implementation choices can qualitatively reverse scaling trends.

The commonly used functional model,

$y = a \cdot \exp\{c x\} + b,$

only fits within local contexts and fails under presence of emergent or inverse scaling, indicating that the relationship between size and performance can undergo structural breaks and cannot be generically extrapolated.

6. Mathematical and Physical Consequences

Irregular scaling phenomena necessitate:

Refinement of scaling theory, including generalizations of finite-size scaling in the presence of size-dependent perturbations (Turban, 2023), where the leading exponent may remain the same but amplitudes become universal functions of the perturbation strength, or essential singularities appear.
Reformulation of scaling observables to use system-sensitive variables (e.g., edge count for graphs, system-specific exponents and corrections in finite-size scaling, or local exponents in urban scaling studies (Barthelemy, 2019)).
Recognition of limitations in data-sufficiency for kernel methods: for kernels with unlimited smoothness, the interpolation data alone may be insufficient to determine error-optimal scales (Larsson et al., 2022).

7. Implications, Applications, and Future Directions

Recognizing and characterizing irregular scaling is fundamental for reliable modeling and prediction in:

Statistical mechanics and phase transitions: Designing protocols for driven transitions, understanding hysteresis, or analyzing critical phenomena with marginal or dangerous irrelevant variables.
Disordered and aperiodic materials: Predicting localization, robustness, and transport.
Communication theory: Designing LDPC codes and predicting error rates in the finite block-length regime with irregular degree distributions (Nozaki et al., 2010).
Machine learning at scale: Developing reliable scaling strategies for neural architectures, graph models, and forecasting task transferability.
Urban science and complex systems: Quantitatively distinguishing threshold behavior, effective exponents, and scaling regimes.

Ongoing investigations aim to:

Develop analytic frameworks for scaling in non-Euclidean geometries (Huang et al., 2024);
Identify universality classes and minimal sufficient scaling variables;
Construct experimental diagnostics and regression tools (e.g., “tomography” scans of local exponents) to detect and interpret irregular scaling (Barthelemy, 2019).

In conclusion, irregular scaling behaviors highlight the inadequacy of naive extrapolation based on regular scaling laws, demanding careful, system-specific theoretical and empirical analysis to avoid erroneous predictions and to exploit the complex behaviors accessible only at intermediate or marginal parameter regimes. As such, they represent both a challenge and an opportunity in the modeling and analysis of complex systems across disciplines.