CL-ETAS Model Extensions
- CL-ETAS Model is a generalized extension of the classical ETAS framework that incorporates innovative triggering mechanisms like clock-advance and regime-switching to enhance seismic forecasting.
- It replaces traditional magnitude cutoffs with productivity moments and Lévy jump processes, enabling refined analysis of microseismicity and infinite-generation aftershock sequences.
- Hybrid models that integrate deep learning with physical ETAS structures significantly improve forecast accuracy by capturing non-Poisson clustering and complex memory effects.
The term CL-ETAS refers to several distinct, advanced formulations of the Epidemic-Type Aftershock Sequence (ETAS) model for earthquake clustering, each extending the classical ETAS framework in fundamental ways to enhance the modeling of seismicity. These extensions address issues such as the incorporation of microseismicity without magnitude cutoffs, the empirical two-regime memory in aftershock sequences, catalog-level non-Poissonian clustering, and hybridization with deep learning architectures. The principal variations, as detailed in recent literature, include (1) the Clock-Advance ETAS (CL-ETAS) model, (2) the two-regime memory CL-ETAS/ETAS2, (3) the Catalog-Level ETAS (CL-ETAS) with non-Poisson clustering, and (4) the ConvLSTM–ETAS hybrid (CL-ETAS). Each targets distinct theoretical or operational shortcomings in the standard ETAS approach.
1. Core Principles and Theoretical Modifications
The CL-ETAS family generalizes the ETAS paradigm by reinterpreting event triggering and branching mechanisms. A central innovation is the shift from direct additive rate modification to either time-changed, infinite-generation Markovian superpositions (clock-advance) or to catalog-level, non-Poissonian resampling procedures (catalog-level clustering). In certain formulations, the model replaces the classical magnitude process with productivity moments, introducing Lévy jump processes to address infinite-branching rates at low magnitudes, with convergence controlled by the mean productivity rather than ad-hoc cutoffs (Holschneider, 2 Jan 2025).
Alternate formulations introduce regime-switching or random aftershock compression factors to more faithfully reproduce observed short- and long-term memory exponents and the high-variance clustering signatures that characterize real earthquake sequences (Zhang et al., 2020, Baughman et al., 2023). Hybridizations with deep spatiotemporal neural networks further integrate data-driven pattern extraction with physical rate models (Zhang et al., 2023).
2. Mathematical Structure and Conditional Intensity
a) Clock-Advance ETAS (Holschneider, 2 Jan 2025)
The conditional rate function is reparameterized to describe a process on productivity moment , with Lévy jump density: assuming for finite mean.
The conditional intensity (hazard) becomes: where is the Omori–Utsu kernel.
Triggered events correspond to copies of the background Lévy process, but run on accelerated "clocks" determined by the moment and Omori integral of each ancestor event:
Infinite generations are constructed by Markovian recursion and summed for the total process. No hard lower magnitude cutoff is applied; the convergence of the infinite sum is controlled by the mean productivity .
b) Two-Regime Memory CL-ETAS/ETAS2 (Zhang et al., 2020)
Here, the conditional intensity includes a dual-regime aftershock productivity: with
where is the crossover count, and spatial and temporal kernels follow established ETAS forms.
c) Catalog-level Non-Poisson Clustering (Baughman et al., 2023)
The triggering kernel is modulated by a random cluster compression factor , where . Aftershock clusters are compressed in time post hoc, producing observed clusters that deviate from the standard Poisson hypothesis while preserving productivity and magnitude–frequency scaling: with , .
d) ConvLSTM–ETAS Hybridization (Zhang et al., 2023)
The architecture fuses ETAS one-step-ahead forecasts with observed counts as separate input channels to a 4-layer ConvLSTM with Conv3D readout, mapping sequences of magnitude-space binned tensors to next-day forecasts. The overall prediction can be characterized as: where is the trained network, implicitly discovering optimal combinations of physical ETAS structure and data-driven spatiotemporal corrections.
3. Simulation Algorithms and Computational Methods
a) Markov Chain Superposition (Holschneider, 2 Jan 2025)
The core simulation proceeds generation-wise, each iteration sampling an independent realization of the background jump process, which is then time-warped according to the cumulative moment and Omori kernel of prior generations:
summed to convergence, controlling for subcriticality (). Each realization does not require partitioning by magnitude, but rather aggregates over the productivity moment variable directly.
b) Catalog Resampling Algorithm (Baughman et al., 2023)
Simulation proceeds in two phases:
- Bare ETAS Catalog: Generate using standard maximum-likelihood–estimated branching and background parameters, in time, space, and magnitude.
- Cluster Compression: For each mainshock "cycle," sample a cluster compression factor (from ), then compress all descendant event times within the cycle: .
A high-level pseudocode appears in the literature, providing explicit workflow for catalog generation and cluster modification. Calibration of the compression parameter is performed such that Omori-law decay in the simulated catalog matches real observed -values.
c) Deep Learning Fusion (Zhang et al., 2023)
The ConvLSTM–ETAS hybrid is trained to minimize mean squared error over daily binned event tensors, with the network ingesting both raw event count tensors and one-day–ahead ETAS forecast maps. The network comprises four ConvLSTM2D layers (each with 40 filters), with a Conv3D output layer, optimized using Adam on batches of index-windowed time sequences.
Roll-forward simulation for operational forecasting involves iterative ETAS reparameterization, generation of forecast tensors, and ConvLSTM prediction updates in a moving time window.
4. Empirical Validation and Performance Characteristics
a) Clock-Advance ETAS
The convergence of the Markov sum—and therefore the model's well-posedness—is governed solely by the mean productivity . Arbitrary inclusion of microseismicity is permitted, with the total process never exploding provided . Simulations admit analytical tractability for higher moments and facilitate modular, generation-wise parallelization (Holschneider, 2 Jan 2025).
b) Memory and Forecasting Improvements
The two-regime CL-ETAS reproduces empirical scaling crossovers in conditional CDF memory measures, notably: with observed exponents , , for Italy, closely matching real data and outperforming the standard ETAS's single-regime scaling (Zhang et al., 2020). In forecasting applications (e.g., after the L’Aquila Mw6.3 mainshock), the model consistently contains observed event counts within model confidence bands, an outcome sporadic for the classical ETAS.
c) Information-Theoretic Nowcasting (Baughman et al., 2023)
Catalog-level CL-ETAS yields enhanced Predictive Positive Value (PPV) and Shannon information content (up to 0.88 bits for in synthetic catalogs), with gains aligning with observed California catalogs (MaxPG , MaxIG bits). Declustering, which removes aftershock clusters, erases the surplus information carried by non-Poissonian temporal clustering, indicating that maximal nowcasting skill requires retaining full clustered structure.
d) Hybrid Deep Learning (Zhang et al., 2023)
The hybrid CL-ETAS produces significant quantitative improvements—$25$– gain in pseudo-log-likelihood and reduction in event count RMSE relative to stand-alone ETAS or ConvLSTM—across three mainshock aftershock sequences and multiple performance metrics (N/M/S/PL tests). CL-ETAS forecasts are both more stable (narrower CI) and more physically interpretable due to the explicit physical/dynamical-ETAS input.
5. Implementation, Stability, and Inference
The modular construction—especially in the clock-advance and catalog-level models—separates independent increments (background process) from deterministic or random temporal/spatial modifications, supporting statistical inference techniques such as Poisson/Lévy-based likelihood maximization and moment-matching estimation for kernel parameters. The absence of a magnitude cutoff and applicability to microseismicity enhances analytical tractability for stability and long-time rate calculations.
Deep learning hybridizations require local retraining and regional ETAS re-parameterization, with operational usage favoring periodic re-calibration of ETAS parameters and rolling window network updates. Simulation for both "pure" and hybrid models is highly parallelizable due to independence across background or generation layers, or across unconnected spatial cells in the ConvLSTM case.
6. Operational and Research Implications
The CL-ETAS framework, in its various instantiations, enables more faithful modeling of high-density microseismic catalogs, more accurate and stable operational aftershock forecasts, enhanced interpretability (especially in fused deep learning systems), and the rigorous retention of information-theoretic content essential for nowcasting. Removal of aftershock clusters (declustering) is contraindicated when maximizing predictive content (Baughman et al., 2023).
A plausible implication is that future ETAS-based systems will increasingly adopt either time-change/Lévy-process structures or hybrid data-driven/physical models to capture non-Poissonian clustering, scale breaks in memory exponents, and microseismicity, leveraging these innovations for enhanced hazard quantification and ensemble-based operational forecasting.