TRA-s: Topological Residual Asymmetry with Smoothing
- The paper introduces TRA-s, a robust method for inferring causal direction in additive-noise models using geometric persistent homology of regressor-residual clouds.
- It employs a binning strategy on reverse residuals to induce a one-dimensional signal, restoring identifiability when noise does not vanish.
- Extensive experiments validate its theoretical consistency, computational efficiency, and superior performance over traditional causal inference methods.
Topological Residual Asymmetry with Smoothing (TRA-s) defines a robust criterion for inferring bivariate causal direction in additive-noise models (ANMs) under fixed noise, leveraging geometric signatures obtained from the persistent homology of regressor-residual clouds. It addresses limitations of the original TRA method, which loses discriminatory power when the noise level is fixed and does not vanish. TRA-s employs a binning strategy that induces a one-dimensional geometric signal in the reverse residual direction, restoring identifiability. It is theoretically consistent, computationally practical, and empirically validated across diverse synthetic and real-world benchmarks (Bouchattaoui, 31 Jan 2026).
1. Motivation and Formal Definition
In the classical Topological Residual Asymmetry (TRA) method, causal direction is inferred by comparing the geometric structure of two residual clouds after copula standardization: the forward direction produces residuals that are approximately independent, yielding a 2D bulk, while the reverse direction under small noise collapses to a one-dimensional tube. The separation is quantified by a 0D persistent-homology functional computed from the Euclidean minimum spanning tree (MST) on the residual cloud, with the raw TRA score providing directionality.
However, under fixed additive noise, the reverse residuals do not collapse; their distribution remains thick, and the original TRA statistic loses its discrimination. TRA-s overcomes this by binning reverse residuals along the copula axis and averaging, converting the "thick" cloud into a binned average that again collapses to a 1D signature at mesoscopic scale, while forward residuals retain a 2D structure.
Formally, let be sampled from an additive-noise model with $\varepsilon_i \indep X_i$, . TRA-s proceeds by cross-fitting regressors in both directions, computing residuals, copula-standardizing, binning the reverse direction, and calculating the MST-based persistence profiles over fixed mesoscopic windows. The TRA-s score is then the difference in windowed TP-profiles between the forward and binned-reverse clouds. If , the method infers ; if , (Bouchattaoui, 31 Jan 2026).
2. Binning and Copula Standardization Procedure
TRA-s modifies the reverse residual cloud via a binning strategy following rank-copula standardization:
- For the reverse direction (), only the coordinate is rank-transformed: .
- is partitioned into equal-width bins: , .
- For each bin, empirical bin centers and mean residuals are computed over the points within the bin.
- The binned cloud is .
The bin number is chosen such that , , and , ensuring asymptotic validity without over-smoothing or excessive bias (Bouchattaoui, 31 Jan 2026).
3. Persistent Homology Functional and TRA-s Statistic
The core geometric statistic is a normalized windowed profile of edge-lengths in the MST of the residual cloud, specialized to a "soft window" that captures mesoscopic geometry:
$\overline{\TP_0^{[\alpha, \beta]}(\mathcal{R})} = \frac{1}{(M-1)(\beta-\alpha)} \sum_{e \in \operatorname{MST}(\mathcal{R})} \Psi_{\alpha, \beta}(\|e\|),$
with
- For the forward () copula cloud, set , , and .
- For the binned-reverse cloud (), , , .
The TRA-s test statistic is then: $\widetilde{\Delta}_n = \overline{\TP_0^{[\alpha_n, \beta_n]}(\widetilde{\mathcal R}^{(n)}_{Y\mid X})} - \overline{\TP_0^{[\widetilde\alpha_n, \widetilde\beta_n]}(\widehat{\mathcal R}^{(n)}_{X\mid Y})}.$ The direction is assigned according to the sign of (Bouchattaoui, 31 Jan 2026).
4. Theoretical Guarantees and Assumptions
Under the following conditions (cf. (Bouchattaoui, 31 Jan 2026), Assumptions 3.5–3.7):
- Forward model: , branchwise invertible, , noise sub-Gaussian.
- Reverse conditional mean continuous.
- Regression errors: cross-fitted regression risk and .
- Binning: , , .
- Reverse fluctuations: .
Theorem 3.2 (Bouchattaoui, 31 Jan 2026) establishes that, as ,
$\overline{\TP_0^{[\alpha_n, \beta_n]}(\widetilde R_{Y|X})} \to 1, \qquad \overline{\TP_0^{[\widetilde\alpha_n, \widetilde\beta_n]}(\widehat R_{X|Y})} \to 0,\qquad \widetilde{\Delta}_n \to 1,$
in probability. Therefore, a rule that declares when is consistent with vanishing abstention probability (Bouchattaoui, 31 Jan 2026).
5. Algorithmic Procedure and Computational Considerations
The TRA-s method is implemented as follows:
- Cross-fit forward and reverse regressors using -fold splits, producing residuals and .
- Copula-standardize the forward cloud using both coordinates.
- Bin the reverse residuals along copula with bins, compute bin mean coordinates .
- Compute MSTs and mesoscopic persistence profiles in both directions.
- Form the TRA-s statistic and make a directional decision.
Computational complexity per samples is for ranking and MST computation (using Delaunay-based MST algorithms), for the binned reverse cloud, and regression complexity depends on the choice of estimator (e.g., smoothing splines).
Parameter choices for (recommended ) and mesoscopic scaling are robust; must neither be too small (under-smoothing) nor too large (over-smoothing), ensuring resolution of the 1D collapse in the reverse cloud.
6. Empirical Validation and Performance
Extensive experiments illustrate the efficacy and stability of TRA-s:
- On synthetic ANMs with various forms (cubic, near-linear, heteroscedastic, non-monotone), TRA-s achieves low directed risk and matches theoretical predictions, while most established baselines (RESIT, IGCI, RECI, CDCI, COMIC, RCC, NCC) exhibit regular failure modes under stress (Bouchattaoui, 31 Jan 2026).
- Under confounding, TRA-s combined with the abstention/correction procedure (TRA-C) abstains appropriately, whereas alternatives tend to commit to a potentially incorrect direction.
- On the Tübingen real-world benchmark, TRA-s delivers optimal coverage and second-lowest risk, with the abstaining TRA-C attaining highest decided accuracy and minimal overall risk.
- Ablation studies confirm that smoothing of the reverse direction is necessary for robust separation under fixed noise, and performance is stable to tuning in the recommended and scaling parameter ranges.
7. Practical Usage and Limitations
TRA-s is applicable to bivariate causal inference under additive-noise settings, robust to moderate regression error, and computationally tractable for moderate to large given efficient MST computation. Binning is central: it restores the geometric signature required for correct causal inference when raw residual dispersion precludes a 1D collapse. The approach is grounded in persistent-homology theory, but performance may degrade for non-ANM or heavily confounded distributions (necessitating abstention via TRA-C).
A plausible implication is that TRA-s, by explicit geometric regularization of the residual cloud, defines a broadly applicable, nonparametric, and theoretically rigorous template for distribution-based causal inference that is robust to non-vanishing noise scales, provided the core bivariate ANM structure holds (Bouchattaoui, 31 Jan 2026).