Papers
Topics
Authors
Recent
Search
2000 character limit reached

TRA-s: Topological Residual Asymmetry with Smoothing

Updated 7 February 2026
  • The paper introduces TRA-s, a robust method for inferring causal direction in additive-noise models using geometric persistent homology of regressor-residual clouds.
  • It employs a binning strategy on reverse residuals to induce a one-dimensional signal, restoring identifiability when noise does not vanish.
  • Extensive experiments validate its theoretical consistency, computational efficiency, and superior performance over traditional causal inference methods.

Topological Residual Asymmetry with Smoothing (TRA-s) defines a robust criterion for inferring bivariate causal direction in additive-noise models (ANMs) under fixed noise, leveraging geometric signatures obtained from the persistent homology of regressor-residual clouds. It addresses limitations of the original TRA method, which loses discriminatory power when the noise level is fixed and does not vanish. TRA-s employs a binning strategy that induces a one-dimensional geometric signal in the reverse residual direction, restoring identifiability. It is theoretically consistent, computationally practical, and empirically validated across diverse synthetic and real-world benchmarks (Bouchattaoui, 31 Jan 2026).

1. Motivation and Formal Definition

In the classical Topological Residual Asymmetry (TRA) method, causal direction is inferred by comparing the geometric structure of two residual clouds after copula standardization: the forward direction XYX \rightarrow Y produces residuals that are approximately independent, yielding a 2D bulk, while the reverse direction YXY \rightarrow X under small noise collapses to a one-dimensional tube. The separation is quantified by a 0D persistent-homology functional computed from the Euclidean minimum spanning tree (MST) on the residual cloud, with the raw TRA score Δn\Delta_n providing directionality.

However, under fixed additive noise, the reverse residuals do not collapse; their distribution remains thick, and the original TRA statistic loses its discrimination. TRA-s overcomes this by binning reverse residuals along the YY copula axis and averaging, converting the "thick" cloud into a binned average that again collapses to a 1D signature at mesoscopic scale, while forward residuals retain a 2D structure.

Formally, let {(Xi,Yi)}i=1n\{(X_i, Y_i)\}_{i=1}^n be sampled from an additive-noise model Yi=f(Xi)+εiY_i = f(X_i) + \varepsilon_i with $\varepsilon_i \indep X_i$, Var(ε)=σ2>0\operatorname{Var}(\varepsilon) = \sigma^2 > 0. TRA-s proceeds by cross-fitting regressors in both directions, computing residuals, copula-standardizing, binning the reverse direction, and calculating the MST-based persistence profiles over fixed mesoscopic windows. The TRA-s score Δ~n\widetilde{\Delta}_n is then the difference in windowed TP-profiles between the forward and binned-reverse clouds. If Δ~n>0\widetilde{\Delta}_n > 0, the method infers XYX \to Y; if Δ~n<0\widetilde{\Delta}_n < 0, YXY \to X (Bouchattaoui, 31 Jan 2026).

2. Binning and Copula Standardization Procedure

TRA-s modifies the reverse residual cloud via a binning strategy following rank-copula standardization:

  • For the reverse direction (XYX | Y), only the YY coordinate is rank-transformed: Ui=rank(Yi)n+1U_i = \frac{\mathrm{rank}(Y_i)}{n+1}.
  • [0,1][0,1] is partitioned into BnB_n equal-width bins: In,b=(b1Bn,bBn]I_{n, b} = (\frac{b-1}{B_n}, \frac{b}{B_n}], b=1,,Bnb = 1, \dotsc, B_n.
  • For each bin, empirical bin centers uˉb\bar{u}_b and mean residuals rˉb\bar{r}_b are computed over the points within the bin.
  • The binned cloud is R^XY(n)={(uˉb,rˉb):b=1,,Bn}[0,1]×R\widehat{\mathcal R}^{(n)}_{X\mid Y} = \{(\bar{u}_b, \bar{r}_b): b = 1, \dots, B_n\} \subset [0,1] \times \mathbb{R}.

The bin number BnB_n is chosen such that BnB_n \to \infty, Bn=o(n)B_n = o(n), and Bn7/3logBn/n0B_n^{7/3} \log B_n / n \to 0, ensuring asymptotic validity without over-smoothing or excessive bias (Bouchattaoui, 31 Jan 2026).

3. Persistent Homology Functional and TRA-s Statistic

The core geometric statistic is a normalized windowed profile of edge-lengths in the MST of the residual cloud, specialized to a "soft window" [α,β][\alpha, \beta] that captures mesoscopic geometry:

$\overline{\TP_0^{[\alpha, \beta]}(\mathcal{R})} = \frac{1}{(M-1)(\beta-\alpha)} \sum_{e \in \operatorname{MST}(\mathcal{R})} \Psi_{\alpha, \beta}(\|e\|),$

with

Ψα,β(t)=(min{t,β}α)+={0,tα, tα,α<t<β, βα,tβ.\Psi_{\alpha, \beta}(t) = (\min\{t, \beta\} - \alpha)_+ = \begin{cases} 0, & t \le \alpha, \ t - \alpha, & \alpha < t < \beta, \ \beta - \alpha, & t \ge \beta. \end{cases}

  • For the forward (YXY \mid X) copula cloud, set M=nM = n, αn=κn2/3\alpha_n = \kappa n^{-2/3}, and βn=cβαn\beta_n = c_\beta \alpha_n.
  • For the binned-reverse cloud (XYX \mid Y), M=BnM = B_n, α~n=κBn2/3\widetilde\alpha_n = \kappa B_n^{-2/3}, β~n=cβα~n\widetilde\beta_n = c_\beta \widetilde\alpha_n.

The TRA-s test statistic is then: $\widetilde{\Delta}_n = \overline{\TP_0^{[\alpha_n, \beta_n]}(\widetilde{\mathcal R}^{(n)}_{Y\mid X})} - \overline{\TP_0^{[\widetilde\alpha_n, \widetilde\beta_n]}(\widehat{\mathcal R}^{(n)}_{X\mid Y})}.$ The direction is assigned according to the sign of Δ~n\widetilde{\Delta}_n (Bouchattaoui, 31 Jan 2026).

4. Theoretical Guarantees and Assumptions

Under the following conditions (cf. (Bouchattaoui, 31 Jan 2026), Assumptions 3.5–3.7):

  • Forward model: fC1f \in C^1, branchwise invertible, f[cf,Cf]|f'| \in [c_f, C_f], noise sub-Gaussian.
  • Reverse conditional mean m(y)=E[XY=y]m(y) = \mathbb{E}[X \mid Y = y] continuous.
  • Regression errors: cross-fitted regression risk o(1)o(1) and maxig^(Yi)m(Yi)=o(αn)\max_i |\widehat{g}(Y_i) - m(Y_i)| = o(\alpha_n).
  • Binning: BnB_n \to \infty, Bn=o(n)B_n = o(n), Bn7/3logBn/n0B_n^{7/3} \log B_n / n \to 0.
  • Reverse fluctuations: Xm(Y)ψ2Y=yK0σ\|X - m(Y)\|_{\psi_2 | Y = y} \le K_0 \sigma.

Theorem 3.2 (Bouchattaoui, 31 Jan 2026) establishes that, as nn \to \infty,

$\overline{\TP_0^{[\alpha_n, \beta_n]}(\widetilde R_{Y|X})} \to 1, \qquad \overline{\TP_0^{[\widetilde\alpha_n, \widetilde\beta_n]}(\widehat R_{X|Y})} \to 0,\qquad \widetilde{\Delta}_n \to 1,$

in probability. Therefore, a rule that declares XYX \to Y when Δ~n>τn0\widetilde\Delta_n > \tau_n \downarrow 0 is consistent with vanishing abstention probability (Bouchattaoui, 31 Jan 2026).

5. Algorithmic Procedure and Computational Considerations

The TRA-s method is implemented as follows:

  1. Cross-fit forward and reverse regressors using KK-fold splits, producing residuals ri(YX)r_i^{(Y|X)} and ri(XY)r_i^{(X|Y)}.
  2. Copula-standardize the forward cloud using both (X,r(YX))(X, r^{(Y|X)}) coordinates.
  3. Bin the reverse residuals along YY copula with BnB_n bins, compute bin mean coordinates (uˉb,rˉb)(\bar{u}_b, \bar{r}_b).
  4. Compute MSTs and mesoscopic persistence profiles in both directions.
  5. Form the TRA-s statistic and make a directional decision.

Computational complexity per nn samples is O(nlogn)O(n \log n) for ranking and MST computation (using Delaunay-based MST algorithms), O(BnlogBn)O(B_n \log B_n) for the binned reverse cloud, and regression complexity depends on the choice of estimator (e.g., smoothing splines).

Parameter choices for BnB_n (recommended n0.4n^{0.4}) and mesoscopic scaling (κ,cβ)(\kappa, c_\beta) are robust; BnB_n must neither be too small (under-smoothing) nor too large (over-smoothing), ensuring resolution of the 1D collapse in the reverse cloud.

6. Empirical Validation and Performance

Extensive experiments illustrate the efficacy and stability of TRA-s:

  • On synthetic ANMs with various forms (cubic, near-linear, heteroscedastic, non-monotone), TRA-s achieves low directed risk and matches theoretical predictions, while most established baselines (RESIT, IGCI, RECI, CDCI, COMIC, RCC, NCC) exhibit regular failure modes under stress (Bouchattaoui, 31 Jan 2026).
  • Under confounding, TRA-s combined with the abstention/correction procedure (TRA-C) abstains appropriately, whereas alternatives tend to commit to a potentially incorrect direction.
  • On the Tübingen real-world benchmark, TRA-s delivers optimal coverage and second-lowest risk, with the abstaining TRA-C attaining highest decided accuracy and minimal overall risk.
  • Ablation studies confirm that smoothing of the reverse direction is necessary for robust separation under fixed noise, and performance is stable to tuning in the recommended BnB_n and scaling parameter ranges.

7. Practical Usage and Limitations

TRA-s is applicable to bivariate causal inference under additive-noise settings, robust to moderate regression error, and computationally tractable for moderate to large nn given efficient MST computation. Binning is central: it restores the geometric signature required for correct causal inference when raw residual dispersion precludes a 1D collapse. The approach is grounded in persistent-homology theory, but performance may degrade for non-ANM or heavily confounded distributions (necessitating abstention via TRA-C).

A plausible implication is that TRA-s, by explicit geometric regularization of the residual cloud, defines a broadly applicable, nonparametric, and theoretically rigorous template for distribution-based causal inference that is robust to non-vanishing noise scales, provided the core bivariate ANM structure holds (Bouchattaoui, 31 Jan 2026).

Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to TRA-s.