Papers
Topics
Authors
Recent
Search
2000 character limit reached

Critical-Tensor Identification in ViTaX

Updated 27 April 2026
  • Critical-tensor identification is a formal XAI-based feature selection technique that isolates minimal input feature subsets to guarantee targeted robustness in neural networks.
  • The method utilizes gradient-based heuristic ranking and binary search to efficiently determine the smallest set of features satisfying targeted ε-robustness.
  • Empirical results on MNIST and GTSRB highlight its interpretability and formal verification, enhancing safety in high-stakes applications.

Critical-tensor identification is a formal XAI-based feature selection technique introduced in the ViTaX (Verified and Targeted Explanations) framework. It targets the identification of minimal input feature subsets (critical tensors) that are most sensitive to class transitions within deep neural networks, and provides mathematical guarantees about the model’s resilience to perturbations directed toward user-specified alternative classes. In contrast to traditional attribution methods, which lack formal robustness assurances, and generic formal methods, which are untargeted, this approach enables verifiable, targeted, and succinct explanations for high-stakes, safety-critical applications (Wang et al., 5 Apr 2026).

1. Definition of Critical Tensor

In the ViTaX setting, given an input tensor x∈Rw×h×cx\in\mathbb{R}^{w\times h\times c} and a neural classifier ff, a critical tensor A⊆{1,…,n}A\subseteq\{1,\ldots,n\} (with n=w⋅h⋅cn=w\cdot h\cdot c) is defined as the smallest set of input feature indices such that perturbing xAx_A by no more than ε\varepsilon (in a chosen ℓp\ell_p norm) cannot flip the classifier's output from a true class yy to a specified critical alternative class tt. Formally, PRESERVED_PLACEHOLDER_4^ denotes the restriction of the input to indices in ff0. The identification of a critical tensor provides a succinct semifactual explanation: even if the features indexed by ff1 are perturbed maximally within ff2, the classifier remains robust to the transition from ff3 to ff4.

This concept is particularly relevant in domains where different misclassifications pose differing levels of risk. For example, in traffic sign recognition, confusing a "Stop" sign with a "60 kph" sign is substantially more detrimental than with a "No Passing" sign. Critical tensors focus the explanation on the most consequential potential failure modes (Wang et al., 5 Apr 2026).

2. Targeted ff5-Robustness Specification

Standard robustness verification assesses whether no class logit interval can overlap with the true class under arbitrary ff6-bounded input perturbations. In contrast, ViTaX formulates Targeted ff7-Robustness with respect to a specific alternative class ff8, formalized as follows:

Let ff4^ be a neural network, A⊆{1,…,n}A\subseteq\{1,\ldots,n\}0 the input, A⊆{1,…,n}A\subseteq\{1,\ldots,n\}1 the true class, and A⊆{1,…,n}A\subseteq\{1,\ldots,n\}2 the user-specified alternative. Consider perturbing features in A⊆{1,…,n}A\subseteq\{1,\ldots,n\}3 such that A⊆{1,…,n}A\subseteq\{1,\ldots,n\}4 while A⊆{1,…,n}A\subseteq\{1,\ldots,n\}5. Let A⊆{1,…,n}A\subseteq\{1,\ldots,n\}6 denote all such perturbed inputs, with A⊆{1,…,n}A\subseteq\{1,\ldots,n\}7 the full index set.

A feature subset A⊆{1,…,n}A\subseteq\{1,\ldots,n\}8 is said to satisfy T-ROB (Targeted Robustness) if the following holds:

A⊆{1,…,n}A\subseteq\{1,\ldots,n\}4^

where n=wâ‹…hâ‹…cn=w\cdot h\cdot c0 are the interval bounds on output logit n=wâ‹…hâ‹…cn=w\cdot h\cdot c1 under the set n=wâ‹…hâ‹…cn=w\cdot h\cdot c2. If T-ROB holds, the class transition n=wâ‹…hâ‹…cn=w\cdot h\cdot c3 is formally impossible under n=wâ‹…hâ‹…cn=w\cdot h\cdot c4-bounded perturbations to features in n=wâ‹…hâ‹…cn=w\cdot h\cdot c5 (Wang et al., 5 Apr 2026).

3. Algorithm for Critical Tensor Identification

ViTaX operationalizes critical-tensor selection via an efficient algorithm, leveraging feature-sensitivity heuristics and binary search. The principle steps are:

  1. Heuristic Ranking (n=wâ‹…hâ‹…cn=w\cdot h\cdot c6): Compute n=wâ‹…hâ‹…cn=w\cdot h\cdot c7 by ranking features according to the gradient-based sensitivity to the target class n=wâ‹…hâ‹…cn=w\cdot h\cdot c8, i.e., using n=wâ‹…hâ‹…cn=w\cdot h\cdot c4^ for each feature xAx_A0.
  2. Binary Search for Minimality: Iteratively select increasing prefixes of the ranked features and verify T-ROB via a formal reachability solver xAx_A1. The candidate set xAx_A2 is the smallest prefix satisfying T-ROB.
  3. Formal Verification: At each binary search iteration, call the solver xAx_A3 to check whether, for the selected xAx_A4, the network is robust to the xAx_A5 transition under the specified perturbation.

The algorithm returns a provably minimal (under ranking xAx_A6) critical tensor xAx_A7. The complexity of the process is logarithmic in the number of features due to the binary search strategy.

ViTaX Pseudocode (Algorithm 1):

yy0 The reachability solver xAx_A8 (e.g., Star-based reachability, MILP) certifies the required specification over infinite input sets xAx_A4^ (Wang et al., 5 Apr 2026).

4. Formal Guarantees and Properties

ViTaX's critical-tensor identification is underpinned by theoretical guarantees assuming the reachability solver is sound and complete and the ranking is sensitivity-optimal:

  • Soundness: If ε\varepsilon0 certifies T-ROB for ε\varepsilon1, then ε\varepsilon2 truly holds for all ε\varepsilon3, meaning the ε\varepsilon4 transition is impossible under such perturbations.
  • Maximality (w.r.t. ε\varepsilon5): ε\varepsilon6 is the largest (minimal in cardinality) prefix of ε\varepsilon7 such that the T-ROB criterion holds; supersets of ε\varepsilon8 (w.r.t. the ranking) will violate T-ROB.
  • Algorithmic Complexity: ε\varepsilon4^ calls to the reachability solver due to binary search.

These properties ensure that the returned critical tensor â„“p\ell_p0 is both succinct and verifiably robust with respect to the â„“p\ell_p1 transition (Wang et al., 5 Apr 2026).

5. Empirical Examples and Size–Fidelity Trade-Off

ViTaX demonstrates practical critical-tensor identification on several datasets:

MNIST Example:

  • Input: "4" (â„“p\ell_p2); Target: "4" (â„“p\ell_p3).
  • Ranking identifies pixels critical for the "4→4" decision (e.g., pixels in the upper-loop of "4").
  • Binary search yields a critical tensor â„“p\ell_p4 of 5 pixels. Larger â„“p\ell_p5 subsets increase robustness but decrease interpretability, defining a size–fidelity trade-off.
  • Practitioners can explore â„“p\ell_p6 vs. â„“p\ell_p7 curves to balance explanation succinctness against guarantee strength.

GTSRB Example:

  • Input: "Stop" sign (â„“p\ell_p8); Target: "60 kph" (â„“p\ell_p4).
  • ViTaX selects 30 interior pixels (out of 3072) critical for the "Stop→60 kph" misclassification, with higher fidelity and lower cardinality than the 472-pixel baseline from VeriX.

This suggests that ViTaX explanations are both more interpretable and more faithful to the underlying risk structure than previous methods (Wang et al., 5 Apr 2026).

6. Scope of Applicability and Generalizations

ViTaX and its critical-tensor approach are network-architecture and modality agnostic, supporting:

  • Architectures: MLPs, CNNs, ResNets, Inception, Transformers, contingent upon an appropriate reachability backend (e.g., Approx-Star, CP-Star, MILP).
  • Input Modalities: Images, time series, textual embeddings, audio spectrograms, and graph features, with critical tensor selection performed on the flattened feature set.
  • Beyond Classification: Adaptation to regression, ranked retrieval, or multi-label tasks by tuning the T-ROB specification to ensure robust margin (e.g., minimal lower-bound difference between outputs).
  • Higher-Order Explanations: Chaining calls to the algorithm with varying target classes or perturbation budgets allows coverage of trade-offs and Pareto-optimal robustness sets.

A plausible implication is that critical-tensor identification via ViTaX serves as a general tool for providing semifactual, formally verified, high-fidelity explanations across a spectrum of ML applications, particularly where targeted safety guarantees are required (Wang et al., 5 Apr 2026).

Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Critical-Tensor Identification (XAI-Based Selection).