Critical-Tensor Identification in ViTaX
- Critical-tensor identification is a formal XAI-based feature selection technique that isolates minimal input feature subsets to guarantee targeted robustness in neural networks.
- The method utilizes gradient-based heuristic ranking and binary search to efficiently determine the smallest set of features satisfying targeted ε-robustness.
- Empirical results on MNIST and GTSRB highlight its interpretability and formal verification, enhancing safety in high-stakes applications.
Critical-tensor identification is a formal XAI-based feature selection technique introduced in the ViTaX (Verified and Targeted Explanations) framework. It targets the identification of minimal input feature subsets (critical tensors) that are most sensitive to class transitions within deep neural networks, and provides mathematical guarantees about the model’s resilience to perturbations directed toward user-specified alternative classes. In contrast to traditional attribution methods, which lack formal robustness assurances, and generic formal methods, which are untargeted, this approach enables verifiable, targeted, and succinct explanations for high-stakes, safety-critical applications (Wang et al., 5 Apr 2026).
1. Definition of Critical Tensor
In the ViTaX setting, given an input tensor and a neural classifier , a critical tensor (with ) is defined as the smallest set of input feature indices such that perturbing by no more than (in a chosen norm) cannot flip the classifier's output from a true class to a specified critical alternative class . Formally, PRESERVED_PLACEHOLDER_4^ denotes the restriction of the input to indices in 0. The identification of a critical tensor provides a succinct semifactual explanation: even if the features indexed by 1 are perturbed maximally within 2, the classifier remains robust to the transition from 3 to 4.
This concept is particularly relevant in domains where different misclassifications pose differing levels of risk. For example, in traffic sign recognition, confusing a "Stop" sign with a "60 kph" sign is substantially more detrimental than with a "No Passing" sign. Critical tensors focus the explanation on the most consequential potential failure modes (Wang et al., 5 Apr 2026).
2. Targeted 5-Robustness Specification
Standard robustness verification assesses whether no class logit interval can overlap with the true class under arbitrary 6-bounded input perturbations. In contrast, ViTaX formulates Targeted 7-Robustness with respect to a specific alternative class 8, formalized as follows:
Let 4^ be a neural network, 0 the input, 1 the true class, and 2 the user-specified alternative. Consider perturbing features in 3 such that 4 while 5. Let 6 denote all such perturbed inputs, with 7 the full index set.
A feature subset 8 is said to satisfy T-ROB (Targeted Robustness) if the following holds:
4^
where 0 are the interval bounds on output logit 1 under the set 2. If T-ROB holds, the class transition 3 is formally impossible under 4-bounded perturbations to features in 5 (Wang et al., 5 Apr 2026).
3. Algorithm for Critical Tensor Identification
ViTaX operationalizes critical-tensor selection via an efficient algorithm, leveraging feature-sensitivity heuristics and binary search. The principle steps are:
- Heuristic Ranking (6): Compute 7 by ranking features according to the gradient-based sensitivity to the target class 8, i.e., using 4^ for each feature 0.
- Binary Search for Minimality: Iteratively select increasing prefixes of the ranked features and verify T-ROB via a formal reachability solver 1. The candidate set 2 is the smallest prefix satisfying T-ROB.
- Formal Verification: At each binary search iteration, call the solver 3 to check whether, for the selected 4, the network is robust to the 5 transition under the specified perturbation.
The algorithm returns a provably minimal (under ranking 6) critical tensor 7. The complexity of the process is logarithmic in the number of features due to the binary search strategy.
ViTaX Pseudocode (Algorithm 1):
0 The reachability solver 8 (e.g., Star-based reachability, MILP) certifies the required specification over infinite input sets 4^ (Wang et al., 5 Apr 2026).
4. Formal Guarantees and Properties
ViTaX's critical-tensor identification is underpinned by theoretical guarantees assuming the reachability solver is sound and complete and the ranking is sensitivity-optimal:
- Soundness: If 0 certifies T-ROB for 1, then 2 truly holds for all 3, meaning the 4 transition is impossible under such perturbations.
- Maximality (w.r.t. 5): 6 is the largest (minimal in cardinality) prefix of 7 such that the T-ROB criterion holds; supersets of 8 (w.r.t. the ranking) will violate T-ROB.
- Algorithmic Complexity: 4^ calls to the reachability solver due to binary search.
These properties ensure that the returned critical tensor 0 is both succinct and verifiably robust with respect to the 1 transition (Wang et al., 5 Apr 2026).
5. Empirical Examples and Size–Fidelity Trade-Off
ViTaX demonstrates practical critical-tensor identification on several datasets:
MNIST Example:
- Input: "4" (2); Target: "4" (3).
- Ranking identifies pixels critical for the "4→4" decision (e.g., pixels in the upper-loop of "4").
- Binary search yields a critical tensor 4 of 5 pixels. Larger 5 subsets increase robustness but decrease interpretability, defining a size–fidelity trade-off.
- Practitioners can explore 6 vs. 7 curves to balance explanation succinctness against guarantee strength.
GTSRB Example:
- Input: "Stop" sign (8); Target: "60 kph" (4).
- ViTaX selects 30 interior pixels (out of 3072) critical for the "Stop→60 kph" misclassification, with higher fidelity and lower cardinality than the 472-pixel baseline from VeriX.
This suggests that ViTaX explanations are both more interpretable and more faithful to the underlying risk structure than previous methods (Wang et al., 5 Apr 2026).
6. Scope of Applicability and Generalizations
ViTaX and its critical-tensor approach are network-architecture and modality agnostic, supporting:
- Architectures: MLPs, CNNs, ResNets, Inception, Transformers, contingent upon an appropriate reachability backend (e.g., Approx-Star, CP-Star, MILP).
- Input Modalities: Images, time series, textual embeddings, audio spectrograms, and graph features, with critical tensor selection performed on the flattened feature set.
- Beyond Classification: Adaptation to regression, ranked retrieval, or multi-label tasks by tuning the T-ROB specification to ensure robust margin (e.g., minimal lower-bound difference between outputs).
- Higher-Order Explanations: Chaining calls to the algorithm with varying target classes or perturbation budgets allows coverage of trade-offs and Pareto-optimal robustness sets.
A plausible implication is that critical-tensor identification via ViTaX serves as a general tool for providing semifactual, formally verified, high-fidelity explanations across a spectrum of ML applications, particularly where targeted safety guarantees are required (Wang et al., 5 Apr 2026).