"explanation robustness" Papers
3 papers found
Probabilistic Stability Guarantees for Feature Attributions
Helen Jin, Anton Xue, Weiqiu You et al.
NeurIPS 2025 | poster | arXiv:2504.13787
6 citations
Axiomatic Aggregations of Abductive Explanations
Gagan Biradar, Yacine Izza, Elita Lobo et al.
AAAI 2024 | paper | arXiv:2310.03131
9 citations
Provably Better Explanations with Optimized Aggregation of Feature Attributions
Thomas Decker, Ananta Bhattarai, Jindong Gu et al.
ICML 2024 | poster