NEURIPS 2025 "explainable ai" Papers

18 papers found

$\mathcal{X}^2$-DFD: A framework for e$\mathcal{X}$plainable and e$\mathcal{X}$tendable Deepfake Detection

Yize Chen, Zhiyuan Yan, Guangliang Cheng et al.

NEURIPS 2025poster

Advancing Interpretability of CLIP Representations with Concept Surrogate Model

Nhat Hoang-Xuan, Xiyuan Wei, Wanli Xing et al.

NEURIPS 2025poster

Contimask: Explaining Irregular Time Series via Perturbations in Continuous Time

Max Moebus, Björn Braun, Christian Holz

NEURIPS 2025poster

Disentangled Concepts Speak Louder Than Words: Explainable Video Action Recognition

Jongseo Lee, Wooil Lee, Gyeong-Moon Park et al.

NEURIPS 2025spotlightarXiv:2511.03725

Explainable Reinforcement Learning from Human Feedback to Improve Alignment

Shicheng Liu, Siyuan Xu, Wenjie Qiu et al.

NEURIPS 2025posterarXiv:2512.13837

Explainably Safe Reinforcement Learning

Sabine Rieder, Stefan Pranger, Debraj Chakraborty et al.

NEURIPS 2025poster

LeapFactual: Reliable Visual Counterfactual Explanation Using Conditional Flow Matching

Zhuo Cao, Xuan Zhao, Lena Krieger et al.

NEURIPS 2025posterarXiv:2510.14623
1
citations

Minimizing False-Positive Attributions in Explanations of Non-Linear Models

Anders Gjølbye, Stefan Haufe, Lars Kai Hansen

NEURIPS 2025posterarXiv:2505.11210
1
citations

Mol-LLaMA: Towards General Understanding of Molecules in Large Molecular Language Model

Dongki Kim, Wonbin Lee, Sung Ju Hwang

NEURIPS 2025posterarXiv:2502.13449
10
citations

On Logic-based Self-Explainable Graph Neural Networks

Alessio Ragno, Marc Plantevit, Céline Robardet

NEURIPS 2025poster

Provable Gradient Editing of Deep Neural Networks

Zhe Tao, Aditya V Thakur

NEURIPS 2025spotlight
1
citations

RadZero: Similarity-Based Cross-Attention for Explainable Vision-Language Alignment in Chest X-ray with Zero-Shot Multi-Task Capability

Jonggwon Park, Byungmu Yoon, Soobum Kim et al.

NEURIPS 2025posterarXiv:2504.07416
1
citations

Regression-adjusted Monte Carlo Estimators for Shapley Values and Probabilistic Values

R. Teal Witter, Yurong Liu, Christopher Musco

NEURIPS 2025posterarXiv:2506.11849
2
citations

Representational Difference Explanations

Neehar Kondapaneni, Oisin Mac Aodha, Pietro Perona

NEURIPS 2025posterarXiv:2505.23917

Scalable, Explainable and Provably Robust Anomaly Detection with One-Step Flow Matching

Zhong Li, Qi Huang, Yuxuan Zhu et al.

NEURIPS 2025posterarXiv:2510.18328

SHAP values via sparse Fourier representation

Ali Gorji, Andisheh Amrollahi, Andreas Krause

NEURIPS 2025spotlightarXiv:2410.06300
2
citations

Smoothed Differentiation Efficiently Mitigates Shattered Gradients in Explanations

Adrian Hill, Neal McKee, Johannes Maeß et al.

NEURIPS 2025poster

Sound Logical Explanations for Mean Aggregation Graph Neural Networks

Matthew Morris, Ian Horrocks

NEURIPS 2025posterarXiv:2511.11593