Poster "interpretability methods" Papers
5 papers found
Concept-Guided Interpretability via Neural Chunking
Shuchen Wu, Stephan Alaniz, Shyamgopal Karthik et al.
NeurIPS 2025posterarXiv:2505.11576
Layer-Wise Modality Decomposition for Interpretable Multimodal Sensor Fusion
Jaehyun Park, Konyul Park, Daehun Kim et al.
NeurIPS 2025posterarXiv:2511.00859
Residual Stream Analysis with Multi-Layer SAEs
Tim Lawson, Lucy Farnik, Conor Houghton et al.
ICLR 2025posterarXiv:2409.04185
11
citations
Listenable Maps for Audio Classifiers
Francesco Paissan, Mirco Ravanelli, Cem Subakan
ICML 2024poster
SPADE: Sparsity-Guided Debugging for Deep Neural Networks
Arshia Soltani Moakhar, Eugenia Iofinova, Elias Frantar et al.
ICML 2024poster