NEURIPS Poster "neural network interpretability" Papers
3 papers found
FACE: Faithful Automatic Concept Extraction
Dipkamal Bhusal, Michael Clifford, Sara Rampazzi et al.
NEURIPS 2025posterarXiv:2510.11675
3
citations
From Flat to Hierarchical: Extracting Sparse Representations with Matching Pursuit
Valérie Costa, Thomas Fel, Ekdeep S Lubana et al.
NEURIPS 2025posterarXiv:2506.03093
13
citations
Interpreting Emergent Features in Deep Learning-based Side-channel Analysis
Sengim Karayalcin, Marina Krček, Stjepan Picek
NEURIPS 2025posterarXiv:2502.00384