"unlearning" Papers
3 papers found
Conference
Agents Are All You Need for LLM Unlearning
Debdeep Sanyal, Murari Mandal
COLM 2025paperarXiv:2502.00406
8
citations
A Probabilistic Perspective on Unlearning and Alignment for Large Language Models
Yan Scholten, Stephan Günnemann, Leo Schwinn
ICLR 2025arXiv:2410.03523
17
citations
SAEs Can Improve Unlearning: Dynamic Sparse Autoencoder Guardrails for Precision Unlearning in LLMs
Aashiq Muhamed, Jacopo Bonato, Mona T. Diab et al.
COLM 2025paper
17
citations