"machine unlearning" Papers

38 papers found

A Closer Look at Machine Unlearning for Large Language Models

Xiaojian Yuan, Tianyu Pang, Chao Du et al.

ICLR 2025posterarXiv:2410.08109
34
citations

Adversarial Machine Unlearning

Zonglin Di, Sixie Yu, Yevgeniy Vorobeychik et al.

ICLR 2025posterarXiv:2406.07687
11
citations

A Reliable Cryptographic Framework for Empirical Machine Unlearning Evaluation

Yiwen Tu, Pingbang Hu, Jiaqi Ma

NEURIPS 2025posterarXiv:2404.11577
2
citations

Ascent Fails to Forget

Ioannis Mavrothalassitis, Pol Puigdemont, Noam Levi et al.

NEURIPS 2025posterarXiv:2509.26427

Catastrophic Failure of LLM Unlearning via Quantization

Zhiwei Zhang, Fali Wang, Xiaomin Li et al.

ICLR 2025posterarXiv:2410.16454
43
citations

Composable Interventions for Language Models

Arinbjörn Kolbeinsson, Kyle O'Brien, Tianjin Huang et al.

ICLR 2025posterarXiv:2407.06483
5
citations

CoUn: Empowering Machine Unlearning via Contrastive Learning

Yasser Khalil, Mehdi Setayesh, Hongliang Li

NEURIPS 2025posterarXiv:2509.16391

Do LLMs Really Forget? Evaluating Unlearning with Knowledge Correlation and Confidence Awareness

Rongzhe Wei, Peizhi Niu, Hans Hao-Hsun Hsu et al.

NEURIPS 2025posterarXiv:2506.05735
6
citations

FALCON: Fine-grained Activation Manipulation by Contrastive Orthogonal Unalignment for Large Language Model

Jinwei Hu, Zhenglin Huang, Xiangyu Yin et al.

NEURIPS 2025posterarXiv:2502.01472
1
citations

Hessian-Free Online Certified Unlearning

Xinbao Qiao, Meng Zhang, Ming Tang et al.

ICLR 2025posterarXiv:2404.01712
5
citations

Hippocampal-like Sequential Editing for Continual Knowledge Updates in Large Language Models

Quntian Fang, Zhen Huang, Zhiliang Tian et al.

NEURIPS 2025poster

Keeping an Eye on LLM Unlearning: The Hidden Risk and Remedy

Jie Ren, Zhenwei Dai, Xianfeng Tang et al.

NEURIPS 2025posterarXiv:2506.00359
7
citations

LoTUS: Large-Scale Machine Unlearning with a Taste of Uncertainty

Christoforos N. Spartalis, Theodoros Semertzidis, Efstratios Gavves et al.

CVPR 2025posterarXiv:2503.18314
8
citations

Machine Unlearning Doesn't Do What You Think: Lessons for Generative AI Policy and Research

A. Feder Cooper, Christopher A. Choquette-Choo, Miranda Bogen et al.

NEURIPS 2025oralarXiv:2412.06966
1
citations

Machine Unlearning Fails to Remove Data Poisoning Attacks

Martin Pawelczyk, Jimmy Di, Yiwei Lu et al.

ICLR 2025posterarXiv:2406.17216
29
citations

Machine Unlearning via Simulated Oracle Matching

Kristian G Georgiev, Roy Rinberg, Sam Park et al.

ICLR 2025poster
3
citations

MUNBa: Machine Unlearning via Nash Bargaining

Jing Wu, Mehrtash Harandi

ICCV 2025posterarXiv:2411.15537
8
citations

MUSE: Machine Unlearning Six-Way Evaluation for Language Models

Weijia Shi, Jaechan Lee, Yangsibo Huang et al.

ICLR 2025posterarXiv:2407.06460
157
citations

On Large Language Model Continual Unlearning

Chongyang Gao, Lixu Wang, Kaize Ding et al.

ICLR 2025posterarXiv:2407.10223
29
citations

Position: Bridge the Gaps between Machine Unlearning and AI Regulation

Bill Marino, Meghdad Kurmanji, Nicholas Lane

NEURIPS 2025oralarXiv:2502.12430
4
citations

Probing Hidden Knowledge Holes in Unlearned LLMs

Myeongseob Ko, Hoang Anh Just, Charles Fleming et al.

NEURIPS 2025poster

Provable unlearning in topic modeling and downstream tasks

Stanley Wei, Sadhika Malladi, Sanjeev Arora et al.

ICLR 2025posterarXiv:2411.12600

Robust Machine Unlearning for Quantized Neural Networks via Adaptive Gradient Reweighting with Similar Labels

Yujia Tong, Yuze Wang, Jingling Yuan et al.

ICCV 2025posterarXiv:2503.13917
6
citations

RUAGO: Effective and Practical Retain-Free Unlearning via Adversarial Attack and OOD Generator

SangYong Lee, Sangjun Chung, Simon Woo

NEURIPS 2025poster

Selective Unlearning via Representation Erasure Using Domain Adversarial Training

Nazanin Sepahvand, Eleni Triantafillou, Hugo Larochelle et al.

ICLR 2025poster
3
citations

Toward Efficient Data-Free Unlearning

Chenhao Zhang, Shaofei Shen, Weitong Chen et al.

AAAI 2025paperarXiv:2412.13790
3
citations

Towards Effective Evaluations and Comparisons for LLM Unlearning Methods

Qizhou Wang, Bo Han, Puning Yang et al.

ICLR 2025posterarXiv:2406.09179
23
citations

Towards Scalable Exact Machine Unlearning Using Parameter-Efficient Fine-Tuning

Somnath Basu Roy Chowdhury, Krzysztof Choromanski, Arijit Sehanobish et al.

ICLR 2025posterarXiv:2406.16257
22
citations

Towards Source-Free Machine Unlearning

Sk Miraj Ahmed, Umit Basaran, Dripta S. Raychaudhuri et al.

CVPR 2025posterarXiv:2508.15127
2
citations

Unlearning Concepts in Diffusion Model via Concept Domain Correction and Concept Preserving Gradient

Yongliang Wu, Shiji Zhou, Mingzhuo Yang et al.

AAAI 2025paperarXiv:2405.15304
51
citations

Backdoor Attacks via Machine Unlearning

Zihao Liu, Tianhao Wang, Mengdi Huai et al.

AAAI 2024paperarXiv:2510.13322

Fast Machine Unlearning without Retraining through Selective Synaptic Dampening

Jack Foster, Stefan Schoepf, Alexandra Brintrup

AAAI 2024paperarXiv:2308.07707
175
citations

In-Context Unlearning: Language Models as Few-Shot Unlearners

Martin Pawelczyk, Seth Neel, Himabindu Lakkaraju

ICML 2024poster

Is Retain Set All You Need in Machine Unlearning? Restoring Performance of Unlearned Models with Out-Of-Distribution Images

Jacopo Bonato, Marco Cotogni, Luigi Sabetta

ECCV 2024posterarXiv:2404.12922
19
citations

MultiDelete for Multimodal Machine Unlearning

Jiali Cheng, Hadi Amiri

ECCV 2024posterarXiv:2311.12047
13
citations

Rethinking Adversarial Robustness in the Context of the Right to be Forgotten

Chenxu Zhao, Wei Qian, Yangyi Li et al.

ICML 2024poster

Towards Certified Unlearning for Deep Neural Networks

Binchi Zhang, Yushun Dong, Tianhao Wang et al.

ICML 2024posterarXiv:2408.00920

Verification of Machine Unlearning is Fragile

Binchi Zhang, Zihan Chen, Cong Shen et al.

ICML 2024posterarXiv:2408.00929