"model security" Papers
3 papers found
Backdoor Attacks via Machine Unlearning
Zihao Liu, Tianhao Wang, Mengdi Huai et al.
AAAI 2024paperarXiv:2510.13322
Defense against Backdoor Attack on Pre-trained Language Models via Head Pruning and Attention Normalization
Xingyi Zhao, Depeng Xu, Shuhan Yuan
ICML 2024poster
Elijah: Eliminating Backdoors Injected in Diffusion Models via Distribution Shift
Shengwei An, Sheng-Yen Chou, Kaiyuan Zhang et al.
AAAI 2024paperarXiv:2312.00050