Poster "adversarial machine learning" Papers
4 papers found
Concept-ROT: Poisoning Concepts in Large Language Models with Model Editing
Keltin Grimes, Marco Christiani, David Shriver et al.
ICLR 2025posterarXiv:2412.13341
6
citations
Where the Devil Hides: Deepfake Detectors Can No Longer Be Trusted
Shuaiwei Yuan, Junyu Dong, Yuezun Li
CVPR 2025posterarXiv:2505.08255
2
citations
Energy-based Backdoor Defense without Task-Specific Samples and Model Retraining
Yudong Gao, Honglong Chen, Peng Sun et al.
ICML 2024poster
Uniformly Stable Algorithms for Adversarial Training and Beyond
Jiancong Xiao, Jiawei Zhang, Zhi-Quan Luo et al.
ICML 2024poster