2025 Poster "robust generalization" Papers
2 papers found
Generating Less Certain Adversarial Examples Improves Robust Generalization
Minxing Zhang, Michael Backes, Xiao Zhang
ICLR 2025posterarXiv:2310.04539
1
citations
Short-length Adversarial Training Helps LLMs Defend Long-length Jailbreak Attacks: Theoretical and Empirical Evidence
Shaopeng Fu, Liang Ding, Jingfeng ZHANG et al.
NeurIPS 2025posterarXiv:2502.04204
6
citations