Poster "robust generalization" Papers
5 papers found
Short-length Adversarial Training Helps LLMs Defend Long-length Jailbreak Attacks: Theoretical and Empirical Evidence
Shaopeng Fu, Liang Ding, Jingfeng ZHANG et al.
NeurIPS 2025posterarXiv:2502.04204
6
citations
Benign Overfitting in Adversarial Training of Neural Networks
Yunjuan Wang, Kaibo Zhang, Raman Arora
ICML 2024poster
Density-Softmax: Efficient Test-time Model for Uncertainty Estimation and Robustness under Distribution Shifts
Ha Manh Bui, Anqi Liu
ICML 2024poster
Discovering Environments with XRM
Mohammad Pezeshki, Diane Bouchacourt, Mark Ibrahim et al.
ICML 2024poster
Two Heads are Actually Better than One: Towards Better Adversarial Robustness via Transduction and Rejection
Nils Palumbo, Yang Guo, Xi Wu et al.
ICML 2024poster