NeurIPS "adversarial training" Papers
8 papers found
Breaking Latent Prior Bias in Detectors for Generalizable AIGC Image Detection
Yue Zhou, Xinan He, Kaiqing Lin et al.
NeurIPS 2025, poster, arXiv:2506.00874
11 citations
Distributional LLM-as-a-Judge
Luyu Chen, Zeyu Zhang, Haoran Tan et al.
NeurIPS 2025, poster
MEIcoder: Decoding Visual Stimuli from Neural Activity by Leveraging Most Exciting Inputs
Jan Sobotka, Luca Baroni, Ján Antolík
NeurIPS 2025, poster, arXiv:2510.20762
Out-of-Distribution Generalized Graph Anomaly Detection with Homophily-aware Environment Mixup
Sibo Tian, Xin Wang, Zeyang Zhang et al.
NeurIPS 2025, poster
Short-length Adversarial Training Helps LLMs Defend Long-length Jailbreak Attacks: Theoretical and Empirical Evidence
Shaopeng Fu, Liang Ding, Jingfeng ZHANG et al.
NeurIPS 2025, poster, arXiv:2502.04204
6 citations
Solving Neural Min-Max Games: The Role of Architecture, Initialization & Dynamics
Deep Patel, Emmanouil-Vasileios Vlatakis-Gkaragkounis
NeurIPS 2025, spotlight, arXiv:2512.00389
Understanding and Improving Fast Adversarial Training against $\ell_0$ Bounded Perturbations
Xuyang Zhong, Yixiao Huang, Chen Liu
NeurIPS 2025, poster
ZEBRA: Towards Zero-Shot Cross-Subject Generalization for Universal Brain Visual Decoding
Haonan Wang, Jingyu Lu, Hongrui Li et al.
NeurIPS 2025, poster, arXiv:2510.27128