2025 "model robustness" Papers

10 papers found

Aligning Visual Contrastive learning models via Preference Optimization

Amirabbas Afzali, Borna khodabandeh, Ali Rasekh et al.

ICLR 2025posterarXiv:2411.08923
3
citations

Learning Diverse Attacks on Large Language Models for Robust Red-Teaming and Safety Tuning

Seanie Lee, Minsu Kim, Lynn Cherif et al.

ICLR 2025posterarXiv:2405.18540
44
citations

Modality-Aware SAM: Sharpness-Aware-Minimization Driven Gradient Modulation for Harmonized Multimodal Learning

Hossein Rajoli Nowdeh, Jie Ji, Xiaolong Ma et al.

NeurIPS 2025posterarXiv:2510.24919

Optimal Control for Transformer Architectures: Enhancing Generalization, Robustness and Efficiency

Kelvin Kan, Xingjian Li, Benjamin Zhang et al.

NeurIPS 2025posterarXiv:2505.13499
3
citations

Perturb a Model, Not an Image: Towards Robust Privacy Protection via Anti-Personalized Diffusion Models

Tae-Young Lee, Juwon Seo, Jong Hwan Ko et al.

NeurIPS 2025posterarXiv:2511.01307

Remarkable Robustness of LLMs: Stages of Inference?

Vedang Lad, Jin Hwa Lee, Wes Gurnee et al.

NeurIPS 2025oralarXiv:2406.19384
89
citations

Resolution Attack: Exploiting Image Compression to Deceive Deep Neural Networks

Wangjia Yu, Xiaomeng Fu, Qiao Li et al.

ICLR 2025poster

Rethinking Evaluation of Infrared Small Target Detection

Youwei Pang, Xiaoqi Zhao, Lihe Zhang et al.

NeurIPS 2025posterarXiv:2509.16888

Topological Zigzag Spaghetti for Diffusion-based Generation and Prediction on Graphs

Yuzhou Chen, Yulia Gel

ICLR 2025poster

TransferBench: Benchmarking Ensemble-based Black-box Transfer Attacks

Fabio Brau, Maura Pintor, Antonio Cinà et al.

NeurIPS 2025poster