"transfer attacks" Papers
4 papers found
Consensus-Robust Transfer Attacks via Parameter and Representation Perturbations
Shixin Li, Zewei Li, Xiaojing Ma et al.
NeurIPS 2025 poster
Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks
Maksym Andriushchenko, Francesco Croce, Nicolas Flammarion
ICLR 2025 poster, arXiv:2404.02151
387 citations
TransferBench: Benchmarking Ensemble-based Black-box Transfer Attacks
Fabio Brau, Maura Pintor, Antonio Cinà et al.
NeurIPS 2025 poster
Web Artifact Attacks Disrupt Vision Language Models
Maan Qraitem, Piotr Teterwak, Kate Saenko et al.
ICCV 2025 poster, arXiv:2503.13652
2 citations