"adversarial manipulation" Papers
4 papers found
Failures to Find Transferable Image Jailbreaks Between Vision-Language Models
Rylan Schaeffer, Dan Valentine, Luke Bailey et al.
ICLR 2025, poster, arXiv:2407.15211
22 citations
Fortifying Time Series: DTW-Certified Robust Anomaly Detection
Shijie Liu, Tansu Alpcan, Christopher Leckie et al.
NeurIPS 2025, oral
TRAP: Targeted Redirecting of Agentic Preferences
Hangoo Kang, Jehyeok Yeon, Gagandeep Singh
NeurIPS 2025, poster, arXiv:2505.23518
2 citations
Web Artifact Attacks Disrupt Vision Language Models
Maan Qraitem, Piotr Teterwak, Kate Saenko et al.
ICCV 2025, poster, arXiv:2503.13652
2 citations