NEURIPS 2025 "black-box attacks" Papers
4 papers found
A Technical Report on “Erasing the Invisible”: The 2024 NeurIPS Competition on Stress Testing Image Watermarks
Mucong Ding, Bang An, Tahseen Rabbani et al.
NEURIPS 2025poster
Reasoning as an Adaptive Defense for Safety
Taeyoun Kim, Fahim Tajwar, Aditi Raghunathan et al.
NEURIPS 2025posterarXiv:2507.00971
9
citations
Transferable Black-Box One-Shot Forging of Watermarks via Image Preference Models
Tomas Soucek, Sylvestre-Alvise Rebuffi, Pierre Fernandez et al.
NEURIPS 2025spotlightarXiv:2510.20468
TransferBench: Benchmarking Ensemble-based Black-box Transfer Attacks
Fabio Brau, Maura Pintor, Antonio Cinà et al.
NEURIPS 2025poster