Poster "black-box attacks" Papers
12 papers found
A Technical Report on “Erasing the Invisible”: The 2024 NeurIPS Competition on Stress Testing Image Watermarks
Mucong Ding, Bang An, Tahseen Rabbani et al.
NeurIPS 2025, poster
Confidence Elicitation: A New Attack Vector for Large Language Models
Brian Formento, Chuan Sheng Foo, See-Kiong Ng
ICLR 2025, poster, arXiv:2502.04643
2 citations
Efficient Reward Poisoning Attacks on Online Deep Reinforcement Learning
Yinglun Xu, Qi Zeng, Gagandeep Singh
ICLR 2025, poster, arXiv:2205.14842
8 citations
GSBA$^K$: $top$-$K$ Geometric Score-based Black-box Attack
Md Farhamdur Reza, Richeng Jin, Tianfu Wu et al.
ICLR 2025, poster, arXiv:2503.12827
2 citations
IDEATOR: Jailbreaking and Benchmarking Large Vision-Language Models Using Themselves
Ruofan Wang, Juncheng Li, Yixu Wang et al.
ICCV 2025, poster, arXiv:2411.00827
8 citations
Reasoning as an Adaptive Defense for Safety
Taeyoun Kim, Fahim Tajwar, Aditi Raghunathan et al.
NeurIPS 2025, poster, arXiv:2507.00971
9 citations
Training Robust Ensembles Requires Rethinking Lipschitz Continuity
Ali Ebrahimpour Boroojeny, Hari Sundaram, Varun Chandrasekaran
ICLR 2025, poster
1 citation
TransferBench: Benchmarking Ensemble-based Black-box Transfer Attacks
Fabio Brau, Maura Pintor, Antonio Cinà et al.
NeurIPS 2025, poster
Zero-cost Proxy for Adversarial Robustness Evaluation
Yuqi Feng, Yuwei Ou, Jiahao Fan et al.
ICLR 2025, poster
1 citation
BadPart: Unified Black-box Adversarial Patch Attacks against Pixel-wise Regression Tasks
Zhiyuan Cheng, Zhaoyi Liu, Tengda Guo et al.
ICML 2024, poster
Data Poisoning Attacks against Conformal Prediction
Yangyi Li, Aobo Chen, Wei Qian et al.
ICML 2024, poster
Inter-Class Topology Alignment for Efficient Black-Box Substitute Attacks
Lingzhuang Meng, Mingwen Shao, Yuanjian Qiao et al.
ECCV 2024, poster
1 citation