Poster "black-box attacks" Papers

12 papers found

A Technical Report on “Erasing the Invisible”: The 2024 NeurIPS Competition on Stress Testing Image Watermarks

Mucong Ding, Bang An, Tahseen Rabbani et al.

NeurIPS 2025poster

Confidence Elicitation: A New Attack Vector for Large Language Models

Brian Formento, Chuan Sheng Foo, See-Kiong Ng

ICLR 2025posterarXiv:2502.04643
2
citations

Efficient Reward Poisoning Attacks on Online Deep Reinforcement Learning

Yinglun Xu, Qi Zeng, Gagandeep Singh

ICLR 2025posterarXiv:2205.14842
8
citations

GSBA$^K$: $top$-$K$ Geometric Score-based Black-box Attack

Md Farhamdur Reza, Richeng Jin, Tianfu Wu et al.

ICLR 2025posterarXiv:2503.12827
2
citations

IDEATOR: Jailbreaking and Benchmarking Large Vision-Language Models Using Themselves

Ruofan Wang, Juncheng Li, Yixu Wang et al.

ICCV 2025posterarXiv:2411.00827
8
citations

Reasoning as an Adaptive Defense for Safety

Taeyoun Kim, Fahim Tajwar, Aditi Raghunathan et al.

NeurIPS 2025posterarXiv:2507.00971
9
citations

Training Robust Ensembles Requires Rethinking Lipschitz Continuity

Ali Ebrahimpour Boroojeny, Hari Sundaram, Varun Chandrasekaran

ICLR 2025poster
1
citations

TransferBench: Benchmarking Ensemble-based Black-box Transfer Attacks

Fabio Brau, Maura Pintor, Antonio Cinà et al.

NeurIPS 2025poster

Zero-cost Proxy for Adversarial Robustness Evaluation

Yuqi Feng, Yuwei Ou, Jiahao Fan et al.

ICLR 2025poster
1
citations

BadPart: Unified Black-box Adversarial Patch Attacks against Pixel-wise Regression Tasks

Zhiyuan Cheng, Zhaoyi Liu, Tengda Guo et al.

ICML 2024poster

Data Poisoning Attacks against Conformal Prediction

Yangyi Li, Aobo Chen, Wei Qian et al.

ICML 2024poster

Inter-Class Topology Alignment for Efficient Black-Box Substitute Attacks

lingzhuang meng, Mingwen Shao, Yuanjian Qiao et al.

ECCV 2024poster
1
citations