ICLR 2025 "black-box attacks" Papers
5 papers found
Confidence Elicitation: A New Attack Vector for Large Language Models
Brian Formento, Chuan Sheng Foo, See-Kiong Ng
ICLR 2025posterarXiv:2502.04643
2
citations
Efficient Reward Poisoning Attacks on Online Deep Reinforcement Learning
Yinglun Xu, Qi Zeng, Gagandeep Singh
ICLR 2025posterarXiv:2205.14842
8
citations
GSBA$^K$: $top$-$K$ Geometric Score-based Black-box Attack
Md Farhamdur Reza, Richeng Jin, Tianfu Wu et al.
ICLR 2025posterarXiv:2503.12827
2
citations
Training Robust Ensembles Requires Rethinking Lipschitz Continuity
Ali Ebrahimpour Boroojeny, Hari Sundaram, Varun Chandrasekaran
ICLR 2025poster
1
citations
Zero-cost Proxy for Adversarial Robustness Evaluation
Yuqi Feng, Yuwei Ou, Jiahao Fan et al.
ICLR 2025poster
1
citations