Poster "white-box attacks" Papers
3 papers found
Reasoning as an Adaptive Defense for Safety
Taeyoun Kim, Fahim Tajwar, Aditi Raghunathan et al.
NeurIPS 2025posterarXiv:2507.00971
9
citations
Zero-cost Proxy for Adversarial Robustness Evaluation
Yuqi Feng, Yuwei Ou, Jiahao Fan et al.
ICLR 2025poster
1
citations
Robustness Tokens: Towards Adversarial Robustness of Transformers
Brian Pulfer, Yury Belousov, Slava Voloshynovskiy
ECCV 2024posterarXiv:2503.10191