ICLR 2025 "adversarial robustness" Papers

21 papers found

$\sigma$-zero: Gradient-based Optimization of $\ell_0$-norm Adversarial Examples

Antonio Emanuele Cinà, Francesco Villani, Maura Pintor et al.

ICLR 2025, poster

Adversarial Attacks on Data Attribution

Xinhe Wang, Pingbang Hu, Junwei Deng et al.

ICLR 2025, poster, arXiv:2409.05657

Adversarially Robust Anomaly Detection through Spurious Negative Pair Mitigation

Hossein Mirzaei Sadeghlou, Mojtaba Nafez, Jafar Habibi et al.

ICLR 2025, poster

Artificial Kuramoto Oscillatory Neurons

Takeru Miyato, Sindy Löwe, Andreas Geiger et al.

ICLR 2025, oral, arXiv:2410.13821, 22 citations

A Transfer Attack to Image Watermarks

Yuepeng Hu, Zhengyuan Jiang, Moyang Guo et al.

ICLR 2025, poster, arXiv:2403.15365, 21 citations

ConceptPrune: Concept Editing in Diffusion Models via Skilled Neuron Pruning

Ruchika Chavhan, Da Li, Timothy Hospedales

ICLR 2025, poster, arXiv:2405.19237, 34 citations

Confidence Elicitation: A New Attack Vector for Large Language Models

Brian Formento, Chuan Sheng Foo, See-Kiong Ng

ICLR 2025, poster, arXiv:2502.04643, 2 citations

Dissecting Adversarial Robustness of Multimodal LM Agents

Chen Wu, Rishi Shah, Jing Yu Koh et al.

ICLR 2025, poster, arXiv:2406.12814, 76 citations

Endowing Visual Reprogramming with Adversarial Robustness

Shengjie Zhou, Xin Cheng, Haiyang Xu et al.

ICLR 2025, poster, 2 citations

Feature Averaging: An Implicit Bias of Gradient Descent Leading to Non-Robustness in Neural Networks

Binghui Li, Zhixuan Pan, Kaifeng Lyu et al.

ICLR 2025, poster, arXiv:2410.10322

Improving Generalization and Robustness in SNNs Through Signed Rate Encoding and Sparse Encoding Attacks

Bhaskar Mukhoty, Hilal AlQuabeh, Bin Gu

ICLR 2025, poster, 2 citations

Indirect Gradient Matching for Adversarial Robust Distillation

Hongsin Lee, Seungju Cho, Changick Kim

ICLR 2025, poster, arXiv:2312.03286, 3 citations

Learning Randomized Algorithms with Transformers

Johannes von Oswald, Seijin Kobayashi, Yassir Akram et al.

ICLR 2025, poster, arXiv:2408.10818, 1 citation

Long-tailed Adversarial Training with Self-Distillation

Seungju Cho, Hongsin Lee, Changick Kim

ICLR 2025, poster, arXiv:2503.06461, 1 citation

MMDT: Decoding the Trustworthiness and Safety of Multimodal Foundation Models

Chejian Xu, Jiawei Zhang, Zhaorun Chen et al.

ICLR 2025, poster, arXiv:2503.14827, 9 citations

Provable Robust Overfitting Mitigation in Wasserstein Distributionally Robust Optimization

Shuang Liu, Yihan Wang, Yifan Zhu et al.

ICLR 2025, poster, arXiv:2503.04315

Resolution Attack: Exploiting Image Compression to Deceive Deep Neural Networks

Wangjia Yu, Xiaomeng Fu, Qiao Li et al.

ICLR 2025, poster

Robust Conformal Prediction with a Single Binary Certificate

Soroush H. Zargarbashi, Aleksandar Bojchevski

ICLR 2025, poster, arXiv:2503.05239, 3 citations

Robust Feature Learning for Multi-Index Models in High Dimensions

Alireza Mousavi-Hosseini, Adel Javanmard, Murat A Erdogdu

ICLR 2025, poster, arXiv:2410.16449, 5 citations

Support is All You Need for Certified VAE Training

Changming Xu, Debangshu Banerjee, Deepak Vasisht et al.

ICLR 2025, poster, arXiv:2504.11831

Zero-cost Proxy for Adversarial Robustness Evaluation

Yuqi Feng, Yuwei Ou, Jiahao Fan et al.

ICLR 2025, poster, 1 citation