2025 "adversarial robustness" Papers

50 papers found

$\sigma$-zero: Gradient-based Optimization of $\ell_0$-norm Adversarial Examples

Antonio Emanuele Cinà, Francesco Villani, Maura Pintor et al.

ICLR 2025, poster

Accelerated Vertical Federated Adversarial Learning through Decoupling Layer-Wise Dependencies

Tianxing Man, Yu Bai, Ganyu Wang et al.

NEURIPS 2025, poster

Adversarial Attacks on Data Attribution

Xinhe Wang, Pingbang Hu, Junwei Deng et al.

ICLR 2025, poster, arXiv:2409.05657

Adversarially Robust Anomaly Detection through Spurious Negative Pair Mitigation

Hossein Mirzaei Sadeghlou, Mojtaba Nafez, Jafar Habibi et al.

ICLR 2025, poster

Adversarial Robustness of Discriminative Self-Supervised Learning in Vision

Ömer Veysel Çağatan, Ömer TAL, M. Emre Gursoy

ICCV 2025, poster, arXiv:2503.06361

Alias-Free ViT: Fractional Shift Invariance via Linear Attention

Hagay Michaeli, Daniel Soudry

NEURIPS 2025, poster, arXiv:2510.22673

Artificial Kuramoto Oscillatory Neurons

Takeru Miyato, Sindy Löwe, Andreas Geiger et al.

ICLR 2025, oral, arXiv:2410.13821, 22 citations

A Transfer Attack to Image Watermarks

Yuepeng Hu, Zhengyuan Jiang, Moyang Guo et al.

ICLR 2025, poster, arXiv:2403.15365, 21 citations

Attack by Yourself: Effective and Unnoticeable Multi-Category Graph Backdoor Attacks with Subgraph Triggers Pool

Jiangtong Li, Dongyi Liu, Kun Zhu et al.

NEURIPS 2025, poster, arXiv:2412.17213, 2 citations

AVTrustBench: Assessing and Enhancing Reliability and Robustness in Audio-Visual LLMs

Sanjoy Chowdhury, Sayan Nag, Subhrajyoti Dasgupta et al.

ICCV 2025, poster, arXiv:2501.02135, 9 citations

Bridging Symmetry and Robustness: On the Role of Equivariance in Enhancing Adversarial Robustness

Longwei Wang, Ifrat Ikhtear Uddin, KC Santosh et al.

NEURIPS 2025, spotlight, arXiv:2510.16171, 2 citations

Chain of Attack: On the Robustness of Vision-Language Models Against Transfer-Based Adversarial Attacks

Peng Xie, Yequan Bie, Jianda Mao et al.

CVPR 2025, poster, arXiv:2411.15720, 10 citations

ConceptPrune: Concept Editing in Diffusion Models via Skilled Neuron Pruning

Ruchika Chavhan, Da Li, Timothy Hospedales

ICLR 2025, poster, arXiv:2405.19237, 34 citations

Confidence Elicitation: A New Attack Vector for Large Language Models

Brian Formento, Chuan Sheng Foo, See-Kiong Ng

ICLR 2025, poster, arXiv:2502.04643, 2 citations

DeDe: Detecting Backdoor Samples for SSL Encoders via Decoders

Sizai Hou, Songze Li, Duanyi Yao

CVPR 2025, poster, arXiv:2411.16154

Dissecting Adversarial Robustness of Multimodal LM Agents

Chen Wu, Rishi Shah, Jing Yu Koh et al.

ICLR 2025, poster, arXiv:2406.12814, 76 citations

DNA-DetectLLM: Unveiling AI-Generated Text via a DNA-Inspired Mutation-Repair Paradigm

Xiaowei Zhu, Yubing Ren, Fang Fang et al.

NEURIPS 2025, spotlight, arXiv:2509.15550

Dynamical Low-Rank Compression of Neural Networks with Robustness under Adversarial Attacks

Steffen Schotthöfer, Lexie Yang, Stefan Schnake

NEURIPS 2025, oral, arXiv:2505.08022, 6 citations

Endowing Visual Reprogramming with Adversarial Robustness

Shengjie Zhou, Xin Cheng, Haiyang Xu et al.

ICLR 2025, poster, 2 citations

Enhancing Graph Classification Robustness with Singular Pooling

Sofiane Ennadir, Oleg Smirnov, Yassine ABBAHADDOU et al.

NEURIPS 2025, poster, arXiv:2510.22643

ErrorTrace: A Black-Box Traceability Mechanism Based on Model Family Error Space

Chuanchao Zang, Xiangtao Meng, Wenyu Chen et al.

NEURIPS 2025, spotlight

Feature Averaging: An Implicit Bias of Gradient Descent Leading to Non-Robustness in Neural Networks

Binghui Li, Zhixuan Pan, Kaifeng Lyu et al.

ICLR 2025, poster, arXiv:2410.10322

FrameShield: Adversarially Robust Video Anomaly Detection

Mojtaba Nafez, Mobina Poulaei, Nikan Vasei et al.

NEURIPS 2025, oral, arXiv:2510.21532

GCAV: A Global Concept Activation Vector Framework for Cross-Layer Consistency in Interpretability

Zhenghao He, Sanchit Sinha, Guangzhi Xiong et al.

ICCV 2025, poster, arXiv:2508.21197

Improving Generalization and Robustness in SNNs Through Signed Rate Encoding and Sparse Encoding Attacks

Bhaskar Mukhoty, Hilal AlQuabeh, Bin Gu

ICLR 2025, poster, 2 citations

Indirect Gradient Matching for Adversarial Robust Distillation

Hongsin Lee, Seungju Cho, Changick Kim

ICLR 2025, poster, arXiv:2312.03286, 3 citations

Learning Randomized Algorithms with Transformers

Johannes von Oswald, Seijin Kobayashi, Yassir Akram et al.

ICLR 2025, poster, arXiv:2408.10818, 1 citation

LLM Unlearning via Neural Activation Redirection

William Shen, Xinchi Qiu, Meghdad Kurmanji et al.

NEURIPS 2025, poster, arXiv:2502.07218

Long-tailed Adversarial Training with Self-Distillation

Seungju Cho, Hongsin Lee, Changick Kim

ICLR 2025, poster, arXiv:2503.06461, 1 citation

LORE: Lagrangian-Optimized Robust Embeddings for Visual Encoders

Borna Khodabandeh, Amirabbas Afzali, Amirhossein Afsharrad et al.

NEURIPS 2025, poster, arXiv:2505.18884

MMDT: Decoding the Trustworthiness and Safety of Multimodal Foundation Models

Chejian Xu, Jiawei Zhang, Zhaorun Chen et al.

ICLR 2025, poster, arXiv:2503.14827, 9 citations

MUNBa: Machine Unlearning via Nash Bargaining

Jing Wu, Mehrtash Harandi

ICCV 2025, poster, arXiv:2411.15537, 7 citations

NAPPure: Adversarial Purification for Robust Image Classification under Non-Additive Perturbations

Junjie Nan, Jianing Li, Wei Chen et al.

ICCV 2025, poster, arXiv:2510.14025

PatchGuard: Adversarially Robust Anomaly Detection and Localization through Vision Transformers and Pseudo Anomalies

Mojtaba Nafez, Amirhossein Koochakian, Arad Maleki et al.

CVPR 2025, poster, arXiv:2506.09237, 2 citations

Provable Robust Overfitting Mitigation in Wasserstein Distributionally Robust Optimization

Shuang Liu, Yihan Wang, Yifan Zhu et al.

ICLR 2025, poster, arXiv:2503.04315

Reducing the Probability of Undesirable Outputs in Language Models Using Probabilistic Inference

Stephen Zhao, Aidan Li, Rob Brekelmans et al.

NEURIPS 2025, poster, arXiv:2510.21184

ReliabilityRAG: Effective and Provably Robust Defense for RAG-based Web-Search

Zeyu Shen, Basileal Imana, Tong Wu et al.

NEURIPS 2025, poster, arXiv:2509.23519, 1 citation

Resolution Attack: Exploiting Image Compression to Deceive Deep Neural Networks

Wangjia Yu, Xiaomeng Fu, Qiao Li et al.

ICLR 2025, poster

Robust Conformal Prediction with a Single Binary Certificate

Soroush H. Zargarbashi, Aleksandar Bojchevski

ICLR 2025, poster, arXiv:2503.05239, 3 citations

Robust Contextual Pricing

Anupam Gupta, Guru Guruganesh, Renato Leme et al.

NEURIPS 2025, poster

Robust Feature Learning for Multi-Index Models in High Dimensions

Alireza Mousavi-Hosseini, Adel Javanmard, Murat A Erdogdu

ICLR 2025, poster, arXiv:2410.16449, 5 citations

Robust SuperAlignment: Weak-to-Strong Robustness Generalization for Vision-Language Models

Junhao Dong, Cong Zhang, Xinghua Qu et al.

NEURIPS 2025, spotlight

Support is All You Need for Certified VAE Training

Changming Xu, Debangshu Banerjee, Deepak Vasisht et al.

ICLR 2025, poster, arXiv:2504.11831

Synergy Between the Strong and the Weak: Spiking Neural Networks are Inherently Self-Distillers

Yongqi Ding, Lin Zuo, Mengmeng Jing et al.

NEURIPS 2025, oral, arXiv:2510.07924

Towards Adversarial Robustness via Debiased High-Confidence Logit Alignment

Kejia Zhang, Juanjuan Weng, Zhiming Luo et al.

ICCV 2025, poster, arXiv:2408.06079, 2 citations

Two is Better than One: Efficient Ensemble Defense for Robust and Compact Models

Yoojin Jung, Byung Cheol Song

CVPR 2025, poster, arXiv:2504.04747, 1 citation

Understanding and Improving Adversarial Robustness of Neural Probabilistic Circuits

Weixin Chen, Han Zhao

NEURIPS 2025, poster, arXiv:2509.20549

WMCopier: Forging Invisible Watermarks on Arbitrary Images

Ziping Dong, Chao Shuai, Zhongjie Ba et al.

NEURIPS 2025, poster

Your Text Encoder Can Be An Object-Level Watermarking Controller

Naresh Kumar Devulapally, Mingzhen Huang, Vishal Asnani et al.

ICCV 2025, poster, arXiv:2503.11945

Zero-cost Proxy for Adversarial Robustness Evaluation

Yuqi Feng, Yuwei Ou, Jiahao Fan et al.

ICLR 2025, poster, 1 citation