NeurIPS Poster "adversarial attacks" Papers
10 papers found
Adversarial Attacks against Closed-Source MLLMs via Feature Optimal Alignment
Xiaojun Jia, Sensen Gao, Simeng Qin et al.
NeurIPS 2025 poster, arXiv:2505.21494
12 citations
Adversary Aware Optimization for Robust Defense
Daniel Wesego, Pedram Rooshenas
NeurIPS 2025 poster
DepthVanish: Optimizing Adversarial Interval Structures for Stereo-Depth-Invisible Patches
Yun Xing, Yue Cao, Nhat Chung et al.
NeurIPS 2025 poster, arXiv:2506.16690
Fit the Distribution: Cross-Image/Prompt Adversarial Attacks on Multimodal Large Language Models
Hai Yan, Haijian Ma, Xiaowen Cai et al.
NeurIPS 2025 poster
IPAD: Inverse Prompt for AI Detection - A Robust and Interpretable LLM-Generated Text Detector
Zheng CHEN, Yushi Feng, Jisheng Dang et al.
NeurIPS 2025 poster, arXiv:2502.15902
Keeping an Eye on LLM Unlearning: The Hidden Risk and Remedy
Jie Ren, Zhenwei Dai, Xianfeng Tang et al.
NeurIPS 2025 poster, arXiv:2506.00359
6 citations
MIP against Agent: Malicious Image Patches Hijacking Multimodal OS Agents
Lukas Aichberger, Alasdair Paren, Guohao Li et al.
NeurIPS 2025 poster, arXiv:2503.10809
10 citations
Non-Adaptive Adversarial Face Generation
Sunpill Kim, Seunghun Paik, Chanwoo Hwang et al.
NeurIPS 2025 poster, arXiv:2507.12107
1 citation
SECA: Semantically Equivalent and Coherent Attacks for Eliciting LLM Hallucinations
Buyun Liang, Liangzu Peng, Jinqi Luo et al.
NeurIPS 2025 poster, arXiv:2510.04398
Stochastic Regret Guarantees for Online Zeroth- and First-Order Bilevel Optimization
Parvin Nazari, Bojian Hou, Davoud Ataee Tarzanagh et al.
NeurIPS 2025 poster, arXiv:2511.01126