by Xiaohua Jia Papers
2 papers found
CAT: Contrastive Adversarial Training for Evaluating the Robustness of Protective Perturbations in Latent Diffusion Models
Sen Peng, Mingyue Wang, Jianfei He et al.
ICML 2025posterarXiv:2502.07225
The Hidden Dimensions of LLM Alignment: A Multi-Dimensional Analysis of Orthogonal Safety Directions
Wenbo Pan, Zhichao Liu, Qiguang Chen et al.
ICML 2025posterarXiv:2502.09674