Francesco Croce
5
Papers
405
Total Citations
Papers (5)
Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks
ICLR 2025
375
citations
OS-Harm: A Benchmark for Measuring Safety of Computer Use Agents
NeurIPS 2025, arXiv
18
citations
Towards Reliable Evaluation and Fast Training of Robust Semantic Segmentation Models
ECCV 2024
12
citations
Robust CLIP: Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models
ICML 2024
0
citations
Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning
ICML 2024
0
citations