Francesco Croce
6
Papers
409
Total Citations
Papers (6)
Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks
ICLR 2025
375
citations
OS-Harm: A Benchmark for Measuring Safety of Computer Use Agents
NeurIPS 2025arXiv
18
citations
Towards Reliable Evaluation and Fast Training of Robust Semantic Segmentation Models
ECCV 2024arXiv
12
citations
Selective induction Heads: How Transformers Select Causal Structures in Context
ICLR 2025arXiv
4
citations
Robust CLIP: Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models
ICML 2024
0
citations
Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning
ICML 2024
0
citations