Javier Rando
4
Papers
185
Total Citations
1
Affiliations
Affiliations
ETH Zurich
Papers (4)
Universal Jailbreak Backdoors from Poisoned Human Feedback
ICLR 2024
108
citations
Adversarial Perturbations Cannot Reliably Protect Artists From Generative AI
ICLR 2025
35
citations
Persistent Pre-training Poisoning of LLMs
ICLR 2025
34
citations
AutoAdvExBench: Benchmarking Autonomous Exploitation of Adversarial Example Defenses
ICML 2025
8
citations