Nicholas Carlini
8
Papers
80
Total Citations
Papers (8)
Adversarial Perturbations Cannot Reliably Protect Artists From Generative AI
ICLR 2025
35
citations
Persistent Pre-training Poisoning of LLMs
ICLR 2025
34
citations
AutoAdvExBench: Benchmarking Autonomous Exploitation of Adversarial Example Defenses
ICML 2025
8
citations
Position: In-House Evaluation Is Not Enough. Towards Robust Third-Party Evaluation and Flaw Disclosure for General-Purpose AI
ICML 2025
2
citations
IF-Guide: Influence Function-Guided Detoxification of LLMs
NeurIPS 2025
1
citations
Initialization Matters for Adversarial Transfer Learning
CVPR 2024
0
citations
Position: Considerations for Differentially Private Learning with Large-Scale Public Pretraining
ICML 2024
0
citations
Stealing part of a production language model
ICML 2024
0
citations