Somesh Jha
Papers: 4
Total Citations: 107
Papers (4)
AutoDAN-Turbo: A Lifelong Agent for Strategy Self-Exploration to Jailbreak LLMs
ICLR 2025 (arXiv)
106 citations
Validating Mechanistic Interpretations: An Axiomatic Approach
ICML 2025
1 citation
Do Large Code Models Understand Programming Concepts? Counterfactual Analysis for Code Predicates
ICML 2024
0 citations
Two Heads are Actually Better than One: Towards Better Adversarial Robustness via Transduction and Rejection
ICML 2024
0 citations