Somesh Jha
4 Papers
101 Total Citations
Papers (4)
AutoDAN-Turbo: A Lifelong Agent for Strategy Self-Exploration to Jailbreak LLMs
ICLR 2025 (arXiv)
100 citations
Validating Mechanistic Interpretations: An Axiomatic Approach
ICML 2025 (arXiv)
1 citation
Do Large Code Models Understand Programming Concepts? Counterfactual Analysis for Code Predicates
ICML 2024
0 citations
Two Heads are Actually Better than One: Towards Better Adversarial Robustness via Transduction and Rejection
ICML 2024
0 citations