Somesh Jha
15
Papers
101
Total Citations
Papers (15)
AutoDAN-Turbo: A Lifelong Agent for Strategy Self-Exploration to Jailbreak LLMs
ICLR 2025
100
citations
Validating Mechanistic Interpretations: An Axiomatic Approach
ICML 2025
1
citations
Do Large Code Models Understand Programming Concepts? Counterfactual Analysis for Code Predicates
ICML 2024
0
citations
Two Heads are Actually Better than One: Towards Better Adversarial Robustness via Transduction and Rejection
ICML 2024
0
citations
Robust Attribution Regularization
NeurIPS 2019
0
citations
Attribution-Based Confidence Metric For Deep Neural Networks
NeurIPS 2019
0
citations
A Separation Result Between Data-oblivious and Data-aware Poisoning Attacks
NeurIPS 2021
0
citations
Detecting Errors and Estimating Accuracy on Unlabeled Data with Self-training Ensembles
NeurIPS 2021
0
citations
Overparameterization from Computational Constraints
NeurIPS 2022
0
citations
Robust Learning against Relational Adversaries
NeurIPS 2022
0
citations
A Quantitative Geometric Approach to Neural-Network Smoothness
NeurIPS 2022
0
citations
Grounding Neural Inference with Satisfiability Modulo Theories
NeurIPS 2023
0
citations
Robust and Actively Secure Serverless Collaborative Learning
NeurIPS 2023
0
citations
Analyzing the Robustness of Nearest Neighbors to Adversarial Examples
ICML 2018
0
citations
Reinforcing Adversarial Robustness using Model Confidence Induced by Adversarial Training
ICML 2018
0
citations