Saumitra Mishra
6 Papers
13 Total Citations
Papers (6)
Interpreting Language Reward Models via Contrastive Explanations
ICLR 2025
5 citations
Quantifying Prediction Consistency Under Fine-tuning Multiplicity in Tabular LLMs
ICML 2025
4 citations
Representation Consistency for Accurate and Coherent LLM Answer Aggregation
NeurIPS 2025, arXiv
2 citations
To Steer or Not to Steer? Mechanistic Error Reduction with Abstention for Language Models
ICML 2025
2 citations
Progressive Inference: Explaining Decoder-Only Sequence Classification Models Using Intermediate Predictions
ICML 2024
0 citations
Counterfactual Metarules for Local and Global Recourse
ICML 2024
0 citations