α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Subhojyoti Mukherjee
Subhojyoti Mukherjee
2
Papers
2
Total Citations
Papers (2)
Offline RL by Reward-Weighted Fine-Tuning for Conversation Optimization
NeurIPS 2025
arXiv
2
citations
SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in Tabular MDP
ICML 2024
0
citations