Papers by Rohan Deb
3 papers found
Conservative Contextual Bandits: Beyond Linear Representations
Rohan Deb, Mohammad Ghavamzadeh, Arindam Banerjee
ICLR 2025, poster, arXiv:2412.06165
1 citation
FisherSFT: Data-Efficient Supervised Fine-Tuning of Language Models Using Information Gain
Rohan Deb, Kiran Thekumparampil, Kousha Kalantari et al.
ICML 2025, poster, arXiv:2505.14826
Contextual Bandits with Online Neural Regression
Rohan Deb, Yikun Ban, Shiliang Zuo et al.
ICLR 2024, poster, arXiv:2312.07145