Archit Sharma
7
Papers
15
Total Citations
Papers (7)
Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval
ICLR 2025
15
citations
RLVF: Learning from Verbal Feedback without Overgeneralization
ICML 2024
0
citations
Preference Fine-Tuning of LLMs Should Leverage Suboptimal, On-Policy Data
ICML 2024
0
citations
Autonomous Reinforcement Learning via Subgoal Curricula
NeurIPS 2021
0
citations
You Only Live Once: Single-Life Reinforcement Learning
NeurIPS 2022
0
citations
When to Ask for Help: Proactive Interventions in Autonomous Reinforcement Learning
NeurIPS 2022
0
citations
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
NeurIPS 2023
0
citations