Joey Hong
Papers: 4
Total Citations: 91
Papers (4)
LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models
ICML 2025
63 citations
ExeDec: Execution Decomposition for Compositional Generalization in Neural Program Synthesis
ICLR 2024
20 citations
Q-SFT: Q-Learning for Language Models via Supervised Fine-Tuning
ICLR 2025
8 citations
Learning to Explore in POMDPs with Informational Rewards
ICML 2024
0 citations