by Taylor W. Killian Papers
2 papers found
BraVE: Offline Reinforcement Learning for Discrete Combinatorial Action Spaces
Matthew Landers, Taylor W. Killian, Hugo Barnes et al.
NeurIPS 2025poster
Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective
Jorge (Zhoujun) Cheng, Shibo Hao, Tianyang Liu et al.
NeurIPS 2025posterarXiv:2506.14965
35
citations