Kianté Brantley
5
Papers
31
Total Citations
Papers (5)
Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF
ICLR 2025
14
citations
$Q\sharp$: Provably Optimal Distributional RL for LLM Post-Training
NeurIPS 2025, arXiv
10
citations
Value-Guided Search for Efficient Chain-of-Thought Reasoning
NeurIPS 2025
7
citations
Coactive Learning for Large Language Models using Implicit User Feedback
ICML 2024
0
citations
When is Transfer Learning Possible?
ICML 2024
0
citations