Zhaoran Wang
Papers: 7
Total Citations: 15
Papers (7)
BRiTE: Bootstrapping Reinforced Thinking Process to Enhance Language Model Reasoning (ICML 2025), 8 citations
Reward-Augmented Data Enhances Direct Preference Alignment of LLMs (ICML 2025), 5 citations
Are Transformers Able to Reason by Connecting Separated Knowledge in Training Data? (ICLR 2025), 2 citations
How Does Goal Relabeling Improve Sample Efficiency? (ICML 2024), 0 citations
Adaptive-Gradient Policy Optimization: Enhancing Policy Learning in Non-Smooth Differentiable Simulations (ICML 2024), 0 citations
Reason for Future, Act for Now: A Principled Architecture for Autonomous LLM Agents (ICML 2024), 0 citations
A General Framework for Sequential Decision-Making under Adaptivity Constraints (ICML 2024), 0 citations