Shenao Zhang
4
Papers
13
Total Citations
Papers (4)
BRiTE: Bootstrapping Reinforced Thinking Process to Enhance Language Model Reasoning
ICML 2025
8
citations
Reward-Augmented Data Enhances Direct Preference Alignment of LLMs
ICML 2025
5
citations
Adaptive-Gradient Policy Optimization: Enhancing Policy Learning in Non-Smooth Differentiable Simulations
ICML 2024
0
citations
Reason for Future, Act for Now: A Principled Architecture for Autonomous LLM Agents
ICML 2024
0
citations