Han Zhong
6
Papers
8
Total Citations
Papers (6)
BRiTE: Bootstrapping Reinforced Thinking Process to Enhance Language Model Reasoning
ICML 2025
8
citations
A3S: A General Active Clustering Method with Pairwise Constraints
ICML 2024
0
citations
Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment
ICML 2024
0
citations
Provably Efficient Exploration in Quantum Reinforcement Learning with Logarithmic Worst-Case Regret
ICML 2024
0
citations
Combinatorial Multivariant Multi-Armed Bandits with Applications to Episodic Reinforcement Learning and Beyond
ICML 2024
0
citations
Iterative Preference Learning from Human Feedback: Bridging Theory and Practice for RLHF under KL-constraint
ICML 2024
0
citations