Yazhe Niu
4 papers, 27 total citations
papers (4)
A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning
AAAI 2024, arXiv
25 citations
Hierarchical Balance Packing: Towards Efficient Supervised Fine-tuning for Long-Context LLM
NeurIPS 2025
2 citations
Pretrained Reversible Generation as Unsupervised Visual Representation Learning
ICCV 2025, arXiv
0 citations
LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios
NeurIPS 2023, arXiv
0 citations