Yazhe Niu
3
Papers
27
Total Citations
Papers (3)
A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning
AAAI 2024arXiv
25
citations
Hierachical Balance Packing: Towards Efficient Supervised Fine-tuning for Long-Context LLM
NeurIPS 2025
2
citations
Pretrained Reversible Generation as Unsupervised Visual Representation Learning
ICCV 2025
0
citations