Yazhe Niu

3

Papers

27

Total Citations

Papers (3)

A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning

Hierachical Balance Packing: Towards Efficient Supervised Fine-tuning for Long-Context LLM

Pretrained Reversible Generation as Unsupervised Visual Representation Learning