Most Cited 2025 Spotlight by Fan Lai Papers
3 papers found
Conference
#1
HyGen: Efficient LLM Serving via Elastic Online-Offline Request Co-location
Ting Sun, Penghan Wang, Fan Lai
NEURIPS 2025posterarXiv:2501.14808
7
citations
#2
Inv-Entropy: A Fully Probabilistic Framework for Uncertainty Quantification in Language Models
Haoyi Song, Ruihan Ji, Naichen Shi et al.
NEURIPS 2025posterarXiv:2506.09684
1
citations
#3
Act Only When It Pays: Efficient Reinforcement Learning for LLM Reasoning via Selective Rollouts
Haizhong Zheng, Yang Zhou, Brian Bartoldson et al.
NEURIPS 2025oralarXiv:2506.02177