2025 Poster by Fan Lai Papers
2 papers found
HyGen: Efficient LLM Serving via Elastic Online-Offline Request Co-location
Ting Sun, Penghan Wang, Fan Lai
NEURIPS 2025posterarXiv:2501.14808
7
citations
Inv-Entropy: A Fully Probabilistic Framework for Uncertainty Quantification in Language Models
Haoyi Song, Ruihan Ji, Naichen Shi et al.
NEURIPS 2025posterarXiv:2506.09684
1
citations