Poster Papers by Eiko Yoneki
2 papers found
Demystifying Cost-Efficiency in LLM Serving over Heterogeneous GPUs
Youhe Jiang, Fangcheng Fu, Xiaozhe Yao et al.
ICML 2025, poster, arXiv:2502.00722
Efficient Pre-Training of LLMs via Topology-Aware Communication Alignment on More Than 9600 GPUs
Guoliang He, Youhe Jiang, Wencong Xiao et al.
NeurIPS 2025, poster, arXiv:2509.15940