2025 "gpu utilization" Papers
2 papers found
AREAL: A Large-Scale Asynchronous Reinforcement Learning System for Language Reasoning
Wei Fu, Jiaxuan Gao, Xujie Shen et al.
NeurIPS 2025 poster, arXiv:2505.24298
95 citations
DeFT: Decoding with Flash Tree-attention for Efficient Tree-structured LLM Inference
Jinwei Yao, Kaiqi Chen, Kexun Zhang et al.
ICLR 2025 poster, arXiv:2404.00242
8 citations