NEURIPS 2025 "distributed training" Papers
3 papers found
DUO: No Compromise to Accuracy Degradation
Jinda Jia, Cong Xie, Hanlin Lu et al.
NEURIPS 2025, poster
MeCeFO: Enhancing LLM Training Robustness via Fault-Tolerant Optimization
Rizhen Hu, Yutong He, Ran Yan et al.
NEURIPS 2025, poster, arXiv:2510.16415
StreamBP: Memory-Efficient Exact Backpropagation for Long Sequence Training of LLMs
Qijun Luo, Mengqi Li, Lei Zhao et al.
NEURIPS 2025, poster, arXiv:2506.03077
1 citation