2025 "memory-efficient optimization" Papers
5 papers found
ACCO: Accumulate While You Communicate for Communication-Overlapped Sharded LLM Training
Adel Nabli, Louis Fournier, Pierre ERBACHER et al.
NeurIPS 2025posterarXiv:2406.02613
2
citations
Efficient Adaptive Federated Optimization
Su Hyeong Lee, Sidharth Sharma, Manzil Zaheer et al.
NeurIPS 2025posterarXiv:2410.18117
2
citations
MeCeFO: Enhancing LLM Training Robustness via Fault-Tolerant Optimization
Rizhen Hu, Yutong He, Ran Yan et al.
NeurIPS 2025posterarXiv:2510.16415
MISA: Memory-Efficient LLMs Optimization with Module-wise Importance Sampling
Yuxi Liu, Renjia Deng, Yutong He et al.
NeurIPS 2025posterarXiv:2511.00056
SPAM: Spike-Aware Adam with Momentum Reset for Stable LLM Training
Tianjin Huang, Ziquan Zhu, Gaojie Jin et al.
ICLR 2025posterarXiv:2501.06842
15
citations