2025 Poster "gradient estimation" Papers
3 papers found
PseuZO: Pseudo-Zeroth-Order Algorithm for Training Deep Neural Networks
Pengyun Yue, Xuanlin Yang, Mingqing Xiao et al.
NeurIPS 2025poster
Soft Merging of Experts with Adaptive Routing
Haokun Liu, Muqeeth Mohammed, Colin Raffel
ICLR 2025posterarXiv:2306.03745
81
citations
Sparse MeZO: Less Parameters for Better Performance in Zeroth-Order LLM Fine-Tuning
Yong Liu, Zirui Zhu, Chaoyu Gong et al.
NeurIPS 2025posterarXiv:2402.15751
36
citations