"parameter initialization" Papers
4 papers found
Big Learning Expectation Maximization
Yulai Cong, Sijia Li
AAAI 2024paperarXiv:2312.11926
Idling Neurons, Appropriately Lenient Workload During Fine-tuning Leads to Better Generalization
Hongjing Niu, Hanting Li, Bin Li et al.
ECCV 2024poster
Rethinking Optimization and Architecture for Tiny Language Models
Yehui Tang, Kai Han, Fangcheng Liu et al.
ICML 2024poster
Stability-Informed Initialization of Neural Ordinary Differential Equations
Theodor Westny, Arman Mohammadi, Daniel Jung et al.
ICML 2024poster