"gating mechanisms" Papers
3 papers found
Gated Delta Networks: Improving Mamba2 with Delta Rule
Songlin Yang, Jan Kautz, Ali Hatamizadeh
ICLR 2025 poster arXiv:2412.06464
145 citations
Learning to Specialize: Joint Gating-Expert Training for Adaptive MoEs in Decentralized Settings
Yehya Farhat, Hamza ElMokhtar Shili, Fangshuo Liao et al.
NeurIPS 2025 poster arXiv:2306.08586
3 citations
Is Kernel Prediction More Powerful than Gating in Convolutional Neural Networks?
Lorenz K. Muller
ICML 2024 poster