Poster "mixture-of-experts architecture" Papers
4 papers found
Layerwise Recurrent Router for Mixture-of-Experts
Zihan Qiu, Zeyu Huang, Shuang Cheng et al.
ICLR 2025posterarXiv:2408.06793
7
citations
MEGADance: Mixture-of-Experts Architecture for Genre-Aware 3D Dance Generation
kaixing yang, Xulong Tang, Ziqiao Peng et al.
NeurIPS 2025posterarXiv:2505.17543
5
citations
MINGLE: Mixture of Null-Space Gated Low-Rank Experts for Test-Time Continual Model Merging
Zihuan Qiu, Yi Xu, Chiyuan He et al.
NeurIPS 2025posterarXiv:2505.11883
5
citations
Mixture of Parrots: Experts improve memorization more than reasoning
Samy Jelassi, Clara Mohri, David Brandfonbrener et al.
ICLR 2025posterarXiv:2410.19034
14
citations