"expert load balancing" Papers
2 papers found
CryptoMoE: Privacy-Preserving and Scalable Mixture of Experts Inference via Balanced Expert Routing
Yifan Zhou, Tianshi Xu, Jue Hong et al.
NEURIPS 2025posterarXiv:2511.01197
1
citations
ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing
Ziteng Wang, Jun Zhu, Jianfei Chen
ICLR 2025posterarXiv:2412.14711
28
citations