2024 "sparse mixture-of-experts" Papers
2 papers found
Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse Mixture-of-Experts
Byeongjun Park, Hyojun Go, Jin-Young Kim et al.
ECCV 2024posterarXiv:2403.09176
23
citations
Unleashing the Power of Meta-tuning for Few-shot Generalization Through Sparse Interpolated Experts
Shengzhuang Chen, Jihoon Tack, Yunqiao Yang et al.
ICML 2024posterarXiv:2403.08477