2024 Spotlight "scaling laws" Papers
2 papers found
Mixtures of Experts Unlock Parameter Scaling for Deep RL
Johan Obando Ceron, Ghada Sokar, Timon Willi et al.
ICML 2024spotlightarXiv:2402.08609
Navigating Scaling Laws: Compute Optimality in Adaptive Model Training
Sotiris Anagnostidis, Gregor Bachmann, Imanol Schlag et al.
ICML 2024spotlightarXiv:2311.03233