Bettina Messmer
3
Papers
14
Total Citations
Papers (3)
Enhancing Multilingual LLM Pretraining with Model-Based Data Selection
NeurIPS 2025arXiv
10
citations
On-Device Collaborative Language Modeling via a Mixture of Generalists and Specialists
ICML 2025
4
citations
Rotational Equilibrium: How Weight Decay Balances Learning Across Neural Networks
ICML 2024
0
citations