Poster Papers by Vinko Sabolčec
2 papers found
Enhancing Multilingual LLM Pretraining with Model-Based Data Selection
Bettina Messmer, Vinko Sabolčec, Martin Jaggi
NeurIPS 2025, poster, arXiv:2502.10361
10 citations
URLs Help, Topics Guide: Understanding Metadata Utility in LLM Training
Dongyang Fan, Vinko Sabolčec, Martin Jaggi
NeurIPS 2025, poster, arXiv:2505.16570