"multilingual pretraining" Papers
2 papers found
Conference
ALLaM: Large Language Models for Arabic and English
M Saiful Bari, Yazeed Alnumay, Norah Alzahrani et al.
ICLR 2025arXiv:2407.15390
49
citations
Headless Language Models: Learning without Predicting with Contrastive Weight Tying
Nathan Godey, Éric Clergerie, Benoît Sagot
ICLR 2024arXiv:2309.08351
5
citations