ICLR 2025 "decoder-only architectures" Papers
2 papers found
Making Text Embedders Few-Shot Learners
Chaofan Li, Minghao Qin, Shitao Xiao et al.
ICLR 2025, poster, arXiv:2409.15700
86 citations
PolyPythias: Stability and Outliers across Fifty Language Model Pre-Training Runs
Oskar van der Wal, Pietro Lesci, Max Müller-Eberstein et al.
ICLR 2025, poster, arXiv:2503.09543
14 citations