"transformer layers" Papers
3 papers found
Layer Swapping for Zero-Shot Cross-Lingual Transfer in Large Language Models
Lucas Bandarkar, Benjamin Muller, Pritish Yuvraj et al.
ICLR 2025posterarXiv:2410.01335
13
citations
Lines of Thought in Large Language Models
Raphaël Sarfati, Toni Liu, Nicolas Boulle et al.
ICLR 2025posterarXiv:2410.01545
1
citations
Whose Instructions Count? Resolving Preference Bias in Instruction Fine-Tuning
Jiayu Zhang, Changbang Li, Yinan Peng et al.
NEURIPS 2025poster