2025 "language models" Papers
12 papers found
Better Estimation of the Kullback--Leibler Divergence Between Language Models
Afra Amini, Tim Vieira, Ryan Cotterell
NeurIPS 2025posterarXiv:2504.10637
Emergence of Linear Truth Encodings in Language Models
Shauli Ravfogel, Gilad Yehudai, Tal Linzen et al.
NeurIPS 2025posterarXiv:2510.15804
3
citations
Fragment and Geometry Aware Tokenization of Molecules for Structure-Based Drug Design Using Language Models
Cong Fu, Xiner Li, Blake Olson et al.
ICLR 2025posterarXiv:2408.09730
10
citations
Mechanistic Permutability: Match Features Across Layers
Nikita Balagansky, Ian Maksimov, Daniil Gavrilov
ICLR 2025posterarXiv:2410.07656
14
citations
Multi-modal Learning: A Look Back and the Road Ahead
Divyam Madaan, Sumit Chopra, Kyunghyun Cho
ICLR 2025poster
MUSE: Machine Unlearning Six-Way Evaluation for Language Models
Weijia Shi, Jaechan Lee, Yangsibo Huang et al.
ICLR 2025posterarXiv:2407.06460
157
citations
Revisiting Random Walks for Learning on Graphs
Jinwoo Kim, Olga Zaghen, Ayhan Suleymanzade et al.
ICLR 2025posterarXiv:2407.01214
7
citations
TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models
Makoto Shing, Kou Misaki, Han Bao et al.
ICLR 2025oralarXiv:2501.16937
12
citations
The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning
Xinyu Zhu, Mengzhou Xia, Zhepei Wei et al.
NeurIPS 2025posterarXiv:2506.01347
74
citations
TopoNets: High performing vision and language models with brain-like topography
Mayukh Deb, Mainak Deb, Apurva Murty
ICLR 2025posterarXiv:2501.16396
11
citations
Transformers without Normalization
Jiachen Zhu, Xinlei Chen, Kaiming He et al.
CVPR 2025posterarXiv:2503.10622
96
citations
Vision-and-Language Training Helps Deploy Taxonomic Knowledge but Does Not Fundamentally Alter It
Yulu Qin, Dheeraj Varghese, Adam Dahlgren Lindström et al.
NeurIPS 2025oralarXiv:2507.13328