2025 "next token prediction" Papers
4 papers found
A Simple yet Effective Layout Token in Large Language Models for Document Understanding
Zhaoqing Zhu, Chuwei Luo, Zirui Shao et al.
CVPR 2025, poster, arXiv:2503.18434
7 citations
Beyond Next Token Prediction: Patch-Level Training for Large Language Models
Chenze Shao, Fandong Meng, Jie Zhou
ICLR 2025, poster, arXiv:2407.12665
2 citations
Bigram Subnetworks: Mapping to Next Tokens in Transformer Language Models
Tyler Chang, Benjamin Bergen
NeurIPS 2025, spotlight, arXiv:2504.15471
1 citation
Context Steering: Controllable Personalization at Inference Time
Zhiyang He, Sashrika Pandey, Mariah Schrum et al.
ICLR 2025, poster, arXiv:2405.01768
11 citations