"next token prediction" Papers
4 papers found
A Simple yet Effective Layout Token in Large Language Models for Document Understanding
Zhaoqing Zhu, Chuwei Luo, Zirui Shao et al.
CVPR 2025 poster, arXiv:2503.18434
7 citations
Arrows of Time for Large Language Models
Vassilis Papadopoulos, Jérémie Wenger, Clement Hongler
ICML 2024 poster
How do Transformers Perform In-Context Autoregressive Learning?
Michael Sander, Raja Giryes, Taiji Suzuki et al.
ICML 2024 poster
On the Origins of Linear Representations in Large Language Models
Yibo Jiang, Goutham Rajendran, Pradeep Ravikumar et al.
ICML 2024 poster