"next-token prediction" Papers
11 papers found
Context-Aware Regularization with Markovian Integration for Attention-Based Nucleotide Analysis
Mohammad Saleh Refahi, Mahdi Abavisani, Bahrad Sokhansanj et al.
NeurIPS 2025posterarXiv:2507.09378
Correlation and Navigation in the Vocabulary Key Representation Space of Language Models
Letian Peng, Chenyang An, Jingbo Shang
ICLR 2025posterarXiv:2410.02284
OmniGen-AR: AutoRegressive Any-to-Image Generation
Junke Wang, Xun Wang, Qiushan Guo et al.
NeurIPS 2025poster
PanoLlama: Generating Endless and Coherent Panoramas with Next-Token-Prediction LLMs
Teng Zhou, Xiaoyu Zhang, Yongchuan Tang
ICCV 2025highlightarXiv:2411.15867
4
citations
Re-Thinking Inverse Graphics With Large Language Models
Haiwen Feng, Michael J Black, Weiyang Liu et al.
ICLR 2025posterarXiv:2404.15228
15
citations
Scaling Laws for Gradient Descent and Sign Descent for Linear Bigram Models under Zipf’s Law
Frederik Kunstner, Francis Bach
NeurIPS 2025posterarXiv:2505.19227
7
citations
VladVA: Discriminative Fine-tuning of LVLMs
Yassine Ouali, Adrian Bulat, ALEXANDROS XENOS et al.
CVPR 2025posterarXiv:2412.04378
11
citations
Auto-Regressive Next-Token Predictors are Universal Learners
Eran Malach
ICML 2024poster
Tandem Transformers for Inference Efficient LLMs
Aishwarya P S, Pranav Nair, Yashas Samaga et al.
ICML 2024poster
The Pitfalls of Next-Token Prediction
Gregor Bachmann, Vaishnavh Nagarajan
ICML 2024poster
Understanding Reasoning Ability of Language Models From the Perspective of Reasoning Paths Aggregation
Xinyi Wang, Alfonso Amayuelas, Kexun Zhang et al.
ICML 2024poster