"causal language models" Papers
4 papers found
Lookahead Routing for Large Language Models
Canbin Huang, Tianyuan Shi, Yuhua Zhu et al.
NEURIPS 2025posterarXiv:2510.19506
Non-Markovian Discrete Diffusion with Causal Language Models
Yangtian Zhang, Sizhuang He, Daniel Levine et al.
NEURIPS 2025oralarXiv:2502.09767
1
citations
Reinforced Context Order Recovery for Adaptive Reasoning and Planning
Long Ma, Fangwei Zhong, Yizhou Wang
NEURIPS 2025posterarXiv:2508.13070
2
citations
StreamBP: Memory-Efficient Exact Backpropagation for Long Sequence Training of LLMs
Qijun Luo, Mengqi Li, Lei Zhao et al.
NEURIPS 2025posterarXiv:2506.03077
1
citations