NEURIPS "language modeling" Papers
10 papers found
Accelerated Sampling from Masked Diffusion Models via Entropy Bounded Unmasking
Heli Ben-Hamu, Itai Gat, Daniel Severo et al.
NEURIPS 2025 | poster | arXiv:2505.24857
40 citations
Continuous Diffusion Model for Language Modeling
Jaehyeong Jo, Sung Ju Hwang
NEURIPS 2025 | poster | arXiv:2502.11564
4 citations
DeltaProduct: Improving State-Tracking in Linear RNNs via Householder Products
Julien Siems, Timur Carstensen, Arber Zela et al.
NEURIPS 2025 | poster | arXiv:2502.10297
23 citations
Flexible Language Modeling in Continuous Space with Transformer-based Autoregressive Flows
Ruixiang Zhang, Shuangfei Zhai, Jiatao Gu et al.
NEURIPS 2025 | poster | arXiv:2507.00425
4 citations
From Bytes to Ideas: Language Modeling with Autoregressive U-Nets
Mathurin Videau, Badr Youbi Idrissi, Alessandro Leite et al.
NEURIPS 2025 | poster | arXiv:2506.14761
5 citations
Generalized Gradient Norm Clipping & Non-Euclidean $(L_0,L_1)$-Smoothness
Thomas Pethick, Wanyun Xie, Mete Erdogan et al.
NEURIPS 2025 | oral | arXiv:2506.01913
7 citations
Improving Bilinear RNN with Closed-loop Control
Jiaxi Hu, Yongqi Pan, Jusen Du et al.
NEURIPS 2025 | spotlight | arXiv:2506.02475
3 citations
Nested Learning: The Illusion of Deep Learning Architectures
Ali Behrouz, Meisam Razaviyayn, Peilin Zhong et al.
NEURIPS 2025 | poster | arXiv:2512.24695
12 citations
Next Semantic Scale Prediction via Hierarchical Diffusion Language Models
Cai Zhou, Chenyu Wang, Dinghuai Zhang et al.
NEURIPS 2025 | poster | arXiv:2510.08632
3 citations
ShortListing Model: A Streamlined Simplex Diffusion for Discrete Variable Generation
Yuxuan Song, Zhe Zhang, Yu Pei et al.
NEURIPS 2025 | poster
1 citation