2025 "sequence length reduction" Papers
2 papers found
Beyond Next Token Prediction: Patch-Level Training for Large Language Models
Chenze Shao, Fandong Meng, Jie Zhou
ICLR 2025posterarXiv:2407.12665
2
citations
RHYTHM: Reasoning with Hierarchical Temporal Tokenization for Human Mobility
Haoyu He, Haozheng Luo, Yan Chen et al.
NeurIPS 2025oralarXiv:2509.23115
1
citations