Papers by Jianqiao Lu
4 papers found
DeltaFormer: Unlock the state space of Transformer
Mingyu Xu, Tenglong Ao, Jiaao He et al.
NeurIPS 2025 poster
FlexPrefill: A Context-Aware Sparse Attention Mechanism for Efficient Long-Sequence Inference
Xunhao Lai, Jianqiao Lu, Yao Luo et al.
ICLR 2025 poster, arXiv:2502.20766
51 citations
FormalAlign: Automated Alignment Evaluation for Autoformalization
Jianqiao Lu, Yingjia Wan, Yinya Huang et al.
ICLR 2025 poster
Model Merging in Pre-training of Large Language Models
Yunshui Li, Yiyuan Ma, Shen Yan et al.
NeurIPS 2025 poster