NeurIPS 2025 "inference efficiency" Papers
4 papers found
Can LLMs Outshine Conventional Recommenders? A Comparative Evaluation
Qijiong Liu, Jieming Zhu, Lu Fan et al.
NeurIPS 2025, poster, arXiv:2503.05493
4 citations
Depth-Width Tradeoffs for Transformers on Graph Tasks
Gilad Yehudai, Clayton Sanford, Maya Bechler-Speicher et al.
NeurIPS 2025, spotlight
RepLDM: Reprogramming Pretrained Latent Diffusion Models for High-Quality, High-Efficiency, High-Resolution Image Generation
Boyuan Cao, Jiaxin Ye, Yujie Wei et al.
NeurIPS 2025, spotlight, arXiv:2410.06055
9 citations
Týr-the-Pruner: Structural Pruning LLMs via Global Sparsity Distribution Optimization
Guanchen Li, Yixing Xu, Zeping Li et al.
NeurIPS 2025, poster, arXiv:2503.09657
6 citations