2025 Poster "speculative decoding" Papers
14 papers found
Approximately Aligned Decoding
Daniel Melcer, Sujan Kumar Gonugondla, Pramuditha Perera et al.
NeurIPS 2025posterarXiv:2410.01103
2
citations
Block Verification Accelerates Speculative Decoding
Ziteng Sun, Uri Mendlovic, Yaniv Leviathan et al.
ICLR 2025posterarXiv:2403.10444
18
citations
EasySpec: Layer-Parallel Speculative Decoding for Efficient Multi-GPU Utilization
Yize Wu, KE GAO, Ling Li et al.
NeurIPS 2025posterarXiv:2502.02493
1
citations
GRIFFIN: Effective Token Alignment for Faster Speculative Decoding
Shijing Hu, Jingyang Li, Xingyu Xie et al.
NeurIPS 2025posterarXiv:2502.11018
3
citations
Grouped Speculative Decoding for Autoregressive Image Generation
Junhyuk So, Juncheol Shin, Hyunho Kook et al.
ICCV 2025posterarXiv:2508.07747
3
citations
MagicDec: Breaking the Latency-Throughput Tradeoff for Long Context Generation with Speculative Decoding
Ranajoy Sadhukhan, Jian Chen, Zhuoming Chen et al.
ICLR 2025posterarXiv:2408.11049
61
citations
SpecEM: Training-Free LLM Ensembling via Iterative Drafting, Verification, and Online Feedback
Bo Lv, Nayu Liu, Chen Tang et al.
NeurIPS 2025poster
SpecReason: Fast and Accurate Inference-Time Compute via Speculative Reasoning
Rui Pan, Yinwei Dai, Zhihao Zhang et al.
NeurIPS 2025posterarXiv:2504.07891
35
citations
Speculative Jacobi-Denoising Decoding for Accelerating Autoregressive Text-to-image Generation
Yao Teng, Fu-Yun Wang, Xian Liu et al.
NeurIPS 2025posterarXiv:2510.08994
Speculative RAG: Enhancing Retrieval Augmented Generation through Drafting
Zilong (Ryan) Wang, Zifeng Wang, Long Le et al.
ICLR 2025posterarXiv:2407.08223
75
citations
SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration
Heming Xia, Yongqi Li, Jun Zhang et al.
ICLR 2025posterarXiv:2410.06916
39
citations
Towards Better & Faster Autoregressive Image Generation: From the Perspective of Entropy
Xiaoxiao Ma, Feng Zhao, Pengyang Ling et al.
NeurIPS 2025posterarXiv:2510.09012
3
citations
Towards Optimal Multi-draft Speculative Decoding
Zhengmian Hu, Tong Zheng, Vignesh Viswanathan et al.
ICLR 2025posterarXiv:2502.18779
11
citations
ViSpec: Accelerating Vision-Language Models with Vision-Aware Speculative Decoding
Jialiang Kang, Han Shu, Wenshuo Li et al.
NeurIPS 2025posterarXiv:2509.15235
2
citations