Poster "parallel token prediction" Papers
2 papers found
DINGO: Constrained Inference for Diffusion LLMs
Tarun Suresh, Debangshu Banerjee, Shubham Ugare et al.
NeurIPS 2025, poster, arXiv:2505.23061
3 citations
Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads
Tianle Cai, Yuhong Li, Zhengyang Geng et al.
ICML 2024, poster, arXiv:2401.10774