Falcon: Faster and Parallel Inference of Large Language Models Through Enhanced Semi-Autoregressive Drafting and Custom-Designed Decoding Tree

15citations
PDFProject
15
Citations
#136
in AAAI 2025
of 3028 papers
4
Authors
1
Data Points

Citation History

Jan 27, 2026
15