2025 "auto-regressive decoding" Papers
2 papers found
Improving Model Representation and Reducing KV Cache via Skip Connections with First Value Heads
Zhoutong Wu, Yuan Zhang, Yiming Dong et al.
NeurIPS 2025posterarXiv:2510.16807
Self-Introspective Decoding: Alleviating Hallucinations for Large Vision-Language Models
Fushuo Huo, Wenchao Xu, Zhong Zhang et al.
ICLR 2025posterarXiv:2408.02032
61
citations