Poster "decoder-only transformer" Papers
5 papers found
EAReranker: Efficient Embedding Adequacy Assessment for Retrieval Augmented Generation
Dongyang Zeng, Yaping Liu, Wei Zhang et al.
NeurIPS 2025poster
Time-MoE: Billion-Scale Time Series Foundation Models with Mixture of Experts
Xiaoming Shi, Shiyu Wang, Yuqi Nie et al.
ICLR 2025posterarXiv:2409.16040
178
citations
Denoising Autoregressive Representation Learning
Yazhe Li, Jorg Bornschein, Ting Chen
ICML 2024poster
StableMask: Refining Causal Masking in Decoder-only Transformer
Qingyu Yin, Xuzheng He, Xiang Zhuang et al.
ICML 2024poster
VideoPoet: A Large Language Model for Zero-Shot Video Generation
Dan Kondratyuk, Lijun Yu, Xiuye Gu et al.
ICML 2024poster