ICLR Poster "decoder-only transformers" Papers
2 papers found
Seq-VCR: Preventing Collapse in Intermediate Transformer Representations for Enhanced Reasoning
Md Rifat Arefin, Gopeshh Raaj Subbaraj, Nicolas Gontier et al.
ICLR 2025posterarXiv:2411.02344
4
citations
Towards Neural Scaling Laws for Time Series Foundation Models
Qingren Yao, Chao-Han Huck Yang, Renhe Jiang et al.
ICLR 2025posterarXiv:2410.12360