ICLR 2025 "state space models" Papers
3 papers found
Hymba: A Hybrid-head Architecture for Small Language Models
Xin Dong, Yonggan Fu, Shizhe Diao et al.
ICLR 2025posterarXiv:2411.13676
55
citations
Revisiting Convolution Architecture in the Realm of DNA Foundation Models
Yu Bo, Weian Mao, Daniel Shao et al.
ICLR 2025posterarXiv:2502.18538
4
citations
State Space Models are Provably Comparable to Transformers in Dynamic Token Selection
Naoki Nishikawa, Taiji Suzuki
ICLR 2025posterarXiv:2405.19036
6
citations