ICLR "state space models" Papers
8 papers found
Autocorrelation Matters: Understanding the Role of Initialization Schemes for State Space Models
Fusheng Liu, Qianxiao Li
ICLR 2025oralarXiv:2411.19455
6
citations
Drama: Mamba-Enabled Model-Based Reinforcement Learning Is Sample and Parameter Efficient
Wenlong Wang, Ivana Dusparic, Yucheng Shi et al.
ICLR 2025posterarXiv:2410.08893
3
citations
Hymba: A Hybrid-head Architecture for Small Language Models
Xin Dong, Yonggan Fu, Shizhe Diao et al.
ICLR 2025posterarXiv:2411.13676
55
citations
Revisiting Convolution Architecture in the Realm of DNA Foundation Models
Yu Bo, Weian Mao, Daniel Shao et al.
ICLR 2025posterarXiv:2502.18538
4
citations
RFMamba: Frequency-Aware State Space Model for RF-Based Human-Centric Perception
Rui Zhang, Ruixu Geng, Yadong Li et al.
ICLR 2025poster
2
citations
Sports-Traj: A Unified Trajectory Generation Model for Multi-Agent Movement in Sports
Yi Xu, Yun Fu
ICLR 2025oralarXiv:2405.17680
10
citations
State Space Models are Provably Comparable to Transformers in Dynamic Token Selection
Naoki Nishikawa, Taiji Suzuki
ICLR 2025posterarXiv:2405.19036
6
citations
ThunderKittens: Simple, Fast, and $\textit{Adorable}$ Kernels
Benjamin Spector, Simran Arora, Aaryan Singhal et al.
ICLR 2025poster
3
citations