"long sequence modeling" Papers
12 papers found
2DMamba: Efficient State Space Model for Image Representation with Applications on Giga-Pixel Whole Slide Image Classification
Jingwei Zhang, Anh Tien Nguyen, Xi Han et al.
CVPR 2025 · poster · arXiv:2412.00678
20 citations
Dendritic Resonate-and-Fire Neuron for Effective and Efficient Long Sequence Modeling
Dehao Zhang, Malu Zhang, Shuai Wang et al.
NeurIPS 2025 · oral · arXiv:2509.17186
2 citations
Training Free Exponential Context Extension via Cascading KV Cache
Jeff Willette, Heejun Lee, Youngwan Lee et al.
ICLR 2025 · poster · arXiv:2406.17808
3 citations
Why RoPE Struggles to Maintain Long-Term Decay in Long Sequences?
Wei Shen, Chao Yin, Yuliang Liu et al.
ICLR 2025 · poster
ZETA: Leveraging $Z$-order Curves for Efficient Top-$k$ Attention
Qiuhao Zeng, Jierui Huang, Peng Lu et al.
ICLR 2025 · poster · arXiv:2501.14577
5 citations
LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens
Yiran Ding, Li Lyna Zhang, Chengruidong Zhang et al.
ICML 2024 · poster
Motion Mamba: Efficient and Long Sequence Motion Generation
Zeyu Zhang, Akide Liu, Ian Reid et al.
ECCV 2024 · poster · arXiv:2403.07487
108 citations
MuLTI: Efficient Video-and-Language Understanding with Text-Guided MultiWay-Sampler and Multiple Choice Modeling
Jiaqi Xu, Bo Liu, Yunkuo Chen et al.
AAAI 2024 · paper · arXiv:2303.05707
2 citations
SeTformer Is What You Need for Vision and Language
Pourya Shamsolmoali, Masoumeh Zareapoor, Eric Granger et al.
AAAI 2024 · paper · arXiv:2401.03540
7 citations
State-Free Inference of State-Space Models: The Transfer Function Approach
Rom N. Parnichkun, Stefano Massaroli, Alessandro Moro et al.
ICML 2024 · poster
Various Lengths, Constant Speed: Efficient Language Modeling with Lightning Attention
Zhen Qin, Weigao Sun, Dong Li et al.
ICML 2024 · poster
xT: Nested Tokenization for Larger Context in Large Images
Ritwik Gupta, Shufan Li, Tyler Zhu et al.
ICML 2024 · poster