ICML "long sequence modeling" Papers
4 papers found
LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens
Yiran Ding, Li Lyna Zhang, Chengruidong Zhang et al.
ICML 2024, poster, arXiv:2402.13753
State-Free Inference of State-Space Models: The *Transfer Function* Approach
Rom N. Parnichkun, Stefano Massaroli, Alessandro Moro et al.
ICML 2024, poster
Various Lengths, Constant Speed: Efficient Language Modeling with Lightning Attention
Zhen Qin, Weigao Sun, Dong Li et al.
ICML 2024, poster, arXiv:2405.17381
xT: Nested Tokenization for Larger Context in Large Images
Ritwik Gupta, Shufan Li, Tyler Zhu et al.
ICML 2024, poster, arXiv:2403.01915