Poster "sequence length scaling" Papers
2 papers found
StreamBP: Memory-Efficient Exact Backpropagation for Long Sequence Training of LLMs
Qijun Luo, Mengqi Li, Lei Zhao et al.
NeurIPS 2025posterarXiv:2506.03077
1
citations
Benchmarking and Building Long-Context Retrieval Models with LoCo and M2-BERT
Jon Saad-Falcon, Daniel Y Fu, Simran Arora et al.
ICML 2024posterarXiv:2402.07440