Weigao Sun
5
Papers
30
Total Citations
Papers (5)
CO2: Efficient Distributed Training with Full Communication-Computation Overlap
ICLR 2024
15
citations
Liger: Linearizing Large Language Models to Gated Recurrent Structures
ICML 2025
11
citations
Sequence Accumulation and Beyond: Infinite Context Length on Single GPU and Large Clusters
AAAI 2025
3
citations
Improving Bilinear RNN with Closed-loop Control
NeurIPS 2025
1
citations
Various Lengths, Constant Speed: Efficient Language Modeling with Lightning Attention
ICML 2024
0
citations