AAAI "long-range dependencies" Papers
4 papers found
Large Language Model Meets Graph Neural Network in Knowledge Distillation
Shengxiang Hu, Guobing Zou, Song Yang et al.
AAAI 2025paperarXiv:2402.05894
14
citations
SparX: A Sparse Cross-Layer Connection Mechanism for Hierarchical Vision Mamba and Transformer Networks
Meng Lou, Yunxiang Fu, Yizhou Yu
AAAI 2025paperarXiv:2409.09649
24
citations
Cached Transformers: Improving Transformers with Differentiable Memory Cached
Zhaoyang Zhang, Wenqi Shao, Yixiao Ge et al.
AAAI 2024paperarXiv:2312.12742
5
citations
S2WAT: Image Style Transfer via Hierarchical Vision Transformer Using Strips Window Attention
Chiyu Zhang, Xiaogang Xu, Lei Wang et al.
AAAI 2024paperarXiv:2210.12381
46
citations