Papers on "long-range dependencies"
2 papers found
Cached Transformers: Improving Transformers with Differentiable Memory Cache
Zhaoyang Zhang, Wenqi Shao, Yixiao Ge et al.
AAAI 2024, arXiv:2312.12742
5 citations
S2WAT: Image Style Transfer via Hierarchical Vision Transformer Using Strips Window Attention
Chiyu Zhang, Xiaogang Xu, Lei Wang et al.
AAAI 2024, arXiv:2210.12381
46 citations