"multi-modal transformer" Papers
3 papers found
UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation
Lunhao Duan, Shanshan Zhao, Wenjun Yan et al.
CVPR 2025posterarXiv:2412.18928
7
citations
DocFormerv2: Local Features for Document Understanding
Srikar Appalaraju, Peng Tang, Qi Dong et al.
AAAI 2024paperarXiv:2306.01733
58
citations
Probabilistic Image-Driven Traffic Modeling via Remote Sensing
Scott Workman, Armin Hadzic
ECCV 2024posterarXiv:2403.05521