AAAI 2024 "cross-attention mechanisms" Papers
2 papers found
Commonsense for Zero-Shot Natural Language Video Localization
Meghana Holla, Ismini Lourentzou
AAAI 2024 paper, arXiv:2312.17429
5 citations
Compositional Text-to-Image Synthesis with Attention Map Control of Diffusion Models
Ruichen Wang, Zekang Chen, Chen Chen et al.
AAAI 2024 paper, arXiv:2305.13921
92 citations