2024 "semantic alignment" Papers
8 papers found
Compositional Text-to-Image Synthesis with Attention Map Control of Diffusion Models
Ruichen Wang, Zekang Chen, Chen Chen et al.
AAAI 2024paperarXiv:2305.13921
92
citations
Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation
Xuelu Feng, Dongdong Chen, Junsong Yuan et al.
ECCV 2024posterarXiv:2403.12042
17
citations
Fast Text-to-3D-Aware Face Generation and Manipulation via Direct Cross-modal Mapping and Geometric Regularization
Jinlu Zhang, Yiyi Zhou, Qiancheng Zheng et al.
ICML 2024poster
GarmentAligner: Text-to-Garment Generation via Retrieval-augmented Multi-level Corrections
Shiyue Zhang, Zheng Chong, Xujie Zhang et al.
ECCV 2024posterarXiv:2408.12352
Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs
Ling Yang, Zhaochen Yu, Chenlin Meng et al.
ICML 2024poster
Prioritized Semantic Learning for Zero-shot Instance Navigation
Xinyu Sun, Lizhao Liu, Hongyan Zhi et al.
ECCV 2024posterarXiv:2403.11650
22
citations
Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation
Shilin Yan, Renrui Zhang, Ziyu Guo et al.
AAAI 2024paperarXiv:2305.16318
58
citations
Semantic Lens: Instance-Centric Semantic Alignment for Video Super-resolution
AAAI 2024paperarXiv:2312.07823