2025 "semantic alignment" Papers
14 papers found
Adaptive and Multi-scale Affinity Alignment for Hierarchical Contrastive Learning
Jiawei Huang, Minming Li, Hu Ding
NeurIPS 2025, poster
CREA: A Collaborative Multi-Agent Framework for Creative Image Editing and Generation
Kavana Venkatesh, Connor Dunlop, Pinar Yanardag
NeurIPS 2025, poster, arXiv:2504.05306
DiTTo-TTS: Diffusion Transformers for Scalable Text-to-Speech without Domain-Specific Factors
Keon Lee, Dong Won Kim, Jaehyeon Kim et al.
ICLR 2025, poster, arXiv:2406.11427
28 citations
GSAlign: Geometric and Semantic Alignment Network for Aerial-Ground Person Re-Identification
Qiao Li, Jie Li, Yukang Zhang et al.
NeurIPS 2025, poster, arXiv:2510.22268
1 citation
Layered Image Vectorization via Semantic Simplification
Zhenyu Wang, Jianxi Huang, Zhida Sun et al.
CVPR 2025, poster, arXiv:2406.05404
9 citations
Learning a Cross-Modal Schrödinger Bridge for Visual Domain Generalization
Hao Zheng, Jingjun Yi, Qi Bi et al.
NeurIPS 2025, poster
MLLM-For3D: Adapting Multimodal Large Language Model for 3D Reasoning Segmentation
Jiaxin Huang, Runnan Chen, Ziwen Li et al.
NeurIPS 2025, poster, arXiv:2503.18135
8 citations
OmniZoom: A Universal Plug-and-Play Paradigm for Cross-Device Smooth Zoom Interpolation
Xiaoan Zhu, Yue Zhao, Tianyang Hu et al.
NeurIPS 2025, poster
OOD-Barrier: Build a Middle-Barrier for Open-Set Single-Image Test Time Adaptation via Vision Language Models
Boyang Peng, Sanqing Qu, Tianpei Zou et al.
NeurIPS 2025, poster
Rebalancing Contrastive Alignment with Bottlenecked Semantic Increments in Text-Video Retrieval
Jian Xiao, Zijie Song, Jialong Hu et al.
NeurIPS 2025, poster, arXiv:2505.12499
RespoDiff: Dual-Module Bottleneck Transformation for Responsible & Faithful T2I Generation
Silpa Vadakkeeveetil Sreelatha, Sauradip Nag, Muhammad Awais et al.
NeurIPS 2025, poster, arXiv:2509.15257
SAGI: Semantically Aligned and Uncertainty Guided AI Image Inpainting
Paschalis Giakoumoglou, Dimitrios Karageorgiou, Symeon Papadopoulos et al.
ICCV 2025, poster, arXiv:2502.06593
2 citations
Towards Semantic Equivalence of Tokenization in Multimodal LLM
Shengqiong Wu, Hao Fei, Xiangtai Li et al.
ICLR 2025, poster, arXiv:2406.05127
58 citations
VideoDPO: Omni-Preference Alignment for Video Diffusion Generation
Runtao Liu, Haoyu Wu, Zheng Ziqiang et al.
CVPR 2025, poster, arXiv:2412.14167
68 citations