2025 "multi-stage training" Papers
2 papers found
HoVLE: Unleashing the Power of Monolithic Vision-Language Models with Holistic Vision-Language Embedding
Chenxin Tao, Shiqian Su, Xizhou Zhu et al.
CVPR 2025 | poster | arXiv:2412.16158
5 citations
OptiScene: LLM-driven Indoor Scene Layout Generation via Scaled Human-aligned Data Synthesis and Multi-Stage Preference Optimization
Yixuan Yang, Zhen Luo, Tongsheng Ding et al.
NeurIPS 2025 | poster | arXiv:2506.07570
4 citations