NEURIPS 2025 "multimodal large language model" Papers
3 papers found
JarvisArt: Liberating Human Artistic Creativity via an Intelligent Photo Retouching Agent
Yunlong Lin, Zixu Lin, Kunjie Lin et al.
NEURIPS 2025posterarXiv:2506.17612
9
citations
JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation
Kai Liu, Jungang Li, Yuchong Sun et al.
NEURIPS 2025oralarXiv:2512.22905
4
citations
MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO
Yicheng Xiao, Lin Song, Yukang Chen et al.
NEURIPS 2025posterarXiv:2505.13031
19
citations