2025 "multimodal generation" Papers
4 papers found
Generator Matching: Generative modeling with arbitrary Markov processes
Peter Holderrieth, Marton Havasi, Jason Yim et al.
ICLR 2025posterarXiv:2410.20587
43
citations
LMFusion: Adapting Pretrained Language Models for Multimodal Generation
Weijia Shi, Xiaochuang Han, Chunting Zhou et al.
NeurIPS 2025posterarXiv:2412.15188
79
citations
RapVerse: Coherent Vocals and Whole-Body Motion Generation from Text
Jiaben Chen, Xin Yan, Yihang Chen et al.
ICCV 2025posterarXiv:2405.20336
3
citations
Show-o: One Single Transformer to Unify Multimodal Understanding and Generation
Jinheng Xie, Weijia Mao, Zechen Bai et al.
ICLR 2025posterarXiv:2408.12528
455
citations