"multimodal comprehension" Papers
3 papers found
GRAPHGPT-O: Synergistic Multimodal Comprehension and Generation on Graphs
Yi Fang, Bowen Jin, Jiacheng Shen et al.
CVPR 2025posterarXiv:2502.11925
3
citations
SAR3D: Autoregressive 3D Object Generation and Understanding via Multi-scale 3D VQVAE
YONGWEI CHEN, Yushi Lan, Shangchen Zhou et al.
CVPR 2025posterarXiv:2411.16856
21
citations
Auto-Encoding Morph-Tokens for Multimodal LLM
Kaihang Pan, Siliang Tang, Juncheng Li et al.
ICML 2024spotlight