CVPR "multimodal language models" Papers
2 papers found
Coarse Correspondences Boost Spatial-Temporal Reasoning in Multimodal Language Model
Benlin Liu, Yuhao Dong, Yiqin Wang et al.
CVPR 2025posterarXiv:2408.00754
9
citations
StarVector: Generating Scalable Vector Graphics Code from Images and Text
Juan Rodriguez, Abhay Puri, Shubham Agarwal et al.
CVPR 2025posterarXiv:2312.11556
30
citations