Highlight "image captioning" Papers
3 papers found
Conference
Embodied Image Captioning: Self-supervised Learning Agents for Spatially Coherent Image Descriptions
Tommaso Galliena, Tommaso Apicella, Stefano Rosa et al.
ICCV 2025 | highlight | arXiv:2504.08531
1 citation
Flowing from Words to Pixels: A Noise-Free Framework for Cross-Modality Evolution
Qihao Liu, Xi Yin, Alan L. Yuille et al.
CVPR 2025 | highlight | arXiv:2412.15213
12 citations
Monkey: Image Resolution and Text Label Are Important Things for Large Multi-modal Models
Zhang Li, Biao Yang, Qiang Liu et al.
CVPR 2024 | highlight | arXiv:2311.06607
392 citations