2024 "multimodal tasks" Papers
2 papers found
KAM-CoT: Knowledge Augmented Multimodal Chain-of-Thoughts Reasoning
Debjyoti Mondal, Suraj Modi, Subhadarshi Panda et al.
AAAI 2024paperarXiv:2401.12863
78
citations
Unveiling Typographic Deceptions: Insights of the Typographic Vulnerability in Large Vision-Language Models
Hao Cheng, Erjia Xiao, Jindong Gu et al.
ECCV 2024posterarXiv:2402.19150
15
citations