"audio-visual question answering" Papers
2 papers found
PAVE: Patching and Adapting Video Large Language Models
Zhuoming Liu, Yiquan Li, Khoi D Nguyen et al.
CVPR 2025, poster, arXiv:2503.19794
1 citation
Object-Aware Adaptive-Positivity Learning for Audio-Visual Question Answering
Zhangbin Li, Jinxing Zhou, Dan Guo et al.
AAAI 2024, paper, arXiv:2312.12816
24 citations