Dongxu Li
3
Papers
0
Total Citations
Papers (3)
EZSR: Event-based Zero-Shot Recognition
CVPR 2025arXiv
0
citations
VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation
CVPR 2025arXiv
0
citations
X-InstructBLIP: A Framework for Aligning Image, 3D, Audio, Video to LLMs and its Emergent Cross-modal Reasoning
ECCV 2024
0
citations