Dongxu Li

3

Papers

0

Total Citations

Papers (3)

EZSR: Event-based Zero-Shot Recognition

VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation

X-InstructBLIP: A Framework for Aligning Image, 3D, Audio, Video to LLMs and its Emergent Cross-modal Reasoning