Xiaohan Zhang
6
Papers
1,646
Total Citations
2
h-index
Papers (6)
CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer
ICLR 2025
1,318
citations
LVBench: An Extreme Long Video Understanding Benchmark
ICCV 2025
208
citations
KoLA: Carefully Benchmarking World Knowledge of Large Language Models
ICLR 2024
85
citations
Token-Level Contrastive Learning with Modality-Aware Prompting for Multimodal Intent Recognition
AAAI 2024arXiv
33
citations
Toy-GS: Assembling Local Gaussians for Precisely Rendering Large-Scale Free Camera Trajectories
AAAI 2025
2
citations
OpenEQA: Embodied Question Answering in the Era of Foundation Models
CVPR 2024
0
citations