Di Zhang
6
Papers
152
Total Citations
Papers (6)
Improving Video Generation with Human Feedback
NeurIPS 2025
106
citations
Critic-V: VLM Critics Help Catch VLM Errors in Multimodal Reasoning
CVPR 2025arXiv
30
citations
OmniSync: Towards Universal Lip Synchronization via Diffusion Transformers
NeurIPS 2025
12
citations
MODA: MOdular Duplex Attention for Multimodal Perception, Cognition, and Emotion Understanding
ICML 2025
4
citations
ChemVLM: Exploring the Power of Multimodal Large Language Models in Chemistry Area
AAAI 2025
0
citations
CERTAIN: Context Uncertainty-aware One-Shot Adaptation for Context-based Offline Meta Reinforcement Learning
ICML 2025
0
citations