Peize Sun
5
Papers
110
Total Citations
Papers (5)
Goku: Flow Based Video Generative Foundation Models
CVPR 2025arXiv
53
citations
PerceptionLM: Open-Access Data and Models for Detailed Visual Understanding
NeurIPS 2025arXiv
40
citations
Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM
ICCV 2025
10
citations
IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model
ICLR 2025
7
citations
RoboCodeX: Multimodal Code Generation for Robotic Behavior Synthesis
ICML 2024
0
citations