Peize Sun
Papers: 5
Total Citations: 17
Papers (5)
Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM
ICCV 2025, arXiv
10 citations
IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model
ICLR 2025, arXiv
7 citations
Goku: Flow Based Video Generative Foundation Models
CVPR 2025, arXiv
0 citations
PerceptionLM: Open-Access Data and Models for Detailed Visual Understanding
NeurIPS 2025, arXiv
0 citations
RoboCodeX: Multimodal Code Generation for Robotic Behavior Synthesis
ICML 2024
0 citations