Peize Sun

Papers: 5

Total Citations: 110

Papers (5)

Goku: Flow Based Video Generative Foundation Models

PerceptionLM: Open-Access Data and Models for Detailed Visual Understanding

NeurIPS 2025, arXiv

Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM

IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model

RoboCodeX: Multimodal Code Generation for Robotic Behavior Synthesis