Di ZHANG
18
Papers
174
Total Citations
Papers (18)
Learning Multi-Dimensional Human Preference for Text-to-Image Generation
CVPR 2024
76
citations
GameFactory: Creating New Games with Generative Interactive Videos
ICCV 2025
63
citations
Unleashing the Potential of Multi-modal Foundation Models and Video Diffusion for 4D Dynamic Physical Scene Simulation
CVPR 2025
17
citations
SketchVideo: Sketch-based Video Generation and Editing
CVPR 2025
8
citations
GPAvatar: High-fidelity Head Avatars by Learning Efficient Gaussian Projections
CVPR 2025
3
citations
GGTalker: Talking Head Systhesis with Generalizable Gaussian Priors and Identity-Specific Adaptation
ICCV 2025
3
citations
PatchVSR: Breaking Video Diffusion Resolution Limits with Patch-wise Video Super-Resolution
CVPR 2025
3
citations
Libra-Merging: Importance-redundancy and Pruning-merging Trade-off for Acceleration Plug-in in Large Vision-Language Model
CVPR 2025
1
citations
MUSE: Multi-Subject Unified Synthesis via Explicit Layout Semantic Expansion
ICCV 2025
0
citations
ReCamMaster: Camera-Controlled Generative Rendering from A Single Video
ICCV 2025
0
citations
How Far are AI-generated Videos from Simulating the 3D Visual World: A Learned 3D Evaluation Approach
ICCV 2025
0
citations
Scene Graph Guided Generation: Enable Accurate Relations Generation in Text-to-Image Models via Textural Rectification
ICCV 2025
0
citations
FullDiT: Video Generative Foundation Models with Multimodal Control via Full Attention
ICCV 2025
0
citations
Video-LaVIT: Unified Video-Language Pre-training with Decoupled Visual-Motional Tokenization
ICML 2024
0
citations
StyleMaster: Stylize Your Video with Artistic Generation and Translation
CVPR 2025
0
citations
Towards Precise Scaling Laws for Video Diffusion Transformers
CVPR 2025
0
citations
Imbalance in Balance: Online Concept Balancing in Generation Models
ICCV 2025
0
citations
Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content
CVPR 2025
0
citations