Le Zhuo
8
Papers
314
Total Citations
Papers (8)
T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT
NeurIPS 2025arXiv
91
citations
VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection
CVPR 2025
54
citations
Lumina-Image 2.0: A Unified and Efficient Image Generative Framework
ICCV 2025arXiv
52
citations
LLaVA-MoD: Making LLaVA Tiny via MoE-Knowledge Distillation
ICLR 2025arXiv
34
citations
From Reflection to Perfection: Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning
ICCV 2025arXiv
28
citations
PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions
ICLR 2025arXiv
26
citations
VisualCloze: A Universal Image Generation Framework via Visual In-Context Learning
ICCV 2025arXiv
20
citations
Lumina-T2X: Scalable Flow-based Large Diffusion Transformer for Flexible Resolution Generation
ICLR 2025
9
citations