Ming-Yu Liu
11 Papers
393 Total Citations
Papers (11)
CoT-VLA: Visual Chain-of-Thought Reasoning for Vision-Language-Action Models
CVPR 2025
203
citations
Describe Anything: Detailed Localized Image and Video Captioning
ICCV 2025
49
citations
JeDi: Joint-Image Diffusion Models for Finetuning-Free Personalized Text-to-Image Generation
CVPR 2024
47
citations
Visual Fact Checker: Enabling High-Fidelity Detailed Caption Generation
CVPR 2024
33
citations
Condition-Aware Neural Network for Controlled Image Generation
CVPR 2024
17
citations
Efficient Part-level 3D Object Generation via Dual Volume Packing
NeurIPS 2025
16
citations
Dynamic Camera Poses and Where to Find Them
CVPR 2025
15
citations
ArtiScene: Language-Driven Artistic 3D Scene Generation Through Image Intermediary
CVPR 2025
6
citations
A Comprehensive Study of Decoder-Only LLMs for Text-to-Image Generation
CVPR 2025
4
citations
Articulated Kinematics Distillation from Video Diffusion Models
CVPR 2025
3
citations
HMAR: Efficient Hierarchical Masked Auto-Regressive Image Generation
CVPR 2025
0
citations