Yibing Song
9 Papers
361 Total Citations
Papers (9)
LLaVA-CoT: Let Vision Language Models Reason Step-by-Step
ICCV 2025, 338 citations
Image Inpainting via Iteratively Decoupled Probabilistic Modeling
ICLR 2024, 17 citations
Re-Aligning Language to Visual Objects with an Agentic Workflow
ICLR 2025, 3 citations
CoT-lized Diffusion: Let's Reinforce T2I Generation Step-by-step
NeurIPS 2025, 3 citations
A Stitch in Time Saves Nine: Small VLM is a Precise Guidance for Accelerating Large VLMs
CVPR 2025, 0 citations
AvatarArtist: Open-Domain 4D Avatarization
CVPR 2025, 0 citations
UPME: An Unsupervised Peer Review Framework for Multimodal Large Language Model Evaluation
CVPR 2025, 0 citations
Advancing Textual Prompt Learning with Anchored Attributes
ICCV 2025, 0 citations
Foley-Flow: Coordinated Video-to-Audio Generation with Masked Audio-Visual Alignment and Dynamic Conditional Flows
CVPR 2025, 0 citations