Zhengrong Yue
4
Papers
18
Total Citations
Papers (4)
VideoChat-R1.5: Visual Test-Time Scaling to Reinforce Multimodal Reasoning by Iterative Perception
NeurIPS 2025arXiv
13
citations
V-Stylist: Video Stylization via Collaboration and Reflection of MLLM Agents
CVPR 2025
5
citations
LVAgent: Long Video Understanding by Multi-Round Dynamical Collaboration of MLLM Agents
ICCV 2025
0
citations
Muses: 3D-Controllable Image Generation via Multi-Modal Agent Collaboration
AAAI 2025
0
citations