Ziqiao Ma
5
Papers
148
Total Citations
Papers (5)
GROUNDHOG: Grounding Large Language Models to Holistic Segmentation
CVPR 2024
75
citations
Do Vision-Language Models Represent Space and How? Evaluating Spatial Frame of Reference under Ambiguities
ICLR 2025arXiv
40
citations
Inversion-Free Image Editing with Language-Guided Diffusion Models
CVPR 2024
32
citations
SimWorld: An Open-ended Simulator for Agents in Physical and Social Worlds
NeurIPS 2025
1
citations
VEGGIE: Instructional Editing and Reasoning Video Concepts with Grounded Generation
ICCV 2025
0
citations