Zhi Gao
4
Papers
91
Total Citations
Papers (4)
CLOVA: A Closed-LOop Visual Assistant with Tool Usage and Update
CVPR 2024
45
citations
Multi-modal Agent Tuning: Building a VLM-Driven Agent for Efficient Tool Usage
ICLR 2025
37
citations
MMKE-Bench: A Multimodal Editing Benchmark for Diverse Visual Knowledge
ICLR 2025
8
citations
Beyond the Seen: Bounded Distribution Estimation for Open-Vocabulary Learning
NeurIPS 2025arXiv
1
citations