Zhi Gao

Papers: 4

Total Citations: 91

Papers (4)

CLOVA: A Closed-LOop Visual Assistant with Tool Usage and Update

Multi-modal Agent Tuning: Building a VLM-Driven Agent for Efficient Tool Usage

MMKE-Bench: A Multimodal Editing Benchmark for Diverse Visual Knowledge

Beyond the Seen: Bounded Distribution Estimation for Open-Vocabulary Learning

NeurIPS 2025, arXiv