Li Zhou
4
Papers
16
Total Citations
Papers (4)
Instruction-guided Multi-Granularity Segmentation and Captioning with Large Multimodal Model
AAAI 2025
7
citations
MambaVLT: Time-Evolving Multimodal State Space Model for Vision-Language Tracking
CVPR 2025
6
citations
INTER: Mitigating Hallucination in Large Vision-Language Models by Interaction Guidance Sampling
ICCV 2025arXiv
3
citations
Engage for All: Making Ordinary Image Descriptions Appealing Again!
ICCV 2025
0
citations