Li Zhou
6 Papers
16 Total Citations
Papers (6)
Instruction-guided Multi-Granularity Segmentation and Captioning with Large Multimodal Model
AAAI 2025
7
citations
MambaVLT: Time-Evolving Multimodal State Space Model for Vision-Language Tracking
CVPR 2025
6
citations
INTER: Mitigating Hallucination in Large Vision-Language Models by Interaction Guidance Sampling
ICCV 2025 (arXiv)
3
citations
Engage for All: Making Ordinary Image Descriptions Appealing Again!
ICCV 2025
0
citations
Learning for Disparity Estimation Through Feature Constancy
CVPR 2018 (arXiv)
0
citations
Joint Visual Grounding and Tracking With Natural Language Specification
CVPR 2023 (arXiv)
0
citations