Pandeng Li
7
Papers
76
Total Citations
Papers (7)
Towards Balanced Alignment: Modal-Enhanced Semantic Modeling for Video Moment Retrieval
AAAI 2024arXiv
40
citations
UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language Interface
NeurIPS 2025arXiv
14
citations
Hybrid-Level Instruction Injection for Video Token Compression in Multi-modal Large Language Models
CVPR 2025
14
citations
AlignZeg: Mitigating Objective Misalignment for Zero-shot Semantic Segmentation
ECCV 2024
5
citations
CAPability: A Comprehensive Visual Caption Benchmark for Evaluating Both Correctness and Thoroughness
NeurIPS 2025
3
citations
FuseTeacher: Modality-fused Encoders are Strong Vision Supervisors
ECCV 2024
0
citations
CLIP-Adapted Region-to-Text Learning for Generative Open-Vocabulary Semantic Segmentation
ICCV 2025
0
citations