Pandeng Li
9
Papers
76
Total Citations
Papers (9)
Towards Balanced Alignment: Modal-Enhanced Semantic Modeling for Video Moment Retrieval
AAAI 2024arXiv
40
citations
Hybrid-Level Instruction Injection for Video Token Compression in Multi-modal Large Language Models
CVPR 2025
14
citations
UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language Interface
NeurIPS 2025arXiv
14
citations
AlignZeg: Mitigating Objective Misalignment for Zero-shot Semantic Segmentation
ECCV 2024
5
citations
CAPability: A Comprehensive Visual Caption Benchmark for Evaluating Both Correctness and Thoroughness
NeurIPS 2025
3
citations
CLIP-Adapted Region-to-Text Learning for Generative Open-Vocabulary Semantic Segmentation
ICCV 2025
0
citations
Progressive Spatio-Temporal Prototype Matching for Text-Video Retrieval
ICCV 2023
0
citations
Dual-Stream Knowledge-Preserving Hashing for Unsupervised Video Retrieval
ECCV 2022
0
citations
MomentDiff: Generative Video Moment Retrieval from Random to Real
NeurIPS 2023
0
citations