Qingpei Guo
Papers: 8
Total Citations: 11
Papers (8)
SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories
CVPR 2025, 11 citations
DynFocus: Dynamic Cooperative Network Empowers LLMs with Video Understanding
CVPR 2025, 0 citations
Engage for All: Making Ordinary Image Descriptions Appealing Again!
ICCV 2025, 0 citations
Social Debiasing for Fair Multi-modal LLMs
ICCV 2025, 0 citations
Unified Video Generation via Next-Set Prediction in Continuous Domain
ICCV 2025, 0 citations
Attributive Reasoning for Hallucination Diagnosis of Large Language Models
AAAI 2025, 0 citations
Pink: Unveiling the Power of Referential Comprehension for Multi-modal LLMs
CVPR 2024, 0 citations
SyCoCa: Symmetrizing Contrastive Captioners with Attentive Masking for Multimodal Alignment
ICML 2024, 0 citations