Qingpei Guo
12 Papers
11 Total Citations
Papers (12)
SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories
CVPR 2025
11
citations
DynFocus: Dynamic Cooperative Network Empowers LLMs with Video Understanding
CVPR 2025
0
citations
Engage for All: Making Ordinary Image Descriptions Appealing Again!
ICCV 2025
0
citations
Social Debiasing for Fair Multi-modal LLMs
ICCV 2025
0
citations
Unified Video Generation via Next-Set Prediction in Continuous Domain
ICCV 2025
0
citations
Attributive Reasoning for Hallucination Diagnosis of Large Language Models
AAAI 2025
0
citations
Pink: Unveiling the Power of Referential Comprehension for Multi-modal LLMs
CVPR 2024
0
citations
SyCoCa: Symmetrizing Contrastive Captioners with Attentive Masking for Multimodal Alignment
ICML 2024
0
citations
LPSNet: A Lightweight Solution for Fast Panoptic Segmentation
CVPR 2021
0
citations
CNVid-3.5M: Build, Filter, and Pre-Train the Large-Scale Public Chinese Video-Text Dataset
CVPR 2023
0
citations
Boundary-Aware Backward-Compatible Representation via Adversarial Learning in Image Retrieval
CVPR 2023, arXiv
0
citations
Switch-BERT: Learning to Model Multimodal Interactions by Switching Attention and Input
ECCV 2022
0
citations