Can Qin
6
Papers
192
Total Citations
Papers (6)
HIVE: Harnessing Human Feedback for Instructional Visual Editing
CVPR 2024
164
citations
HoliTom: Holistic Token Merging for Fast Video Large Language Models
NeurIPS 2025
16
citations
M3SOT: Multi-Frame, Multi-Field, Multi-Space 3D Single Object Tracking
AAAI 2024arXiv
12
citations
DyCoke: Dynamic Compression of Tokens for Fast Video Large Language Models
CVPR 2025
0
citations
Disentangled Pose and Appearance Guidance for Multi-Pose Generation
CVPR 2025
0
citations
Structured Policy Optimization: Enhance Large Vision-Language Model via Self-referenced Dialogue
ICCV 2025
0
citations